help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | idna | languageid
Category | Datatype | Source | Property | Values |
---|---|---|---|---|
Bidirectional | Binary | UCD | Bidi_Control | No (N), Yes (Y) |
Bidi_Mirrored | No (N), Yes (Y) | |||
Enumerated | Bidi_Class | Show Values | ||
Bidi_Paired_Bracket_Type | Close, None, Open | |||
String | Bidi_Mirroring_Glyph | Show Values | ||
Case | Binary | UCD | Case_Ignorable | No (N), Yes (Y) |
Cased | No (N), Yes (Y) | |||
Changes_When_Casefolded | No (N), Yes (Y) | |||
Changes_When_Casemapped | No (N), Yes (Y) | |||
Changes_When_Lowercased | No (N), Yes (Y) | |||
Changes_When_Titlecased | No (N), Yes (Y) | |||
Changes_When_Uppercased | No (N), Yes (Y) | |||
Lowercase | No (N), Yes (Y) | |||
Soft_Dotted | No (N), Yes (Y) | |||
Uppercase | No (N), Yes (Y) | |||
Unicode | isCased | No (N), Yes (Y) | ||
isCasefolded | No (N), Yes (Y) | |||
isLowercase | No (N), Yes (Y) | |||
isTitlecase | No (N), Yes (Y) | |||
isUppercase | No (N), Yes (Y) | |||
X-ICU | Case_Sensitive | No (N), Yes (Y) | ||
String | UCD | Case_Folding | Show Values | |
Lowercase_Mapping | Show Values | |||
Simple_Case_Folding | Show Values | |||
Simple_Lowercase_Mapping | Show Values | |||
Simple_Titlecase_Mapping | Show Values | |||
Simple_Uppercase_Mapping | Show Values | |||
Titlecase_Mapping | Show Values | |||
Uppercase_Mapping | Show Values | |||
Unicode | toCasefold | Show Values | ||
toLowercase | Show Values | |||
toTitlecase | Show Values | |||
toUppercase | Show Values | |||
CJK | Binary | UCD | IDS_Binary_Operator | No (N), Yes (Y) |
IDS_Trinary_Operator | No (N), Yes (Y) | |||
Ideographic | No (N), Yes (Y) | |||
Radical | No (N), Yes (Y) | |||
Unified_Ideograph | No (N), Yes (Y) | |||
Enumerated | X-Demo | HanType | Han, Hans, Hant, na | |
General | Binary | UCD | Alphabetic | No (N), Yes (Y) |
Default_Ignorable_Code_Point | No (N), Yes (Y) | |||
Deprecated | No (N), Yes (Y) | |||
Logical_Order_Exception | No (N), Yes (Y) | |||
Noncharacter_Code_Point | No (N), Yes (Y) | |||
Variation_Selector | No (N), Yes (Y) | |||
White_Space | No (N), Yes (Y) | |||
Catalog | Age | Show Values | ||
Block | Show Values | |||
Script | Show Values | |||
Enumerated | General_Category | Show Values | ||
Hangul_Syllable_Type | Leading_Jamo (L), LV_Syllable (LV), LVT_Syllable (LVT), Not_Applicable (NA), Trailing_Jamo (T), Vowel_Jamo (V) | |||
String | Name | Show Values | ||
Script_Extensions | Show Values | |||
subhead | Show Values | |||
Identifiers | Binary | UCD | ID_Continue | No (N), Yes (Y) |
ID_Start | No (N), Yes (Y) | |||
Pattern_Syntax | No (N), Yes (Y) | |||
Pattern_White_Space | No (N), Yes (Y) | |||
XID_Continue | No (N), Yes (Y) | |||
XID_Start | No (N), Yes (Y) | |||
Miscellaneous | Binary | UCD | Dash | No (N), Yes (Y) |
Diacritic | No (N), Yes (Y) | |||
Extender | No (N), Yes (Y) | |||
Grapheme_Base | No (N), Yes (Y) | |||
Grapheme_Extend | No (N), Yes (Y) | |||
Grapheme_Link | No (N), Yes (Y) | |||
Hyphen | No (N), Yes (Y) | |||
Math | No (N), Yes (Y) | |||
Quotation_Mark | No (N), Yes (Y) | |||
STerm | No (N), Yes (Y) | |||
Terminal_Punctuation | No (N), Yes (Y) | |||
Miscellaneous | ISO_Comment | Show Values | ||
Unicode_1_Name | Show Values | |||
Normalization | Binary | UCD | Changes_When_NFKC_Casefolded | No (N), Yes (Y) |
Full_Composition_Exclusion | No (N), Yes (Y) | |||
Unicode | isNFC | No, Yes | ||
isNFD | No, Yes | |||
isNFKC | No, Yes | |||
isNFKD | No, Yes | |||
X-ICU | NFC_Inert | No (N), Yes (Y) | ||
NFD_Inert | No (N), Yes (Y) | |||
NFKC_Inert | No (N), Yes (Y) | |||
NFKD_Inert | No (N), Yes (Y) | |||
Enumerated | UCD | Canonical_Combining_Class | Show Values | |
Decomposition_Type | Show Values | |||
NFC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFD_Quick_Check | No (N), Yes (Y) | |||
NFKC_Quick_Check | Maybe (M), No (N), Yes (Y) | |||
NFKD_Quick_Check | No (N), Yes (Y) | |||
X-ICU | Lead_Canonical_Combining_Class | Show Values | ||
Trail_Canonical_Combining_Class | Show Values | |||
String | UCD | NFKC_Casefold | Show Values | |
Unicode | toNfc | Show Values | ||
toNfd | Show Values | |||
toNfkc | Show Values | |||
toNfkd | Show Values | |||
Numeric | Binary | UCD | ASCII_Hex_Digit | No (N), Yes (Y) |
Hex_Digit | No (N), Yes (Y) | |||
Enumerated | Numeric_Type | Decimal (De), Digit (Di), None (None), Numeric (Nu) | ||
Numeric | Numeric_Value | Show Values | ||
Shaping and Rendering | Binary | UCD | Join_Control | No (N), Yes (Y) |
X-ICU | Segment_Starter | No (N), Yes (Y) | ||
Enumerated | UCD | East_Asian_Width | Ambiguous (A), Fullwidth (F), Halfwidth (H), Narrow (Na), Neutral (N), Wide (W) | |
Grapheme_Cluster_Break | Show Values | |||
Joining_Group | Show Values | |||
Joining_Type | Dual_Joining (D), Join_Causing (C), Left_Joining (L), Non_Joining (U), Right_Joining (R), Transparent (T) | |||
Line_Break | Show Values | |||
Sentence_Break | Show Values | |||
Word_Break | Show Values | |||
X-Emoji | Enumerated | UTR | emoji | face, flag, group, keycap, modifier, no, other, primary, secondary |
X-IDNA | Binary | UTS | idna2008 | CONTEXTJ, CONTEXTO, DISALLOWED, PVALID, UNASSIGNED |
Enumerated | idna2003 | deviation, disallowed, ignored, mapped, valid | ||
idna2008c | deviation, disallowed, ignored, mapped, valid | |||
uts46 | deviation, disallowed, ignored, mapped, valid | |||
String | toIdna2003 | Show Values | ||
toUts46n | Show Values | |||
toUts46t | Show Values | |||
X-Regex | Binary | UTS | ANY | No, Yes |
ASCII | No, Yes | |||
alnum | No (N), Yes (Y) | |||
blank | No (N), Yes (Y) | |||
bmp | No, Yes | |||
graph | No (N), Yes (Y) | |||
No (N), Yes (Y) | ||||
xdigit | No (N), Yes (Y) | |||
X-Security | Enumerated | UTS | Identifier_Status | Allowed (Allowed), Restricted (Restricted) |
Identifier_Type | Show Values | |||
confusable | Show Values | |||
X-UCA | Binary | UTS | uca | Show Values |
uca2 | Show Values | |||
uca2.5 | Show Values | |||
uca3 | Show Values |
The Categories are from UCD Table 8. Property Summary Table, with some extended categories: X-Encoding, X-IDNA, X-Regex, and X-Security.
The Datatypes are from UCD Table 5. Property Type Key.
The Sources are:
Fonts and Display. If you don't have a good set of Unicode fonts (and modern browser), you may not be able to read some of the characters. Some suggested fonts that you can add for coverage are: Unicode Fonts for Ancient Scripts, Noto Fonts site, Large, multi-script Unicode fonts. See also: Unicode Display Problems.
Version 3.7; ICU version: 56.0.1.0; Unicode version: 8.0.0.0