Unicode Utilities: Character Properties
Unmarked properties are from Unicode V16.0.0; the beta properties are from Unicode V17.0.0β. For more information, see Unicode Utilities Beta.
help | character
| properties
| confusables
| unicode-set
| compare-sets
| regex
| bnf-regex
| breaks
| transform
| bidi
| bidi-c
| idna
| languageid
|
2014 |
EM DASH |
Dash Punctuation |
confuse: , , , , , , , , , , |
Normative, Informative, Contributory, and (Provisional) UCD properties for U+2014 |
---|
|
Joining_Type | Non_Joining |
---|
(kEH_AltSeq) | | 17.0β: null |
---|
kEH_Cat | null |
---|
(kEH_Core) | None |
---|
kEH_Desc | null |
---|
(kEH_Func) | null |
---|
(kEH_FVal) | null |
---|
kEH_HG | null |
---|
kEH_IFAO | null |
---|
kEH_JSesh | null |
---|
kEH_NoMirror | No |
---|
kEH_NoRotate | No |
---|
(kEH_UniK) | null |
---|
Line_Break | Break_Both |
---|
Logical_Order_Exception | No |
---|
Lowercase | No |
---|
Lowercase_Mapping | — <U+2014 > |
---|
Math | No |
---|
Modifier_Combining_Mark | No |
---|
Name | EM DASH |
---|
Name_Alias | null |
---|
NFC_Quick_Check | Yes |
---|
NFD_Quick_Check | Yes |
---|
NFKC_Casefold | — <U+2014 > |
---|
NFKC_Quick_Check | Yes |
---|
NFKC_Simple_Casefold | — <U+2014 > |
---|
NFKD_Quick_Check | Yes |
---|
Noncharacter_Code_Point | No |
---|
Numeric_Type | None |
---|
Numeric_Value | NaN |
---|
Other_Alphabetic | No |
---|
Other_Default_Ignorable_Code_Point | No |
---|
Other_Grapheme_Extend | No |
---|
Other_ID_Continue | No |
---|
Other_ID_Start | No |
---|
Other_Lowercase | No |
---|
Other_Math | No |
---|
Other_Uppercase | No |
---|
Pattern_Syntax | Yes |
---|
Pattern_White_Space | No |
---|
Prepended_Concatenation_Mark | No |
---|
Quotation_Mark | No |
---|
Radical | No |
---|
Regional_Indicator | No |
---|
Script | Common |
---|
Script_Extensions | Common |
---|
Sentence_Break | SContinue |
---|
Sentence_Terminal | No |
---|
Simple_Case_Folding | — <U+2014 > |
---|
Simple_Lowercase_Mapping | — <U+2014 > |
---|
Simple_Titlecase_Mapping | — <U+2014 > |
---|
Simple_Uppercase_Mapping | — <U+2014 > |
---|
Soft_Dotted | No |
---|
Terminal_Punctuation | No |
---|
Titlecase_Mapping | — <U+2014 > |
---|
Unicode_1_Name | null |
---|
Unified_Ideograph | No |
---|
Uppercase | No |
---|
Uppercase_Mapping | — <U+2014 > |
---|
Variation_Selector | No |
---|
Vertical_Orientation | Rotated |
---|
White_Space | No |
---|
Word_Break | Other |
---|
XID_Continue | No |
---|
XID_Start | No |
---|
|
Non-UCD properties for U+2014 |
---|
|
RGI_Emoji_Flag_Sequence | No |
---|
RGI_Emoji_Keycap_Sequence | No |
---|
RGI_Emoji_Modifier_Sequence | No |
---|
RGI_Emoji_Qualification | None |
---|
RGI_Emoji_Tag_Sequence | No |
---|
RGI_Emoji_Zwj_Sequence | No |
---|
|
Other UCD data for U+2014 |
---|
|
Names_List_Alias | null |
---|
Names_List_Comment | may be used in pairs to offset parenthetical text |
---|
Names_List_Cross_Ref | ⸺ <U+2E3A >|ー <U+30FC > |
---|
Names_List_Subheader | Dashes |
---|
Names_List_Subheader_Notice | null |
---|
Non_Unihan_Numeric_Value | null |
---|
normalization_correction_corrected | null |
---|
normalization_correction_original | null |
---|
normalization_correction_version | null |
---|
Other_Joining_Type | Deduce_From_General_Category |
---|
Standardized_Variant | null |
---|
|
Other information on U+2014 |
---|
ANY | Yes |
---|
ASCII | No |
---|
bmp | Yes |
---|
Confusable_MA | ー <U+30FC > |
---|
exemplar | |
---|
exemplar_aux | |
---|
exemplar_punct | af|ar|as|ast|az|bg|bho|blo|bn|brx|bs|ca|ccp|ce|chr|ckb|csw|cv|cy|de|doi|dsb|dz|ee|el|en|eo|es|eu|fil|fr|fy|ga|gaa|gd|gl|gu|gv|he|hi|hi-Latn|hr|hsb|ia|id|ie|ii|is|it|ja|ka|kea|kgp|kk|kn|ko|kok|ks|ksh|ku|kxv|ky|lb|lij|lkt|lmo|lo|lt|lv|mai|mk|ml|mn|mni|mr|ms|my|nds|ne|nl|oc|om|or|os|pa|pcm|pl|prg|pt|qu|ro|ru|sa|sc|sd-Deva|si|so|sq|st|su|sv|syr|szl|ta|tg|th|tk|tn|to|tr|tt|uz|vai|vec|vi|vmw|xh|yi|yo|yrl|yue|yue-Hans|za|zh|zh-Hant |
---|
HanType | na |
---|
Idn_2008 | NV8 |
---|
Idn_Mapping | — <U+2014 > |
---|
Idn_Status | valid |
---|
idna2003 | valid |
---|
idna2008c | disallowed |
---|
|
|
The list includes both Unicode Character Properties and some additions (like idna2003 or subhead)