Unicode Utilities: Character Properties

Unmarked properties are from Unicode V16.0.0; the beta properties are from Unicode V17.0.0β. For more information, see Unicode Utilities Beta.

 « 
00AB
LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
Initial Punctuation
confuse: none
Normative, Informative, Contributory, and (Provisional) UCD properties for U+00AB
Age: V1_1
Alphabetic: No
ASCII_Hex_Digit: No
Bidi_Class: Other_Neutral
Bidi_Control: No
Bidi_Mirrored: Yes
Bidi_Mirroring_Glyph: » <U+00BB>
Bidi_Paired_Bracket: null
Bidi_Paired_Bracket_Type: None
Block: Latin_1_Supplement
Canonical_Combining_Class: Not_Reordered
Case_Folding: « <U+00AB>
Case_Ignorable: No
Cased: No
Changes_When_Casefolded: No
Changes_When_Casemapped: No
Changes_When_Lowercased: No
Changes_When_NFKC_Casefolded: No
Changes_When_Titlecased: No
Changes_When_Uppercased: No
Composition_Exclusion: No
Dash: No
Decomposition_Mapping: « <U+00AB>
Decomposition_Type: None
Default_Ignorable_Code_Point: No
Deprecated: No
Diacritic: No
East_Asian_Width: Neutral
Emoji: No
Emoji_Component: No
Emoji_Modifier: No
Emoji_Modifier_Base: No
Emoji_Presentation: No
Equivalent_Unified_Ideograph: null
Expands_On_NFC: No
Expands_On_NFD: No
Expands_On_NFKC: No
Expands_On_NFKD: No
Extended_Pictographic: No
Extender: No
FC_NFKC_Closure: « <U+00AB>
Full_Composition_Exclusion: No
General_Category: Initial_Punctuation
Grapheme_Base: Yes
Grapheme_Cluster_Break: Other
Grapheme_Extend: No
Grapheme_Link: No
Hangul_Syllable_Type: Not_Applicable
Hex_Digit: No
Hyphen: No
ID_Compat_Math_Continue: No
ID_Compat_Math_Start: No
ID_Continue: No
ID_Start: No
Ideographic: No
IDS_Binary_Operator: No
IDS_Trinary_Operator: No
IDS_Unary_Operator: No
Indic_Conjunct_Break: None
Indic_Positional_Category: NA
Indic_Syllabic_Category: Other
ISO_Comment: null
Jamo_Short_Name: null
Join_Control: No
Joining_Group: No_Joining_Group
Joining_Type: Non_Joining
(kEH_AltSeq): 17.0β: null
kEH_Cat: null
(kEH_Core): None
kEH_Desc: null
(kEH_Func): null
(kEH_FVal): null
kEH_HG: null
kEH_IFAO: null
kEH_JSesh: null
kEH_NoMirror: No
kEH_NoRotate: No
(kEH_UniK): null
Line_Break: Quotation
Logical_Order_Exception: No
Lowercase: No
Lowercase_Mapping: « <U+00AB>
Math: No
Modifier_Combining_Mark: No
Name: LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
Name_Alias: null
NFC_Quick_Check: Yes
NFD_Quick_Check: Yes
NFKC_Casefold: « <U+00AB>
NFKC_Quick_Check: Yes
NFKC_Simple_Casefold: « <U+00AB>
NFKD_Quick_Check: Yes
Noncharacter_Code_Point: No
Numeric_Type: None
Numeric_Value: NaN
Other_Alphabetic: No
Other_Default_Ignorable_Code_Point: No
Other_Grapheme_Extend: No
Other_ID_Continue: No
Other_ID_Start: No
Other_Lowercase: No
Other_Math: No
Other_Uppercase: No
Pattern_Syntax: Yes
Pattern_White_Space: No
Prepended_Concatenation_Mark: No
Quotation_Mark: Yes
Radical: No
Regional_Indicator: No
Script: Common
Script_Extensions: Common
Sentence_Break: Close
Sentence_Terminal: No
Simple_Case_Folding: « <U+00AB>
Simple_Lowercase_Mapping: « <U+00AB>
Simple_Titlecase_Mapping: « <U+00AB>
Simple_Uppercase_Mapping: « <U+00AB>
Soft_Dotted: No
Terminal_Punctuation: No
Titlecase_Mapping: « <U+00AB>
Unicode_1_Name: LEFT POINTING GUILLEMET
Unified_Ideograph: No
Uppercase: No
Uppercase_Mapping: « <U+00AB>
Variation_Selector: No
Vertical_Orientation: Rotated
White_Space: No
Word_Break: Other
XID_Continue: No
XID_Start: No
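
A few of the UCD values listed for U+00AB can be cross-checked locally. The following is a minimal sketch using Python's standard `unicodedata` module, which exposes only a small subset of these properties and reports them with short aliases (for example `Pi` for Initial_Punctuation and `ON` for Other_Neutral). The helper name `basic_properties` is hypothetical, and the Unicode version behind `unicodedata` depends on the Python build, so it may differ from the version this listing is based on.

```python
import unicodedata


def basic_properties(ch: str) -> dict:
    """Collect the UCD properties that the standard library exposes for one character."""
    return {
        "Name": unicodedata.name(ch),
        "General_Category": unicodedata.category(ch),        # short alias, e.g. "Pi"
        "Bidi_Class": unicodedata.bidirectional(ch),         # short alias, e.g. "ON"
        "Bidi_Mirrored": bool(unicodedata.mirrored(ch)),
        "Canonical_Combining_Class": unicodedata.combining(ch),  # numeric; 0 is Not_Reordered
        "East_Asian_Width": unicodedata.east_asian_width(ch),    # short alias, e.g. "N"
        # Empty string when there is no decomposition; the listing instead
        # shows the character mapping to itself.
        "Decomposition_Mapping": unicodedata.decomposition(ch),
    }


props = basic_properties("\u00ab")
for key, value in props.items():
    print(f"{key}: {value!r}")
print("unicodedata is based on Unicode", unicodedata.unidata_version)
```

Properties such as Line_Break, Script, or Identifier_Status are not available from `unicodedata`; checking those would require the UCD data files or a third-party library.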
Non-UCD properties for U+00AB
Basic_Emoji: No
Identifier_Status: Restricted
Identifier_Type: Not_XID
IDNA2008_Category: Disallowed
RGI_Emoji: No
RGI_Emoji_Flag_Sequence: No
RGI_Emoji_Keycap_Sequence: No
RGI_Emoji_Modifier_Sequence: No
RGI_Emoji_Qualification: None
RGI_Emoji_Tag_Sequence: No
RGI_Emoji_Zwj_Sequence: No
Other UCD data for U+00AB
CJK_Radical: null
Do_Not_Emit_Preferred: null
Do_Not_Emit_Type: None
Emoji_DCM: null
Emoji_KDDI: null
Emoji_SB: null
emoji_variation_sequence: null
kNSHU_DubenSrc: null
kNSHU_Reading: null
kTGT_MergedSrc: null
kTGT_RSUnicode: null
Named_Sequences: null
Names_List_Alias: left guillemet|chevrons (in typography)
Names_List_Comment: usually opening, sometimes closing
Names_List_Cross_Ref: ≪ <U+226A>|《 <U+300A>
Names_List_Subheader: Latin-1 punctuation and symbols
Names_List_Subheader_Notice: Based on ISO/IEC 8859-1 (aka Latin-1) from here.
Non_Unihan_Numeric_Value: null
normalization_correction_corrected: null
normalization_correction_original: null
normalization_correction_version: null
Other_Joining_Type: Deduce_From_General_Category
Standardized_Variant: null
Other information on U+00AB
ANY: Yes
ASCII: No
bmp: Yes
Confusable_MA: « <U+00AB>
exemplar:
exemplar_aux:
exemplar_punct: am|ar|ast|be|blo|ca|ce|ckb|cv|de|dsb|el|es|fa|fr|gl|hsb|hu|hy|ie|it|jgo|ka|kea|kk|kkj|ky|lb|lij|lmo|lrc|mzn|nb|nds|nn|nnh|no|oc|os|pl|ro|ru|sc|sl|sq|syr|szl|tg|ug|uk|vec|yrl
HanType: na
Idn_2008: NV8
Idn_Mapping: « <U+00AB>
Idn_Status: valid
idna2003: valid
idna2008c: disallowed
isNFC: Yes
isNFD: Yes
isNFKC: Yes
isNFKD: Yes
isNFM: Yes
toIdna2003: null
toNFC: « <U+00AB>
toNFD: « <U+00AB>
toNFKC: « <U+00AB>
toNFKD: « <U+00AB>
toNFM: « <U+00AB>
toUts46n: null
toUts46t: null
uca: 0976
uca2: 05
uca2.5: 01
uca3: 05

The list includes both Unicode Character Properties and some additions (like idna2003 or subhead).
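
The normalization and case-mapping entries above (isNFC through isNFKD, toNFC through toNFKD, and the Lowercase, Uppercase, Titlecase, and Case_Folding mappings) all say that U+00AB maps to itself. As a rough sketch, this can be confirmed with Python's standard library; it assumes Python 3.8 or later for `unicodedata.is_normalized`, and it does not cover NFM, which the standard library does not provide. The variable names are illustrative only.

```python
import unicodedata

ch = "\u00ab"  # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
forms = ("NFC", "NFD", "NFKC", "NFKD")

# toNFC / toNFD / toNFKC / toNFKD: the result of normalizing the character.
normalized = {form: unicodedata.normalize(form, ch) for form in forms}

# isNFC / isNFD / isNFKC / isNFKD: whether the character is already in that form.
is_stable = {form: unicodedata.is_normalized(form, ch) for form in forms}

# Full case mappings and case folding as implemented by str methods.
case_maps = {
    "lower": ch.lower(),
    "upper": ch.upper(),
    "title": ch.title(),
    "casefold": ch.casefold(),
}

for form in forms:
    print(form, repr(normalized[form]), is_stable[form])
for kind, mapped in case_maps.items():
    print(kind, repr(mapped))
```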