Unicode Utilities: Character Properties

Unmarked properties are from Unicode V16.0.0; the beta properties are from Unicode V17.0.0β. For more information, see Unicode Utilities Beta.



· U+00B7 MIDDLE DOT (Other Punctuation)
Normative, Informative, Contributory, and (Provisional) UCD properties for U+00B7
Age: V1_1
Alphabetic: No
ASCII_Hex_Digit: No
Bidi_Class: Other_Neutral
Bidi_Control: No
Bidi_Mirrored: No
Bidi_Mirroring_Glyph: null
Bidi_Paired_Bracket: null
Bidi_Paired_Bracket_Type: None
Block: Latin_1_Supplement
Canonical_Combining_Class: Not_Reordered
Case_Folding: · <U+00B7>
Case_Ignorable: Yes
Cased: No
Changes_When_Casefolded: No
Changes_When_Casemapped: No
Changes_When_Lowercased: No
Changes_When_NFKC_Casefolded: No
Changes_When_Titlecased: No
Changes_When_Uppercased: No
Composition_Exclusion: No
Dash: No
Decomposition_Mapping: · <U+00B7>
Decomposition_Type: None
Default_Ignorable_Code_Point: No
Deprecated: No
Diacritic: Yes
East_Asian_Width: Ambiguous
Emoji: No
Emoji_Component: No
Emoji_Modifier: No
Emoji_Modifier_Base: No
Emoji_Presentation: No
Equivalent_Unified_Ideograph: null
Expands_On_NFC: No
Expands_On_NFD: No
Expands_On_NFKC: No
Expands_On_NFKD: No
Extended_Pictographic: No
Extender: Yes
FC_NFKC_Closure: · <U+00B7>
Full_Composition_Exclusion: No
General_Category: Other_Punctuation
Grapheme_Base: Yes
Grapheme_Cluster_Break: Other
Grapheme_Extend: No
Grapheme_Link: No
Hangul_Syllable_Type: Not_Applicable
Hex_Digit: No
Hyphen: No
ID_Compat_Math_Continue: No
ID_Compat_Math_Start: No
ID_Continue: Yes
ID_Start: No
Ideographic: No
IDS_Binary_Operator: No
IDS_Trinary_Operator: No
IDS_Unary_Operator: No
Indic_Conjunct_Break: None
Indic_Positional_Category: NA
Indic_Syllabic_Category: Other
ISO_Comment: null
Jamo_Short_Name: null
Join_Control: No
Joining_Group: No_Joining_Group
Joining_Type: Non_Joining
(kEH_AltSeq): 17.0β: null
kEH_Cat: null
(kEH_Core): None
kEH_Desc: null
(kEH_Func): null
(kEH_FVal): null
kEH_HG: null
kEH_IFAO: null
kEH_JSesh: null
kEH_NoMirror: No
kEH_NoRotate: No
(kEH_UniK): null
Line_Break: Ambiguous
Logical_Order_Exception: No
Lowercase: No
Lowercase_Mapping: · <U+00B7>
Math: No
Modifier_Combining_Mark: No
Name: MIDDLE DOT
Name_Alias: null
NFC_Quick_Check: Yes
NFD_Quick_Check: Yes
NFKC_Casefold: · <U+00B7>
NFKC_Quick_Check: Yes
NFKC_Simple_Casefold: · <U+00B7>
NFKD_Quick_Check: Yes
Noncharacter_Code_Point: No
Numeric_Type: None
Numeric_Value: NaN
Other_Alphabetic: No
Other_Default_Ignorable_Code_Point: No
Other_Grapheme_Extend: No
Other_ID_Continue: Yes
Other_ID_Start: No
Other_Lowercase: No
Other_Math: No
Other_Uppercase: No
Pattern_Syntax: No
Pattern_White_Space: No
Prepended_Concatenation_Mark: No
Quotation_Mark: No
Radical: No
Regional_Indicator: No
Script: Common
Script_Extensions: Avestan|Carian|Coptic|Duployan|Elbasan|Georgian|Glagolitic|Gunjala_Gondi|Gothic|Greek|Han|Latin|Lydian|Mahajani|Old_Permic|Shavian
Sentence_Break: Other
Sentence_Terminal: No
Simple_Case_Folding: · <U+00B7>
Simple_Lowercase_Mapping: · <U+00B7>
Simple_Titlecase_Mapping: · <U+00B7>
Simple_Uppercase_Mapping: · <U+00B7>
Soft_Dotted: No
Terminal_Punctuation: No
Titlecase_Mapping: · <U+00B7>
Unicode_1_Name: null
Unified_Ideograph: No
Uppercase: No
Uppercase_Mapping: · <U+00B7>
Variation_Selector: No
Vertical_Orientation: Rotated
White_Space: No
Word_Break: MidLetter
XID_Continue: Yes
XID_Start: No
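
A few of the UCD properties listed above can be cross-checked locally. The following is a minimal sketch using Python's standard `unicodedata` module; the helper name `describe` is hypothetical, and the module reflects whichever Unicode version ships with the installed Python, which may be older than the version this page reports. Python returns short property value aliases (for example `Po` for Other_Punctuation and `ON` for Other_Neutral) and an empty string where a character has no decomposition, whereas this page shows the character mapping to itself.

```python
import unicodedata

MIDDLE_DOT = "\u00b7"  # U+00B7 MIDDLE DOT


def describe(ch):
    """Hypothetical helper: collect a few UCD properties for one character."""
    return {
        "Name": unicodedata.name(ch),
        "General_Category": unicodedata.category(ch),        # short alias
        "Bidi_Class": unicodedata.bidirectional(ch),         # short alias
        "Canonical_Combining_Class": unicodedata.combining(ch),
        "East_Asian_Width": unicodedata.east_asian_width(ch),
        "Bidi_Mirrored": bool(unicodedata.mirrored(ch)),
        # Empty string means "no decomposition mapping".
        "Decomposition_Mapping": unicodedata.decomposition(ch),
    }


props = describe(MIDDLE_DOT)
for key, value in props.items():
    print(f"{key}: {value!r}")

# XID_Continue=Yes and XID_Start=No: Python identifiers follow these
# properties, so the middle dot may continue an identifier but not start one.
continues_identifier = "a\u00b7b".isidentifier()
starts_identifier = "\u00b7ab".isidentifier()
print(continues_identifier, starts_identifier)
```

Properties outside this small set (Script_Extensions, Line_Break, Word_Break, and so on) are not exposed by `unicodedata`; a fuller check would need the UCD data files or a third-party library.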
Non-UCD properties for U+00B7
Basic_Emoji: No
Identifier_Status: Allowed
Identifier_Type: Inclusion
IDNA2008_Category: Contextual_Rule_Required_Other
RGI_Emoji: No
RGI_Emoji_Flag_Sequence: No
RGI_Emoji_Keycap_Sequence: No
RGI_Emoji_Modifier_Sequence: No
RGI_Emoji_Qualification: None
RGI_Emoji_Tag_Sequence: No
RGI_Emoji_Zwj_Sequence: No
Other UCD data for U+00B7
CJK_Radical: null
Do_Not_Emit_Preferred: null
Do_Not_Emit_Type: None
Emoji_DCM: null
Emoji_KDDI: null
Emoji_SB: null
emoji_variation_sequence: null
kNSHU_DubenSrc: null
kNSHU_Reading: null
kTGT_MergedSrc: null
kTGT_RSUnicode: null
Named_Sequences: null
Names_List_Alias: midpoint (in typography)|Georgian comma|Greek middle dot (ano teleia)
Names_List_Comment: also used as a raised decimal point or to denote multiplication; for multiplication 22C5 is preferred|used as a vowel length mark (part of words) in many Amerindian orthographies
Names_List_Cross_Ref: . <U+002E>|˙ <U+02D9>|· <U+0387>|• <U+2022>|․ <U+2024>|‧ <U+2027>|∙ <U+2219>|⋅ <U+22C5>|⸱ <U+2E31>|⸳ <U+2E33>|・ <U+30FB>|ꞏ <U+A78F>
Names_List_Subheader: Latin-1 punctuation and symbols
Names_List_Subheader_Notice: Based on ISO/IEC 8859-1 (aka Latin-1) from here.
Non_Unihan_Numeric_Value: null
normalization_correction_corrected: null
normalization_correction_original: null
normalization_correction_version: null
Other_Joining_Type: Deduce_From_General_Category
Standardized_Variant: null
Other information on U+00B7
ANY: Yes
ASCII: No
bmp: Yes
Confusable_MA: · <U+00B7>
exemplar: ca
exemplar_aux:
exemplar_punct: gd|ii|ko|sc|vec|yue|yue-Hans|zh|zh-Hant
HanType: na
Idn_2008: na
Idn_Mapping: · <U+00B7>
Idn_Status: valid
idna2003: valid
idna2008c: valid
isNFC: Yes
isNFD: Yes
isNFKC: Yes
isNFKD: Yes
isNFM: Yes
toIdna2003: null
toNFC: · <U+00B7>
toNFD: · <U+00B7>
toNFKC: · <U+00B7>
toNFKD: · <U+00B7>
toNFM: · <U+00B7>
toUts46n: null
toUts46t: null
uca: 091A
uca2: 05
uca2.5: 01
uca3: 05
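
The normalization entries above (isNFC through isNFKD, and toNFC through toNFKD) say that U+00B7 is unchanged by the four standard normalization forms. A minimal sketch of that check with Python's `unicodedata` module follows; it assumes Python 3.8 or later for `is_normalized`, and the variable names are illustrative only. The NFM form shown on this page has no counterpart in `unicodedata`, so it is not covered. As a contrast, the sketch also normalizes U+0387 GREEK ANO TELEIA, one of the cross-referenced characters, whose canonical decomposition is U+00B7.

```python
import unicodedata

MIDDLE_DOT = "\u00b7"   # U+00B7 MIDDLE DOT
ANO_TELEIA = "\u0387"   # U+0387 GREEK ANO TELEIA

FORMS = ("NFC", "NFD", "NFKC", "NFKD")

# toNFC / toNFD / toNFKC / toNFKD: normalize and keep the result per form.
normalized = {form: unicodedata.normalize(form, MIDDLE_DOT) for form in FORMS}

# isNFC / isNFD / isNFKC / isNFKD: is the character already in that form?
already_normalized = {
    form: unicodedata.is_normalized(form, MIDDLE_DOT) for form in FORMS
}

for form in FORMS:
    code_points = " ".join(f"U+{ord(c):04X}" for c in normalized[form])
    print(f"{form}: {code_points} (already normalized: {already_normalized[form]})")

# Contrast: U+0387 has a canonical singleton decomposition to U+00B7,
# so every normalization form replaces it with the middle dot.
ano_teleia_nfc = unicodedata.normalize("NFC", ANO_TELEIA)
print(f"NFC(U+0387) = U+{ord(ano_teleia_nfc):04X}")
```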

The list includes both Unicode Character Properties and some additions (like idna2003 or subhead).