Unicode Utilities: Character Properties

Unmarked properties are from Unicode V16.0.0; the beta properties are from Unicode V17.0.0β. For more information, see Unicode Utilities Beta.



 一 
4E00
CJK UNIFIED IDEOGRAPH-4E00
Han Script
Non-Unihan Normative, Informative, Contributory, and (Provisional) UCD properties for U+4E00
Age: V1_1
Alphabetic: Yes
ASCII_Hex_Digit: No
Bidi_Class: Left_To_Right
Bidi_Control: No
Bidi_Mirrored: No
Bidi_Mirroring_Glyph: null
Bidi_Paired_Bracket: null
Bidi_Paired_Bracket_Type: None
Block: CJK_Unified_Ideographs
Canonical_Combining_Class: Not_Reordered
Case_Folding: <U+4E00>
Case_Ignorable: No
Cased: No
Changes_When_Casefolded: No
Changes_When_Casemapped: No
Changes_When_Lowercased: No
Changes_When_NFKC_Casefolded: No
Changes_When_Titlecased: No
Changes_When_Uppercased: No
Composition_Exclusion: No
Dash: No
Decomposition_Mapping: <U+4E00>
Decomposition_Type: None
Default_Ignorable_Code_Point: No
Deprecated: No
Diacritic: No
East_Asian_Width: Wide
Emoji: No
Emoji_Component: No
Emoji_Modifier: No
Emoji_Modifier_Base: No
Emoji_Presentation: No
Equivalent_Unified_Ideograph: null
Expands_On_NFC: No
Expands_On_NFD: No
Expands_On_NFKC: No
Expands_On_NFKD: No
Extended_Pictographic: No
Extender: No
FC_NFKC_Closure: <U+4E00>
Full_Composition_Exclusion: No
General_Category: Other_Letter
Grapheme_Base: Yes
Grapheme_Cluster_Break: Other
Grapheme_Extend: No
Grapheme_Link: No
Hangul_Syllable_Type: Not_Applicable
Hex_Digit: No
Hyphen: No
ID_Compat_Math_Continue: No
ID_Compat_Math_Start: No
ID_Continue: Yes
ID_Start: Yes
Ideographic: Yes
IDS_Binary_Operator: No
IDS_Trinary_Operator: No
IDS_Unary_Operator: No
Indic_Conjunct_Break: None
Indic_Positional_Category: NA
Indic_Syllabic_Category: Other
ISO_Comment: null
Jamo_Short_Name: null
Join_Control: No
Joining_Group: No_Joining_Group
Joining_Type: Non_Joining
(kEH_AltSeq): 17.0β: null
kEH_Cat: null
(kEH_Core): None
kEH_Desc: null
(kEH_Func): null
(kEH_FVal): null
kEH_HG: null
kEH_IFAO: null
kEH_JSesh: null
kEH_NoMirror: No
kEH_NoRotate: No
(kEH_UniK): null
Line_Break: Ideographic
Logical_Order_Exception: No
Lowercase: No
Lowercase_Mapping: <U+4E00>
Math: No
Modifier_Combining_Mark: No
Name: CJK UNIFIED IDEOGRAPH-4E00
Name_Alias: null
NFC_Quick_Check: Yes
NFD_Quick_Check: Yes
NFKC_Casefold: <U+4E00>
NFKC_Quick_Check: Yes
NFKC_Simple_Casefold: <U+4E00>
NFKD_Quick_Check: Yes
Noncharacter_Code_Point: No
Numeric_Type: Numeric
Numeric_Value: 1
Other_Alphabetic: No
Other_Default_Ignorable_Code_Point: No
Other_Grapheme_Extend: No
Other_ID_Continue: No
Other_ID_Start: No
Other_Lowercase: No
Other_Math: No
Other_Uppercase: No
Pattern_Syntax: No
Pattern_White_Space: No
Prepended_Concatenation_Mark: No
Quotation_Mark: No
Radical: No
Regional_Indicator: No
Script: Han
Script_Extensions: Han
Sentence_Break: OLetter
Sentence_Terminal: No
Simple_Case_Folding: <U+4E00>
Simple_Lowercase_Mapping: <U+4E00>
Simple_Titlecase_Mapping: <U+4E00>
Simple_Uppercase_Mapping: <U+4E00>
Soft_Dotted: No
Terminal_Punctuation: No
Titlecase_Mapping: <U+4E00>
Unicode_1_Name: null
Unified_Ideograph: Yes
Uppercase: No
Uppercase_Mapping: <U+4E00>
Variation_Selector: No
Vertical_Orientation: Upright
White_Space: No
Word_Break: Other
XID_Continue: Yes
XID_Start: Yes
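
A few of the values listed above can be cross-checked locally. The following is a minimal sketch using Python's standard `unicodedata` module, not the tool behind this page. It assumes the interpreter's bundled Unicode data agrees with the listing for U+4E00, which should hold because these values have been stable since Unicode 1.1. Python reports short aliases: `Lo` for Other_Letter, `L` for Left_To_Right, `W` for Wide, and `0` for Not_Reordered. The `summary` and `derived_name` names are illustrative only.

```python
import unicodedata

ch = "\u4e00"  # the character 一

# Names of CJK unified ideographs are derived by rule:
# a fixed prefix followed by the code point in uppercase hex.
derived_name = f"CJK UNIFIED IDEOGRAPH-{ord(ch):04X}"

# Map the long property names used in the listing to what unicodedata exposes.
summary = {
    "Name": unicodedata.name(ch),
    "General_Category": unicodedata.category(ch),
    "Bidi_Class": unicodedata.bidirectional(ch),
    "Canonical_Combining_Class": unicodedata.combining(ch),
    "East_Asian_Width": unicodedata.east_asian_width(ch),
    "Bidi_Mirrored": bool(unicodedata.mirrored(ch)),
    "Decomposition_Mapping": unicodedata.decomposition(ch),  # "" means none
    "Numeric_Value": unicodedata.numeric(ch),
}

for prop, value in summary.items():
    print(f"{prop}: {value!r}")

# ID_Start and XID_Start are Yes, so the character alone is a valid identifier.
print("isidentifier:", ch.isidentifier())
```

Properties such as Line_Break, Script, or Vertical_Orientation are not exposed by `unicodedata`; checking those would need a third-party library or the UCD data files themselves.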
Non-UCD properties for U+4E00
Basic_Emoji: No
Identifier_Status: Allowed
Identifier_Type: Recommended
IDNA2008_Category: Protocol_Valid
RGI_Emoji: No
RGI_Emoji_Flag_Sequence: No
RGI_Emoji_Keycap_Sequence: No
RGI_Emoji_Modifier_Sequence: No
RGI_Emoji_Qualification: None
RGI_Emoji_Tag_Sequence: No
RGI_Emoji_Zwj_Sequence: No
Other non-Unihan UCD data for U+4E00
CJK_Radical: 1
Do_Not_Emit_Preferred: null
Do_Not_Emit_Type: None
Emoji_DCM: null
Emoji_KDDI: null
Emoji_SB: null
emoji_variation_sequence: null
kNSHU_DubenSrc: null
kNSHU_Reading: null
kTGT_MergedSrc: null
kTGT_RSUnicode: null
Named_Sequences: null
Names_List_Alias: null
Names_List_Comment: null
Names_List_Cross_Ref: null
Names_List_Subheader: null
Names_List_Subheader_Notice: null
Non_Unihan_Numeric_Value: null
normalization_correction_corrected: null
normalization_correction_original: null
normalization_correction_version: null
Other_Joining_Type: Deduce_From_General_Category
Standardized_Variant: null
Unihan Normative, Informative, and (Provisional) properties for U+4E00
kAccountingNumeric: NaN
(kAlternateTotalStrokes): null
(kBigFive): A440
(kCangjie): M
(kCantonese): jat1
(kCCCII): 213021
(kCheungBauer): null
(kCheungBauerIndex): null
(kCihaiT): 1.101
(kCNS1986): 1-4421
(kCNS1992): 1-4421
kCompatibilityVariant: <U+4E00>
(kCowles): 5133
(kDaeJaweon): 0129.010
(kDefinition): one; a, an; alone
(kEACC): 213021
(kFanqie): 於悉
(kFenn): 1A
(kFennIndex): 216.01|217.06|218.01|220.06
(kFourCornerCode): 1000.0
(kGB0): 5027
(kGB1): 5027
(kGB3): null
(kGB5): null
(kGB7): 16.0: null
(kGB8): null
(kGradeLevel): 1
(kGSR): 0394a
(kHangul): 일:0E
(kHanYu): 10001.010
(kHanyuPinlu): yī(32747)
(kHanyuPinyin): 10001.010:yī
(kHDZRadBreak): ⼀[U+2F00]:10001.010
(kHKGlyph): 0001
(kIBMJapan): null
kIICore: AGTJHKMP
kIRG_GSource: G0-523B
kIRG_HSource: HB1-A440
kIRG_JSource: J0-306C
kIRG_KPSource: KP0-FCD6
kIRG_KSource: K0-6C69
kIRG_MSource: null
kIRG_SSource: null
kIRG_TSource: T1-4421
kIRG_UKSource: null
kIRG_USource: null
kIRG_VSource: V1-4A21
(kIRGDaeJaweon): 0129.010
(kIRGHanyuDaZidian): 10001.010
(kIRGKangXi): 0075.010
(kJa): 16.0: null
(kJapanese): イチ|イツ|ひと|ひとつ
(kJapaneseKun): HITOTSU|HITOTABI|HAJIME
(kJapaneseOn): ICHI|ITSU
(kJinmeiyoKanji): null
(kJis0): 1676
(kJis1): null
(kJIS0213): null
(kJoyoKanji): 2010
(kKangXi): 0075.010
(kKarlgren): 175
(kKorean): IL
(kKoreanEducationHanja): 2007
(kKoreanName): null
(kLau): 3341
(kMainlandTelegraph): 0001
kMandarin:
(kMatthews): 3016
(kMeyerWempe): 3837
(kMojiJoho): MJ006294
(kMorohashi): 00001
(kNelson): 0001
kOtherNumeric: NaN
(kPhonetic): 1499
kPrimaryNumeric: 1
(kPseudoGB1): null
(kRSAdobe_Japan1_6): C+1200+1.1.0
kRSUnicode: 1.0
(kSBGY): 468.40
(kSemanticVariant): U+5F0C<kLau,kMatthews,kMeyerWempe|U+58F9<kLau,kMatthews,kMeyerWempe
(kSimplifiedVariant): null
(kSMSZD2003Index): 1.01
(kSMSZD2003Readings): yī粵jat1
(kSpecializedSemanticVariant): U+58F9
(kSpoofingVariant): null
(kStrange): null
(kTaiwanTelegraph): 0001
(kTang): *qit|qit
(kTayNumeric): 17.0β: null
(kTGH): 2013:1
(kTGHZ2013): 430.150:yī
kTotalStrokes: 1
(kTraditionalVariant): null
kUnihanCore2020: GHJKMPT
(kVietnamese): nhất
(kVietnameseNumeric): null
(kXerox): 241:042
(kXHC1983): 1351.020:yī|1360.040:yí|1368.160:yì
(kZhuang): null
(kZhuangNumeric): null
(kZVariant): null
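
Several Unihan values above are structured strings rather than plain scalars. The sketch below shows one way to unpack two of them as they are displayed on this page: the kRSUnicode value `1.0` (radical number, then residual stroke count) and the multi-valued kXHC1983 field. The helper names `split_values`, `parse_rs_unicode`, and `parse_xhc1983` are hypothetical. The regular expression is an approximation of the kRSUnicode syntax, and the `|` separator reflects this page's display (the page also places an invisible U+2060 WORD JOINER after each `|`), not necessarily the separator used in the Unihan data files.

```python
import re

WORD_JOINER = "\u2060"  # invisible character that may follow each "|" on the page


def split_values(displayed: str) -> list[str]:
    """Split a multi-valued field as displayed, dropping word joiners."""
    cleaned = displayed.replace(WORD_JOINER, "")
    return [part for part in cleaned.split("|") if part]


def parse_rs_unicode(value: str) -> tuple[int, int]:
    """Return (radical number, residual strokes) from a value like '1.0'.

    Assumed syntax: radical 1-3 digits, optional apostrophes marking a
    simplified radical form, a dot, then an optionally negative stroke count.
    """
    match = re.fullmatch(r"([1-9][0-9]{0,2})('*)\.(-?[0-9]{1,2})", value)
    if match is None:
        raise ValueError(f"unrecognized kRSUnicode value: {value!r}")
    return int(match.group(1)), int(match.group(3))


def parse_xhc1983(displayed: str) -> list[tuple[str, str]]:
    """Return (location, reading) pairs from entries like '1351.020:yī'."""
    pairs = []
    for entry in split_values(displayed):
        location, _, reading = entry.partition(":")
        pairs.append((location, reading))
    return pairs


radical, residual = parse_rs_unicode("1.0")
readings = parse_xhc1983("1351.020:yī|\u20601360.040:yí|\u20601368.160:yì")

print(radical, residual)
print(readings)
```

For U+4E00 this gives radical 1 with 0 residual strokes, which is consistent with the kTotalStrokes value of 1: the character is the one-stroke radical itself.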
Other information on U+4E00
ANY: Yes
ASCII: No
bmp: Yes
Confusable_MA: <U+30FC>
exemplar: ja|yue|yue-Hans|zh|zh-Hant
exemplar_aux:
exemplar_punct:
HanType: Han
Idn_2008: na
Idn_Mapping: <U+4E00>
Idn_Status: valid
idna2003: valid
idna2008c: valid
isNFC: Yes
isNFD: Yes
isNFKC: Yes
isNFKD: Yes
isNFM: Yes
toIdna2003: null
toNFC: <U+4E00>
toNFD: <U+4E00>
toNFKC: <U+4E00>
toNFKD: <U+4E00>
toNFM: <U+4E00>
toUts46n: null
toUts46t: null
uca: null
uca2: null
uca2.5: null
uca3: null
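
The normalization and case entries above all say the same thing: U+4E00 maps to itself. A minimal sketch of how that could be confirmed with Python's standard library follows. It assumes Python 3.8 or later for `unicodedata.is_normalized`. The NFM, UTS #46, and UCA entries have no standard-library equivalent and are not covered, and the variable names are illustrative only.

```python
import unicodedata

ch = "\u4e00"  # the character 一
forms = ("NFC", "NFD", "NFKC", "NFKD")

# toNFC / toNFD / toNFKC / toNFKD: each should return the character unchanged.
normalized = {form: unicodedata.normalize(form, ch) for form in forms}

# isNFC / isNFD / isNFKC / isNFKD: each should be True.
is_normalized = {form: unicodedata.is_normalized(form, ch) for form in forms}

# Case mappings and case folding are identity mappings for an uncased letter.
case_results = {
    "lower": ch.lower(),
    "upper": ch.upper(),
    "title": ch.title(),
    "casefold": ch.casefold(),
}

# "bmp: Yes" and "ASCII: No" follow directly from the code point value.
in_bmp = ord(ch) <= 0xFFFF
is_ascii = ch.isascii()

print(normalized, is_normalized, case_results, in_bmp, is_ascii)
```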

The list includes both Unicode Character Properties and some additions (like idna2003 or subhead).