Unicode Utilities: UnicodeSet

Unmarked properties are from Unicode V16.0.0; the beta properties are from Unicode V17.0.0β. For more information, see Unicode Utilities Beta.

help | character | properties | confusables | unicode-set | compare-sets | regex | bnf-regex | breaks | transform | bidi | bidi-c | idna | languageid

Input
              

3,893 Code Points


[\u0000-\u0008\u000E-\u001F\u007F-\u0084\u0086-\u009F\u00AD\u061C\u180E\u200B\u200E\u200F\u202A-\u202E\u2060-\u2064\u2066-\u206F\uFEFF\uFFF9-\uFFFB\U00013430-\U0001343F\U0001BCA0-\U0001BCA3\U0001D173-\U0001D17A\U000E0001 \u0009 \u000B \u000C \u0085 \u2028 \u2029 \u2065 \uFFF0-\uFFF8 \U000E0000 \U000E0002-\U000E001F \U000E0080-\U000E00FF \U000E01F0-\U000E0FFF]


Unassigned, Private use, or Surrogates
items: 3,769

  U+2065no name
  U+FFF0no name
…{7}…
  U+FFF8no name
 󠀀 U+E0000no name
 󠀂 U+E0002no name
…{28}…
 󠀟 U+E001Fno name
 󠂀 U+E0080no name
…{126}…
 󠃿 U+E00FFno name
 󠇰 U+E01F0no name
…{3598}…
 󠿿 U+E0FFFno name

Basic LatinC0 controls
items: 30

  U+0000NULL|NUL
  U+0001START OF HEADING|SOH
  U+0002START OF TEXT|STX
  U+0003END OF TEXT|ETX
  U+0004END OF TRANSMISSION|EOT
  U+0005ENQUIRY|ENQ
  U+0006ACKNOWLEDGE|ACK
  U+0007ALERT|BEL
  U+0008BACKSPACE|BS
   U+0009CHARACTER TABULATION|HORIZONTAL TABULATION|HT|TAB
   U+000BLINE TABULATION|VERTICAL TABULATION|VT
   U+000CFORM FEED|FF
  U+000ESHIFT OUT|LOCKING-SHIFT ONE|SO
  U+000FSHIFT IN|LOCKING-SHIFT ZERO|SI
  U+0010DATA LINK ESCAPE|DLE
  U+0011DEVICE CONTROL ONE|DC1
  U+0012DEVICE CONTROL TWO|DC2
  U+0013DEVICE CONTROL THREE|DC3
  U+0014DEVICE CONTROL FOUR|DC4
  U+0015NEGATIVE ACKNOWLEDGE|NAK
  U+0016SYNCHRONOUS IDLE|SYN
  U+0017END OF TRANSMISSION BLOCK|ETB
  U+0018CANCEL|CAN
  U+0019END OF MEDIUM|EOM|EM
  U+001ASUBSTITUTE|SUB
  U+001BESCAPE|ESC
  U+001CINFORMATION SEPARATOR FOUR|FILE SEPARATOR|FS
  U+001DINFORMATION SEPARATOR THREE|GROUP SEPARATOR|GS
  U+001EINFORMATION SEPARATOR TWO|RECORD SEPARATOR|RS
  U+001FINFORMATION SEPARATOR ONE|UNIT SEPARATOR|US

Basic LatinControl character
items: 1

  U+007FDELETE|DEL

Latin 1 SupplementC1 controls
items: 32

 € U+0080PADDING CHARACTER|PAD
  U+0081HIGH OCTET PRESET|HOP
 ‚ U+0082BREAK PERMITTED HERE|BPH
 ƒ U+0083NO BREAK HERE|NBH
 „ U+0084INDEX|IND
 … U+0085NEXT LINE|NEL
 † U+0086START OF SELECTED AREA|SSA
 ‡ U+0087END OF SELECTED AREA|ESA
 ˆ U+0088CHARACTER TABULATION SET|HORIZONTAL TABULATION SET|HTS
 ‰ U+0089CHARACTER TABULATION WITH JUSTIFICATION|HORIZONTAL TABULATION WITH JUSTIFICATION|HTJ
 Š U+008ALINE TABULATION SET|VERTICAL TABULATION SET|VTS
 ‹ U+008BPARTIAL LINE FORWARD|PARTIAL LINE DOWN|PLD
 Œ U+008CPARTIAL LINE BACKWARD|PARTIAL LINE UP|PLU
  U+008DREVERSE LINE FEED|REVERSE INDEX|RI
 Ž U+008ESINGLE SHIFT TWO|SINGLE-SHIFT-2|SS2
  U+008FSINGLE SHIFT THREE|SINGLE-SHIFT-3|SS3
  U+0090DEVICE CONTROL STRING|DCS
 ‘ U+0091PRIVATE USE ONE|PRIVATE USE-1|PU1
 ’ U+0092PRIVATE USE TWO|PRIVATE USE-2|PU2
 “ U+0093SET TRANSMIT STATE|STS
 ” U+0094CANCEL CHARACTER|CCH
 • U+0095MESSAGE WAITING|MW
 – U+0096START OF GUARDED AREA|START OF PROTECTED AREA|SPA
 — U+0097END OF GUARDED AREA|END OF PROTECTED AREA|EPA
 ˜ U+0098START OF STRING|SOS
 ™ U+0099SINGLE GRAPHIC CHARACTER INTRODUCER|SGC
 š U+009ASINGLE CHARACTER INTRODUCER|SCI
 › U+009BCONTROL SEQUENCE INTRODUCER|CSI
 œ U+009CSTRING TERMINATOR|ST
  U+009DOPERATING SYSTEM COMMAND|OSC
 ž U+009EPRIVACY MESSAGE|PM
 Ÿ U+009FAPPLICATION PROGRAM COMMAND|APC

Latin 1 SupplementLatin-1 punctuation and symbols
items: 1

 ­ U+00ADSOFT HYPHEN

ArabicFormat character
items: 1

 ‎؜‎ U+061CARABIC LETTER MARK

MongolianFormat controls
items: 1

 ᠎ U+180EMONGOLIAN VOWEL SEPARATOR

General PunctuationFormat characters
items: 12

 ​ U+200BZERO WIDTH SPACE
 ‎ U+200ELEFT-TO-RIGHT MARK
 ‎‏‎ U+200FRIGHT-TO-LEFT MARK
 ‪ U+202ALEFT-TO-RIGHT EMBEDDING
 ‫ U+202BRIGHT-TO-LEFT EMBEDDING
 ‬ U+202CPOP DIRECTIONAL FORMATTING
 ‭ U+202DLEFT-TO-RIGHT OVERRIDE
 ‮ U+202ERIGHT-TO-LEFT OVERRIDE
 ⁦ U+2066LEFT-TO-RIGHT ISOLATE
 ⁧ U+2067RIGHT-TO-LEFT ISOLATE
 ⁨ U+2068FIRST STRONG ISOLATE
 ⁩ U+2069POP DIRECTIONAL ISOLATE

General PunctuationSeparators
items: 2

 
 U+2028LINE SEPARATOR
 
 U+2029PARAGRAPH SEPARATOR

General PunctuationFormat character
items: 1

 ⁠ U+2060WORD JOINER

General PunctuationInvisible operators
items: 4

 ⁡ U+2061FUNCTION APPLICATION
 ⁢ U+2062INVISIBLE TIMES
 ⁣ U+2063INVISIBLE SEPARATOR
 ⁤ U+2064INVISIBLE PLUS

General PunctuationDeprecated
items: 6

  U+206AINHIBIT SYMMETRIC SWAPPING
  U+206BACTIVATE SYMMETRIC SWAPPING
  U+206CINHIBIT ARABIC FORM SHAPING
  U+206DACTIVATE ARABIC FORM SHAPING
  U+206ENATIONAL DIGIT SHAPES
  U+206FNOMINAL DIGIT SHAPES

Arabic Presentation Forms BSpecial
items: 1

  U+FEFFZERO WIDTH NO-BREAK SPACE

SpecialsInterlinear annotation
items: 3

  U+FFF9INTERLINEAR ANNOTATION ANCHOR
  U+FFFAINTERLINEAR ANNOTATION SEPARATOR
  U+FFFBINTERLINEAR ANNOTATION TERMINATOR

Egyptian Hieroglyph Format ControlsJoiners
items: 2

 𓐰 U+13430EGYPTIAN HIEROGLYPH VERTICAL JOINER
 𓐱 U+13431EGYPTIAN HIEROGLYPH HORIZONTAL JOINER

Egyptian Hieroglyph Format ControlsSign insertion controls
items: 7

 𓐲 U+13432EGYPTIAN HIEROGLYPH INSERT AT TOP START
 𓐳 U+13433EGYPTIAN HIEROGLYPH INSERT AT BOTTOM START
 𓐴 U+13434EGYPTIAN HIEROGLYPH INSERT AT TOP END
 𓐵 U+13435EGYPTIAN HIEROGLYPH INSERT AT BOTTOM END
 𓐹 U+13439EGYPTIAN HIEROGLYPH INSERT AT MIDDLE
 𓐺 U+1343AEGYPTIAN HIEROGLYPH INSERT AT TOP
 𓐻 U+1343BEGYPTIAN HIEROGLYPH INSERT AT BOTTOM

Egyptian Hieroglyph Format ControlsSign stacking control
items: 1

 𓐶 U+13436EGYPTIAN HIEROGLYPH OVERLAY MIDDLE

Egyptian Hieroglyph Format ControlsSegment scoping delimiters
items: 2

 𓐷 U+13437EGYPTIAN HIEROGLYPH BEGIN SEGMENT
 𓐸 U+13438EGYPTIAN HIEROGLYPH END SEGMENT

Egyptian Hieroglyph Format ControlsEnclosure controls
items: 4

 𓐼 U+1343CEGYPTIAN HIEROGLYPH BEGIN ENCLOSURE
 𓐽 U+1343DEGYPTIAN HIEROGLYPH END ENCLOSURE
 𓐾 U+1343EEGYPTIAN HIEROGLYPH BEGIN WALLED ENCLOSURE
 𓐿 U+1343FEGYPTIAN HIEROGLYPH END WALLED ENCLOSURE

Shorthand Format ControlsShorthand format controls
items: 4

 𛲠 U+1BCA0SHORTHAND FORMAT LETTER OVERLAP
 𛲡 U+1BCA1SHORTHAND FORMAT CONTINUING OVERLAP
 𛲢 U+1BCA2SHORTHAND FORMAT DOWN STEP
 𛲣 U+1BCA3SHORTHAND FORMAT UP STEP

Musical SymbolsBeams and slurs
items: 8

 𝅳 U+1D173MUSICAL SYMBOL BEGIN BEAM
 𝅴 U+1D174MUSICAL SYMBOL END BEAM
 𝅵 U+1D175MUSICAL SYMBOL BEGIN TIE
 𝅶 U+1D176MUSICAL SYMBOL END TIE
 𝅷 U+1D177MUSICAL SYMBOL BEGIN SLUR
 𝅸 U+1D178MUSICAL SYMBOL END SLUR
 𝅹 U+1D179MUSICAL SYMBOL BEGIN PHRASE
 𝅺 U+1D17AMUSICAL SYMBOL END PHRASE

TagsTag identifiers
items: 1

 󠀁 U+E0001LANGUAGE TAG