Jump to content

Template:Bidi Class (Unicode)

From Wikipedia, the free encyclopedia
Bidirectional character type (Bidi_Class Unicode character property)[1]
Type[2] Description Strength Directionality General scope Bidi_Control character[3]
L Left-to-Right Strong L-to-R Most alphabetic and syllabic characters, Chinese characters, non-European or non-Arabic digits, LRM character, ... U+200E LEFT-TO-RIGHT MARK (LRM)
R Right-to-Left Strong R-to-L Adlam, Garay, Hebrew, Mandaic, Mende Kikakui, N'Ko, Samaritan, ancient scripts like Kharoshthi and Nabataean, RLM character, ... U+200F RIGHT-TO-LEFT MARK (RLM)
AL Arabic Letter Strong R-to-L Arabic, Hanifi Rohingya, Sogdian, Syriac, and Thaana alphabets, and most punctuation specific to those scripts, ALM character, ... U+061C ARABIC LETTER MARK (ALM)
EN European Number Weak European digits, Eastern Arabic-Indic digits, Coptic epact numbers, ...
ES European Separator Weak plus sign, minus sign, ...
ET European Number Terminator Weak degree sign, currency symbols, ...
AN Arabic Number Weak Arabic-Indic digits, Arabic decimal and thousands separators, Rumi digits, Hanifi Rohingya digits, ...
CS Common Number Separator Weak colon, comma, full stop, no-break space, ...
NSM Nonspacing Mark Weak Characters in General Categories Mark, nonspacing, and Mark, enclosing (Mn, Me)
BN Boundary Neutral Weak Default ignorables, non-characters, control characters other than those explicitly given other types
B Paragraph Separator Neutral paragraph separator, appropriate Newline Functions, higher-level protocol paragraph determination
S Segment Separator Neutral Tabs
WS Whitespace Neutral space, figure space, line separator, form feed, General Punctuation block spaces (smaller set than the Unicode whitespace list)
ON Other Neutrals Neutral All other characters, including object replacement character
LRE Left-to-Right Embedding Explicit L-to-R LRE character only U+202A LEFT-TO-RIGHT EMBEDDING (LRE)
LRO Left-to-Right Override Explicit L-to-R LRO character only U+202D LEFT-TO-RIGHT OVERRIDE (LRO)
RLE Right-to-Left Embedding Explicit R-to-L RLE character only U+202B RIGHT-TO-LEFT EMBEDDING (RLE)
RLO Right-to-Left Override Explicit R-to-L RLO character only U+202E RIGHT-TO-LEFT OVERRIDE (RLO)
PDF Pop Directional Format Explicit PDF character only U+202C POP DIRECTIONAL FORMATTING (PDF)
LRI Left-to-Right Isolate Explicit L-to-R LRI character only U+2066 LEFT-TO-RIGHT ISOLATE (LRI)
RLI Right-to-Left Isolate Explicit R-to-L RLI character only U+2067 RIGHT-TO-LEFT ISOLATE (RLI)
FSI First Strong Isolate Explicit FSI character only U+2068 FIRST STRONG ISOLATE (FSI)
PDI Pop Directional Isolate Explicit PDI character only U+2069 POP DIRECTIONAL ISOLATE (PDI)
Notes
1.^ Unicode Bidirectional Algorithm (UAX#9), As of Unicode version 16.0
2.^ Possible Bidirectional character types for character property: Bidi_Class or 'type'
3.^ Bidi_Control characters: Twelve Bidi_Control formatting characters are defined. They are invisible, and have no effect apart from directionality. Nine of them have a unique, overruling BiDi-type that is used by the algorithm. Their type is also their acronym (e.g. character 'LRE' has BiDi type 'LRE').