26.08.2013 Views

Representing Myanmar in Unicode - Evertype

Representing Myanmar in Unicode - Evertype

Representing Myanmar in Unicode - Evertype

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Tones<br />

ါ့ ါး<br />

1037 1038<br />

F<strong>in</strong>al Consonants<br />

F<strong>in</strong>al consonants are those that are marked as hav<strong>in</strong>g their <strong>in</strong>herent vowel killed. That is they are consonants<br />

that are either followed by a U+103A MYANMAR LETTER ASAT ါ် or they are <strong>in</strong> a stack<strong>in</strong>g relationship with a<br />

follow<strong>in</strong>g subjo<strong>in</strong>ed full consonant, <strong>in</strong> which case they are followed by U+1039 MYANMAR LETTER VIRAMA.<br />

The k<strong>in</strong>zi character U+1004 MYANMAR LETTER NGA U+103A MYANMER LETTER ASAT U+1039 MYANMAR<br />

LETTER VIRAMA ါင is stored before the base character it occurs over and is treated as a f<strong>in</strong>al consonant of the<br />

previous syllable to that base character.<br />

Note that the f<strong>in</strong>al ဥ် is encoded U+1009 MYANMAR LETTER NYA U+1039 MYANMAR SIGN ASAT and not us<strong>in</strong>g<br />

U+1025 MYANMAR LETTER U.<br />

Symbols<br />

The charts show various symbols, how they are encoded and their correspond<strong>in</strong>g sort equivalent sequences.<br />

ဿ ၌ ၍ ၎င်း ၏ ါံ<br />

103F 104C 104D 104E 1004 103A 1038 104F 1036<br />

သ္ နှ ိက် နရွ လည်းနကာင်း အိ မ်<br />

101E 1039 101E 1014 103E 102D<br />

102F 1000 103A<br />

101B 103D<br />

1031<br />

Note that U+1036 acts as a f<strong>in</strong>al consonant for sort<strong>in</strong>g purposes.<br />

101C 100A 103A 1038 1000<br />

1031 102C 1004 103A 1038<br />

1021<br />

102D<br />

1019<br />

103A<br />

Sequences<br />

There are a few words <strong>in</strong>volv<strong>in</strong>g contractions which ideally sort differently from how they are spelled. A<br />

complete list is not <strong>in</strong>cluded here and processes may sort such words us<strong>in</strong>g default character sort<strong>in</strong>g as<br />

though they were not special.<br />

နါာက်ျ န် ု ပ် လက်ျ သ္ ဦ ထ္င်း လ္က်<br />

1031 102C 1000<br />

103A 103B<br />

1014 103A 102F<br />

1015 103A<br />

101C 1000 103A<br />

103B<br />

101E 1039<br />

1019 102E<br />

1011 1039 1019<br />

1004 103A 1038<br />

101C 1039 1018<br />

1000 103A<br />

နါာက်ကျ န်နု ပ် လက်ယာ သမဦ ထမင်း လက်ဘက်<br />

1031 102C 1000<br />

103A 1000 103B<br />

Punctuation<br />

၊ ။<br />

104A 104B<br />

Render<strong>in</strong>g<br />

1014 103A 1014<br />

102F 1015 103A<br />

101C 1000 103A<br />

101A 102C<br />

101E 1019<br />

102E<br />

1011 1019 1004<br />

103A 1038<br />

101C 1000 103A<br />

1018 1000 103A<br />

Subjo<strong>in</strong>ed Consonants<br />

Not all consonants have a correspond<strong>in</strong>g subjo<strong>in</strong>ed form. In some cases the correspond<strong>in</strong>g medial character<br />

is used s<strong>in</strong>ce a subjo<strong>in</strong>ed consonant <strong>in</strong>dicates a new syllable.<br />

<strong>Represent<strong>in</strong>g</strong> <strong>Myanmar</strong> <strong>in</strong> <strong>Unicode</strong> Page 14 of 37 Version: 433

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!