26.08.2013 Views

Representing Myanmar in Unicode - Evertype


Mon

Language Tag

mnw-Mymr

Alphabet

Consonants

က ခ ဂ ဃ ၚ စ ဆ ဇ ၛ ဉ ည ဋ ဌ ဍ ဎ ဏ

1000 1001 1002 1003 105A 1005 1006 1007 105B 1009 100A 100B 100C 100D 100E 100F

တ ထ ဒ ဓ န ပ ဖ ဗ ဘ မ ယ ရ လ ဝ သ ဟ

1010 1011 1012 1013 1014 1015 1016 1017 1018 1019 101A 101B 101C 101D 101E 101F

ဠ အ ၜ ၝ

1020 1021 105C 105D

The nga letter in Mon is encoded U+105A ၚ and not U+1004 င as in Burmese. In isolation, these characters look very different. But when something occurs below the character, the Mon nga (U+105A) loses its tail. Thus a Mon kinzi is encoded using U+105A U+103A U+1039. In addition, the medial form of Mon nga is simply the tail: ◌္ၚ (U+1039 U+105A).
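As a minimal sketch of the rule above, the following Python builds the Mon kinzi and medial nga sequences from the code points quoted in the text. The helper names (`codepoints`, `find_burmese_nga`) are hypothetical, and treating every U+1004 in Mon text as suspicious is an assumption that only holds for text known to be purely Mon.

```python
MON_NGA = "\u105A"      # Mon nga, as given in the text
BURMESE_NGA = "\u1004"  # Burmese nga
ASAT = "\u103A"
VIRAMA = "\u1039"

# Sequences quoted in the text.
MON_KINZI = MON_NGA + ASAT + VIRAMA   # U+105A U+103A U+1039
MON_MEDIAL_NGA = VIRAMA + MON_NGA     # U+1039 U+105A


def codepoints(text):
    """Format a string as space-separated 'U+XXXX' code points."""
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


def find_burmese_nga(text):
    """Return the indexes of Burmese nga (U+1004) in text.

    Assumption: the text is meant to be purely Mon, so each hit is a
    candidate for replacement with U+105A.
    """
    return [i for i, ch in enumerate(text) if ch == BURMESE_NGA]


print(codepoints(MON_KINZI))
print(codepoints(MON_MEDIAL_NGA))
```

A check like `find_burmese_nga` could run as a lint step on text tagged `mnw-Mymr`; it reports positions only and does not rewrite anything.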

Mon has a character 'great nya' which is encoded ည U+100A U+1039 U+100A. This form is stylistic, however, and the same sequence may also be rendered ည္ U+100A U+1039 U+100A.

Independent Vowels<br />

အ အာ ဣ ဣီ ဥ ဥု ဨ ဩ ဪ

1021, 1021 102C, 1023, 1023 102E, 1025, 1025 102F, 1028, 1029, 102A

Dependent Vowels<br />

◌ာ ◌ိ ◌ီ ◌ု ◌ူ ◌ေ ◌ော ◌ို ◌ံ

102C, 102D, 102E, 102F, 1030, 1031, 1031 102C, 102D 102F, 1036
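The two vowel tables can be held as plain data. The sketch below is illustrative only: the grouping of code points into vowels is inferred from the glyph rows (nine glyphs for each code point row) and should be treated as an assumption, and the names `INDEPENDENT_VOWELS`, `DEPENDENT_VOWELS`, `to_hex`, and `flatten` are hypothetical.

```python
# Grouping inferred from the glyph rows; an assumption, not stated in the text.
INDEPENDENT_VOWELS = [
    "\u1021", "\u1021\u102C", "\u1023", "\u1023\u102E", "\u1025",
    "\u1025\u102F", "\u1028", "\u1029", "\u102A",
]
DEPENDENT_VOWELS = [
    "\u102C", "\u102D", "\u102E", "\u102F", "\u1030", "\u1031",
    "\u1031\u102C", "\u102D\u102F", "\u1036",
]


def to_hex(seq):
    """Format one vowel sequence as space-separated hex code points."""
    return " ".join(f"{ord(ch):04X}" for ch in seq)


def flatten(table):
    """Join a whole table back into a single code point row."""
    return " ".join(to_hex(seq) for seq in table)


for vowel in INDEPENDENT_VOWELS:
    print(vowel, to_hex(vowel))
```

Flattening each table reproduces the code point rows listed above, which is a quick way to confirm that the grouping has not dropped or reordered anything.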

Mon has a sequence U+102C U+1036 ◌ာံ and correspondingly U+102B U+1036, but here the dot is rendered over the previous consonant: ◌ါံ. For consistency, this is encoded with the dot after the vowel.

ကံာ်   U+1000 U+1036 U+102C U+103A

ကာံ   U+1000 U+102C U+1036

ဂါံ   U+1002 U+102B U+1036

The ordering of U+1036 U+102C U+103A follows Burmese encoding order and keeps consistency across the script.
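To make the ordering concrete, here is a small sketch that stores the three example sequences and reports where the dot (U+1036) sits relative to the vowel. The function name `dot_follows_vowel` is hypothetical, and the check only looks at adjacent code points; it is not a validator for Mon text.

```python
ANUSVARA = "\u1036"                # the dot
AA_VOWELS = {"\u102B", "\u102C"}   # tall aa and aa

# The three example sequences from the text, keyed by their code points.
EXAMPLES = {
    "U+1000 U+1036 U+102C U+103A": "\u1000\u1036\u102C\u103A",
    "U+1000 U+102C U+1036": "\u1000\u102C\u1036",
    "U+1002 U+102B U+1036": "\u1002\u102B\u1036",
}


def dot_follows_vowel(text):
    """True if an anusvara comes immediately after aa or tall aa."""
    return any(
        prev in AA_VOWELS and ch == ANUSVARA
        for prev, ch in zip(text, text[1:])
    )


for label, seq in EXAMPLES.items():
    print(label, dot_follows_vowel(seq))
```

Only the first example, which carries U+103A and follows the Burmese order, stores the dot before the vowel; the other two store it after.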

Contractions<br />

Mon has the concept of final character contractions. One of these is where ဟ် becomes ◌ှ် on the final character of the syllable. Thus one can have ◌ာှ်. The natural order for these would be U+102C U+103E

Representing Myanmar in Unicode Page 18 of 37 Version: 433
