Abstract

The scheme is a unified coded character set for the Sibe, Manchu, Mongolian and Todo languages, in which the letters are arranged in Latin alphabetical order. Compared with the system based on nominal character encoding, this set does not need free variation selectors in Sibe and Manchu, and greatly reduces the use of the selectors in the other languages. It is thus more in line with general users' habits, and it improves information query, search, and transmission.

Highlights

  • Traditional Mongolian, Todo, Sibe and Manchu are similar in that they all have four different forms of letters: isolate, initial, medial and final forms

  • The coded character set given in the National Recommendation Standard GB/T 26226-2010 [1] of the China Mongolian Information Technology Working Group is based on the traditional Mongolian language

  • The selected character form is called the nominal character of the corresponding letter, and the character set is named the nominal character set [2, 3]
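The four positional forms named in the highlights can be illustrated with a minimal sketch. It assumes a word is simply a sequence of letters and that the form depends only on position within the word; the names `Form` and `positional_forms` are hypothetical and do not come from the paper or from GB/T 26226-2010. Real shaping also depends on neighbouring letters and, in the nominal system, on free variation selectors, which this sketch ignores.

```python
from enum import Enum


class Form(Enum):
    """The four positional forms a letter can take (names are illustrative)."""
    ISOLATE = "isolate"
    INITIAL = "initial"
    MEDIAL = "medial"
    FINAL = "final"


def positional_forms(word):
    """Return the positional form of each letter in `word`.

    Simplifying assumption: the form is decided by position alone.
    """
    n = len(word)
    if n == 0:
        return []
    if n == 1:
        # A letter standing alone takes the isolate form.
        return [Form.ISOLATE]
    # First letter is initial, last is final, everything between is medial.
    return [Form.INITIAL] + [Form.MEDIAL] * (n - 2) + [Form.FINAL]
```

For a three-letter word, `positional_forms("abc")` yields the initial, medial and final forms in that order. The Latin letters here are only placeholders for letters of the scripts discussed.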


Summary

Introduction

Traditional Mongolian, Todo, Sibe and Manchu are similar in that their letters all have four different forms: isolate, initial, medial and final. In Sibe and Manchu, the vowel letter I has the isolate form " ", and its initial, medial and final forms are the same as or similar to those in Mongolian, but this form is not recognized by the Mongolian nominal character. If the characters of traditional Mongolian, Todo, Sibe, Manchu and Aligali were placed in a unified sequence, ordered like the Latin letters, it would greatly benefit the information processing and application of these languages. To solve the above problems, this paper proposes a unified character coding scheme based on normative characters for the traditional Mongolian, Todo, Sibe and Manchu languages. The font marked 1 (using normative characters) shows no change in the shape of each character after spaces are inserted, while the characters in the subsequent three fonts

Definitions of Several Terms
Description of Definitions
Integration of Multilingual Normative Character Sets
Some Need to Be Clarified
About Aligali
Ordering of Characters in a Subset of the Normative Character Set
Order of Subsets in the Normative Character Set
Normative Character Set Encoding
Conclusion
