Abstract

Old Turkic language is the basis of all modern Turkic languages. Its study is very important for Turkic peoples who possess modern Turkic languages. This is important both from a historical point of view and for the study of modern issues of neural machine translation, issues of the linguistic distance of modern Turkic languages from their progenitor. This paper proposes the development of a computational model of the morphology of Old Turkic language based on the CSE (Complete Set of Endings) – model of morphology and a study on this basis of the issue of morphological segmentation of the texts of Old Turkic language, which will subsequently be used for neural machine translation of Old Turkic language into modern Turkic languages. Since most of the modern Turkic languages, except for the Turkish language, belong to low-resource languages, the issues of developing computational models of morphology, developing models, algorithms and software for processing Turkic languages are relevant.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call