Abstract

Morphological analysis is critical for natural language processing tasks, especially on agglutinative languages. This study gives the implementation details of a rule-based morphological analyzer for Kazakh, an agglutinative language. To build the analyzer, we worked out a detailed computational analysis of Kazakh morphology, including a formalization of its alternation and morphotactic rules. In the implementation, these rules are represented as two-level morphology rules and compiled with the Foma finite-state compiler. This is the first detailed computational analysis of Kazakh from a morphological point of view. A word can have more than one morphological parse, but only one of them is valid in a given sentence; a morphological disambiguator selects that parse from the candidates. We also present a transformation-based morphological disambiguator for Kazakh, which is a variation of the Brill tagger.

Highlights

  • Kazakh is a Turkic language that belongs to the Kipchak branch of the Ural-Altaic language family and is spoken by approximately 8 million people

  • We developed a morphological disambiguator for Kazakh to select the intended parse of each word

  • The morphological disambiguator for Kazakh described in this paper uses a transformation-based approach and is a variation of the Brill tagger [5]


Summary

Introduction

Kazakh is a Turkic language that belongs to the Kipchak branch of the Ural-Altaic language family and is spoken by approximately 8 million people. Surface-level realizations of its morphemes are governed by the vowel harmony of the root word. Finite-state tools such as Foma [11] can build a rule-based morphological analyzer for a natural language from two-level morphology rules that encode the alternation and morphotactic rules of that language. This paper gives a deep analysis of Kazakh morphology by building a rule-based morphological processor for the language. A lightweight analysis of the language has to deal with more errors in stage. For this reason, we preferred to build a rule-based morphological analyzer for Kazakh backed by a deep analysis of its morphology. We also developed a morphological disambiguator for Kazakh to select the intended parse of each word.
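The idea of selecting one parse per word with ordered transformation rules can be illustrated with a minimal Python sketch. It is not the disambiguator described in the paper: the rule template (rewrite a parse when the previous word carries a given parse), the tag names, and the placeholder words `w1` and `w2` are all assumptions made for illustration, and the rule-learning step of a Brill-style tagger is omitted. The sketch also assumes the analyzer returns at least one parse per word.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Rule:
    """Hypothetical rule template: change from_parse to to_parse
    when the previous word's chosen parse is prev_parse."""
    from_parse: str
    to_parse: str
    prev_parse: str


def disambiguate(sentence: List[Tuple[str, List[str]]],
                 rules: List[Rule]) -> List[str]:
    """Return one chosen parse per word of the sentence."""
    # Initial annotation: take the first candidate parse of every word.
    chosen = [parses[0] for _, parses in sentence]
    # Apply the rules in order; later rules see the effect of earlier ones.
    for rule in rules:
        for i in range(1, len(sentence)):
            candidates = sentence[i][1]
            if (chosen[i] == rule.from_parse
                    and rule.to_parse in candidates  # only pick a parse the analyzer produced
                    and chosen[i - 1] == rule.prev_parse):
                chosen[i] = rule.to_parse
    return chosen


# Placeholder input: (surface form, candidate parses from the analyzer).
sentence = [("w1", ["Adj"]), ("w2", ["Verb+Imp", "Noun+Nom"])]
rules = [Rule(from_parse="Verb+Imp", to_parse="Noun+Nom", prev_parse="Adj")]
result = disambiguate(sentence, rules)
print(result)
```

Restricting each rewrite to parses that the analyzer actually produced keeps the disambiguator from inventing an analysis; it can only choose among the existing candidates.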

Related Work
Vowel and Consonant Harmony
Morphological Disambiguation
Tests and Analysis
Findings
Conclusion