Abstract

A part of speech (POS) tagging is one of the most well studied problem in the field of Natural Language Processing (NLP). A POS Tagger is the process of assigning correct tag like noun, adjective, verb, adverb etc to each word of the input sentence. Disambiguation rules and Tagset is vital parts of POS tagger. POS tagging is difficult for Marathi language due to unavailability of corpus for computational processing. In this paper, a POS Tagger for Marathi language using Rule based technique is presented. Our proposed system find root word using morphological analyzer and compare the root word with corpus to assign appropriate tag. If word has assigned more than one tags then by using grammar rules ambiguity is removed. Meaningful rules are provided to improve the performance of the system. General Terms Part of Speech, Marathi

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call