Abstract

With the advent of social media, the amount of text available for processing across different natural languages has become enormous. In the past few decades, there has been tremendous increase in the number of language processing applications. The tools for natural language computing of various languages are very different because each language has its own set of grammatical rules. This paper focuses on identifying the basic inflectional principles of Tamil language at word level. Three levels of word inflection concepts are considered – Patterns, Rules and Exceptions. How grammatical principles for word inflections in Tamil can be grouped in these three levels and applied for obtaining different word forms is the focus of this paper. These can be made use of in a wide variety of natural language applications like morphological analysis, morphological generation, word level translation, spelling and grammar check, information extraction etc. The tools using these rules will account for faster operation and better implementation of Tamil grammatical rules referred from [?????????????? | tholgaappiyam] and [ ?????? | nannool] in NLP applications.

Highlights

  • The number of social media platforms for various purposes like blogging, micro-blogging, photo sharing, social networking etc., has risen steadily

  • Different tools need to be used for processing different natural languages because of the differences that exist in terms of the syntax and grammatical rules

  • A three level grouping of grammatical principles for word inflections in Tamil language is proposed for better grammatical analysis

Read more

Summary

INTRODUCTION

The number of social media platforms for various purposes like blogging, micro-blogging, photo sharing, social networking etc., has risen steadily. Inflection in grammar refers to the modification of a root word to convey various characteristics like tense, gender, number, person, case etc. [கிளிஞ்சல் தளொல் | kiLinjalgaLaal | sea shell ] is another inflected form of the same noun where the root word கிளிஞ்சல் is added with two suffixes – [ ள் | kaL |] and [ஆல் | aal |] to denote plural and instrumental case respectively. An Exception to a rule will handle special cases where the rule cannot be applied directly These three levels of grammatical forms discussed in this paper facilitate simple and efficient implementation of grammatical rules in basic NLP tools like morphological analyser, morphological generator, word level translator etc

LITERATURE REVIEW
DESCRIPTION OF INFLECTIONAL RULES IN TAMIL
PATTERNS
EXCEPTIONS
RESULTS AND DISCUSSION
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call