Abstract

Spelling, morphology, syntax and semantics are the important areas of Natural Language (NL) sentence analysis. Syntax checking of a sentence is broadly referred as a ‘grammar checking’, however it also involves morphological analysis hence technically it is a multidimensional problem. Syntax of a natural language defines permissible sentence structures and constraints on constituents such as their order and unification constraints. It is a purely theoretical aspect and considered as computationally trivial rule enforcement problem. Rule formulation needs expert labour work and is costly and time consuming affair. Modern data driven language engineering approach advocates use of minimal knowledge base (linguistic information) and relies on knowledge extraction from tagged data. It is difficult to find such tagged data for non-English natural languages like Marathi (Indian Language). Considering these facts for grammar checking problem, we have come up with intuitional heuristic method for Marathi grammar checking which uses basic syntactic cues and minimal lexical information. We have modeled this heuristic method scientifically using basic matrix comparison operation. Our approach relies on syntactic cues like word ending, verb ending. We have tested our method on handcrafted Marathi sentences catering different Marathi sentence structures (one hundred and fifty three). The performance is measured using precision and recall metrics. The system has yielded 83% precision and 93% recall on sample data. This approach can be exploited for well structured text documents typically in the closed domains like legal, official, educational etc.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call