Abstract

This paper presents a detailed study of Tamil handwritten character recognition using locational and directional features combined with different zone and quad partitioning schemes. Tamil has 247 character classes and is widely spoken in India (Tamil Nadu), Malaysia, Singapore, Sri Lanka, and elsewhere. To handle this large character set and the variability of handwriting, a two-stage feature extraction process is used to represent each character's structure. In the first stage, the character image is divided into nine equal zones, and structural features are extracted from each zone with a directional algorithm that encodes the distinct shapes appearing in that zone. A classification test at this stage showed that structural artifacts of handwriting, such as unwanted loops and curves, cause misclassification. Locational features are therefore introduced to identify where structures lie. Each zone is further subdivided into four quads, and pixel presence in each quad is taken as a feature, which compensates for the unnecessary strokes and loops. The directional features from the upper zones (3 columns × 1 row) and the lower zones (3 columns × 1 row) are combined with their corresponding locational features to label a unique shape. Finally, to classify a character, the directional features from the middle zones (3 columns × 1 row) and their locational features are combined with the labeled shapes of the upper and lower zones. A suitable machine learning algorithm is chosen to classify the character classes. The approach was tested on the HP-Lab-India dataset and on two different sets of handwritten documents collected from people in Tamil Nadu, India. The experiments show a significant improvement in recognition accuracy, and the final results establish a benchmark for handwritten Tamil character recognition.
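
The locational step described above (nine zones, each split into four quads, with pixel presence recorded per quad) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `quad_presence_features`, the binary-image input, and the choice to cut the image into a 6 × 6 grid of nearly equal bands are all assumptions made for illustration, and the directional features are not covered here.

```python
import numpy as np

def quad_presence_features(image, zones=3, quads=2):
    """Hypothetical helper: flag which quads of each zone contain ink.

    Returns an array of shape (zones, zones, quads, quads) holding 0/1
    flags, indexed as [zone_row, zone_col, quad_row, quad_col].
    """
    ink = np.asarray(image) > 0  # assumption: nonzero pixels are ink
    n = zones * quads            # 3 zones x 2 quads = 6 bands per axis

    # Split row and column indices into n nearly equal bands.
    row_bands = np.array_split(np.arange(ink.shape[0]), n)
    col_bands = np.array_split(np.arange(ink.shape[1]), n)

    flags = np.zeros((n, n), dtype=np.uint8)
    for i, rows in enumerate(row_bands):
        for j, cols in enumerate(col_bands):
            # A quad is "occupied" if any pixel inside it is ink.
            flags[i, j] = ink[np.ix_(rows, cols)].any()

    # Regroup the 6 x 6 grid of quads into zones of 2 x 2 quads.
    return flags.reshape(zones, quads, zones, quads).transpose(0, 2, 1, 3)

# Toy 12 x 12 image with ink in two places.
demo = np.zeros((12, 12), dtype=np.uint8)
demo[0:2, 0:2] = 1    # top-left quad of the top-left zone
demo[6:8, 10:12] = 1  # bottom-right quad of the middle-right zone
features = quad_presence_features(demo)
```

Flattening `features` gives a 36-element locational vector that could be concatenated with per-zone directional features before classification; how the paper actually combines the two is not specified in the abstract.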