A comparison of features for POS tagging in Kannada

Shriya Atmakuri,Muralikrishna S N,Ashwath Rao B,Bhavya Shahi

doi:10.14419/ijet.v7i4.14900

Shriya Atmakuri, Muralikrishna S N + Show 2 more

Open Access

https://doi.org/10.14419/ijet.v7i4.14900

Copy DOI

Abstract

This paper proposes a system of part of speech tagging for the South Indian language Kannada using supervised machine learning. POS tagging is an important step in Natural Language Processing and has varied applications such as word sense disambiguation, natural language understanding etc. Based on extensive research into methods used for POS tagging, Conditional Random fields have been chosen as our algorithm. CRFs are used for sequence modeling in POS tagging, named entity recognition and as an alternative to Hidden Markov Models. Three very large corpora are used and their results are compared. The feature sets for all three corpora are also varied. The best method for the task is determined using these results.

Highlights

Part-of-speech tagging is a fundamental task in Natural Language Processing and Computational Linguistics
This paper proposes a system of part of speech tagging for the South Indian language Kannada using supervised machine learning
Parts of Speech (POS) tagging is an important step in Natural Language Processing and has varied applications such as word sense disambiguation, natural language understanding etc

Summary

Introduction

Part-of-speech tagging is a fundamental task in Natural Language Processing and Computational Linguistics. Part of speech tags are frequently used as an important feature for other natural language processing tasks such as word-sense disambiguation, named entity recognition, information retrieval, and machine translation. The Sanskrit grammarian Yaska defined only four categories in his 5th century BC work, Nirukta. These are nama which includes nouns and adjectives, akhyata or verb, upasarga, which is a pre-verb or prefix, and nipata or particle. The Brown Corpus, one of the first English language corpora created for processing by a computer, use 87 tags. Part-of-speech tag will help in parsing, word-sense disambiguation algorithms and in shallow parsing to find names, times, dates or other named entities in the information extraction applications

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Engineering & Technology	Publication Date: Sep 19, 2018
Citations: 5	License type: cc-by

R Discovery Prime

R Discovery Prime

A comparison of features for POS tagging in Kannada

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Engineering & Technology

Lead the way for us

Similar Papers

End to End Parts of Speech Tagging and Named Entity Recognition in Bangla Language
Jillur Rahman Saurav ... Summit Haque
-
Jillur Rahman Saurav, et. al.Jillur Rahman Saurav ... Summit Haque
01 Sep 2019
01 Sep 2019

A fine-grained Chinese word segmentation and part-of-speech tagging corpus for clinical text
Ying Xiong ... Buzhou Tang
BMC Medical Informatics and Decision Making | VOL. 19
Ying Xiong, et. al.Ying Xiong ... Buzhou Tang
01 Apr 2019
BMC Medical Informatics and Decision Making | VOL. 19

Hidden Markov Model based Part of Speech Tagging for Nepali language
Abhijit Paul ... Bipul Syam Purkayastha
-
Abhijit Paul, et. al.Abhijit Paul ... Bipul Syam Purkayastha
01 Sep 2015
01 Sep 2015

Topics in machine learning for biomedical literature analysis and text retrieval
Rezarta Islamaj Doğan ... Lana Yeganova
BMC Bioinformatics | VOL. 12
Rezarta Islamaj Doğan, et. al.Rezarta Islamaj Doğan ... Lana Yeganova
09 Jun 2011
BMC Bioinformatics | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A comparison of features for POS tagging in Kannada

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Engineering &amp; Technology

More From: International Journal of Engineering & Technology