Malayalam POS Tagger—A Comparison Using SVM and HMM

K Usha,S Lakshmana Pandian

doi:10.1007/978-981-15-5788-0_40

Abstract

Many Parts Of Speech (POS) taggers for the Malayalam language has been implemented using Support Vector Machine (SVM), Memory-Based Language Processing (MBLP), Hidden Markov Model (HMM) and other similar techniques. The objective was to find an improved POS tagger for the Malayalam language. This work proposed a comparison of the Malayalam POS tagger using the SVM and Hidden Markov model (HMM). The tagset used was the popular Bureau of Indian Standard (BIS) tag set. A manually created data set which has around 52,000 words has been taken from various Malayalam news sites. The preprocessing steps that have done for news text are also mentioned. Then POS tagging has been done using SVM and HMM. As POS tagging requires the extraction of multiple class labels, a multi-class SVM is used. It also performs feature extraction, feature selection, and classification. The word sense disambiguation and misclassification of words are the two major issues identified in SVM. Hidden Markov Model predicts the hidden sequence based on maximum observation likelihood which reduces ambiguity and misclassification rate.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Malayalam POS Tagger—A Comparison Using SVM and HMM

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Part of Speech Tagging in Bengali Using Support Vector Machine
Asif Ekbal ... Sivaji Bandyopadhyay
-
Asif Ekbal, et. al.Asif Ekbal ... Sivaji Bandyopadhyay
01 Dec 2008
01 Dec 2008

Web-Based Bengali News Corpus for Lexicon Development and POS Tagging
Asif Ekbal ... Sivaji Bandyopadhyay
Polibits | VOL. 37
Asif Ekbal, et. al.Asif Ekbal ... Sivaji Bandyopadhyay
30 Jun 2008
Polibits | VOL. 37

A Comparative Study of Standard Part-of-Speech Taggers
Imad Zeroual ... Abdelhak Lakhouaja
-
Imad Zeroual, et. al.Imad Zeroual ... Abdelhak Lakhouaja
01 Jan 2019
01 Jan 2019

Hidden Markov Model based Part of Speech Tagging for Nepali language
Abhijit Paul ... Bipul Syam Purkayastha
-
Abhijit Paul, et. al.Abhijit Paul ... Bipul Syam Purkayastha
01 Sep 2015
01 Sep 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Malayalam POS Tagger—A Comparison Using SVM and HMM

Abstract

Talk to us

Similar Papers