Morpheme based Language Model for Part-of-Speech Tagging

S Lakshmana Pandian, T.V Geetha

doi:10.17562/pb-38-2

Journal: Polibits
Publication Date: Dec 31, 2008
Citations: 23

#Part Of Speech #Language Model

Abstract

This paper describes Tamil part-of-speech (POS) tagging using a corpus-based approach that formulates a language model from the morpheme components of words. Rule-based tagging, Markov model taggers, Hidden Markov Model taggers, and transformation-based learning taggers are some of the methods available for part-of-speech tagging. In this paper, we present a language model that uses the stem type, the last morpheme, and the morpheme preceding the last morpheme of a word to categorize its part of speech. To estimate the contribution factors of the model, we follow the generalized iterative scaling technique. The presented model has an overall F-measure of 96%.
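
The following is a minimal sketch of how generalized iterative scaling (GIS) could estimate per-feature weights for a tagger that uses the three cues named in the abstract. The abstract does not give the model's exact form, so the sketch assumes a conditional log-linear model. The function names, tag labels, and toy training examples are all hypothetical placeholders rather than data or code from the paper.

```python
import math
from collections import defaultdict

def features(stem_type, last, prev, tag):
    # Exactly three binary features fire for every (word, tag) pair,
    # so the feature sum is the constant C = 3 that GIS requires.
    return [("stem", stem_type, tag), ("last", last, tag), ("prev", prev, tag)]

def tag_distribution(weights, tags, x):
    # Conditional log-linear model: P(tag | x) is proportional to exp(sum of weights).
    scores = {t: math.exp(sum(weights[f] for f in features(*x, t))) for t in tags}
    z = sum(scores.values())
    return {t: s / z for t, s in scores.items()}

def train_gis(data, iterations=50):
    tags = sorted({y for _, y in data})
    C = 3.0
    # Empirical feature counts from the labelled examples.
    empirical = defaultdict(float)
    for x, y in data:
        for f in features(*x, y):
            empirical[f] += 1.0
    weights = defaultdict(float)  # all weights start at 0
    for _ in range(iterations):
        # Expected feature counts under the current model.
        expected = defaultdict(float)
        for x, _ in data:
            p = tag_distribution(weights, tags, x)
            for t in tags:
                for f in features(*x, t):
                    expected[f] += p[t]
        # GIS update. Simplification (an assumption of this sketch): features
        # never observed in the data keep weight 0, whereas strict GIS would
        # drive them toward minus infinity.
        for f, emp in empirical.items():
            weights[f] += math.log(emp / expected[f]) / C
    return weights, tags

def predict(weights, tags, x):
    p = tag_distribution(weights, tags, x)
    return max(tags, key=lambda t: p[t])

# Hypothetical toy data: (stem type, last morpheme, previous morpheme) -> tag.
toy_data = [
    (("noun_stem", "case", "plural"), "NOUN"),
    (("noun_stem", "case", "none"), "NOUN"),
    (("verb_stem", "png", "tense"), "VERB"),
    (("verb_stem", "tense", "none"), "VERB"),
]
weights, tags = train_gis(toy_data)
```

Calling `predict(weights, tags, ("noun_stem", "case", "plural"))` returns the tag with the highest model probability for that combination of cues. Defining one feature per (cue value, tag) pair keeps the feature sum constant, which avoids the extra correction feature that GIS otherwise needs.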

Full Text