Combination POS taggers on Amazigh texts

Samir Amri, Lahbib Zenkouar, Mohamed Outahajala

doi:10.1109/cloudtech.2017.8284725

Abstract

Part-of-speech (PoS) tagging is the task of assigning the appropriate morphosyntactic category (noun, verb, adjective, adverb, …) to each word of a sentence according to its context. Several statistical methods have been adapted for PoS tagging, such as Conditional Random Fields, Support Vector Machines, and Decision Trees. Language-independent PoS taggers built on these methods include CRF++, Yamcha, and TreeTagger. Such taggers are developed by modeling the morphosyntactic structure of natural language text. In this paper, we try to improve the accuracy of existing Amazigh PoS taggers using a voting algorithm. The three Amazigh PoS taggers used are: (1) a Conditional Random Fields (CRF) tagger, (2) a Support Vector Machines (SVM) tagger, and (3) TreeTagger (TT). These taggers achieve accuracies of 86.79%, 84.64%, and 86.57%, respectively. All three are trained on an annotated corpus of 60,000 words. An error analysis identifies the mistakes made by each tagger. A voting algorithm is then proposed to combine them into a single Amazigh PoS tagger, which reaches an accuracy of 89.06%. This more accurate PoS tagger could be used in a variety of NLP applications, giving students and researchers an opportunity to work with Amazigh language data using a range of computational tools and techniques.
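
The abstract does not spell out the voting rule, so the following is only a minimal sketch of one common choice: per-token majority voting over the three taggers' outputs, with ties broken in favor of the tagger that has the highest reported accuracy. The function names `vote` and `combine`, the tagger keys, and the tags in the usage note are hypothetical and chosen for illustration; only the accuracy figures come from the abstract.

```python
from collections import Counter

# Accuracies reported in the abstract. Using them as a tie-break order
# is an assumption of this sketch, not the authors' stated method.
TAGGER_ACCURACY = {"crf": 86.79, "svm": 84.64, "treetagger": 86.57}


def vote(tags_by_tagger):
    """Pick one tag for a single token.

    tags_by_tagger maps a tagger name to the tag it proposed.
    The most frequent tag wins; on a tie, the tag proposed by the
    most accurate tagger among the tied candidates is returned.
    """
    counts = Counter(tags_by_tagger.values())
    top = max(counts.values())
    winners = {tag for tag, n in counts.items() if n == top}
    if len(winners) == 1:
        return next(iter(winners))
    # Tie: keep only taggers whose tag is among the tied tags,
    # then trust the one with the highest reported accuracy.
    tied = [name for name, tag in tags_by_tagger.items() if tag in winners]
    best = max(tied, key=lambda name: TAGGER_ACCURACY[name])
    return tags_by_tagger[best]


def combine(sequences):
    """Combine per-tagger tag sequences for one sentence.

    sequences maps a tagger name to its list of tags; all lists
    must cover the same tokens and therefore have equal length.
    """
    lengths = {len(tags) for tags in sequences.values()}
    if len(lengths) != 1:
        raise ValueError("all taggers must tag the same number of tokens")
    n_tokens = lengths.pop()
    return [
        vote({name: tags[i] for name, tags in sequences.items()})
        for i in range(n_tokens)
    ]
```

As a usage example, calling `combine` with `{"crf": ["N", "V", "ADJ"], "svm": ["N", "N", "ADV"], "treetagger": ["V", "N", "PREP"]}` resolves the first two tokens by majority and the third, where all three taggers disagree, by falling back to the CRF tagger's proposal. A weighted vote or a per-tag confidence table derived from the error analysis would be natural alternatives.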

