Parts of Speech Taggers for Indo Aryan Languages: A critical Review of Approaches and Performances

Kuwali Talukdar,Shikhar Kumar Sarma

doi:10.1109/i3cs58314.2023.10127336

Abstract

Automatic Parts of Speech (PoS) Tagging is a sequence labeling problem. PoS Tagging research has undergone an evolutionary journey starting with Dictionary Lookup PoS Tagger model, and then using rule based and statistical schemes, and later on adopting hybrid methodology for enhanced performance. Emergence of Machine Learning (ML) has boosted the activities adding newer dimensions to look at the problem with deeper linguistics and computational perspectives, and in recent times this has shifted to completely self-learning models with incorporation of Deep Learning (DL) tools. Here we have recorded and analyzed this trajectory for the PoS tagger development and experimentation for the Indo Aryan languages. Rule based and statistical models performed to an acceptable level, but are not robust and dynamic. ML and DL based models outperformed all other models, and started giving higher performances, with reported accuracy to the tune of upto 97% in few cases. Various customized models using DL have been experimented in very recent days, and different groups have reported best performed models using a variety of combination of pre-processing methods to that with DL tools, substantiating with quantitative performance matrix reports. Structured reports, inclusive methodologies adopted, and quantitative performance evaluation comparisons are elaborated in the paper. This comprehensive and critical review shall act as a foundational backbone for any Indo Aryan language PoS Tagger modeling experiment, either fresh, or attempt to enhance performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parts of Speech Taggers for Indo Aryan Languages: A critical Review of Approaches and Performances

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Part of speech tagging: a systematic review of deep learning and machine learning approaches
Alebachew Chiche ... Betselot Yitagesu
Journal of Big Data | VOL. 9
Alebachew Chiche, et. al.Alebachew Chiche ... Betselot Yitagesu
24 Jan 2022
Journal of Big Data | VOL. 9

A REVIEW ON DIFFERENT APPROACHES OF POS TAGGING IN NLP
K Aparna ... Pooja Bhakta
-
K Aparna, et. al. K Aparna ... Pooja Bhakta
01 Jan 2020
01 Jan 2020

Part of Speech Tagging in Urdu: Comparison of Machine and Deep Learning Approaches
Wahab Khan ... Ali Daud
IEEE Access | VOL. 7
Wahab Khan, et. al.Wahab Khan ... Ali Daud
01 Jan 2019
IEEE Access | VOL. 7

Combination of Genetic Algorithm and Brill Tagger Algorithm for Part of Speech Tagging Bahasa Madura
Nindian Puspa Dewi ... Ubaidi Ubaidi
Proceeding of the Electrical Engineering Computer Science and Informatics | VOL. 7
Nindian Puspa Dewi, et. al.Nindian Puspa Dewi ... Ubaidi Ubaidi
01 Oct 2020
Proceeding of the Electrical Engineering Computer Science and Informatics | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parts of Speech Taggers for Indo Aryan Languages: A critical Review of Approaches and Performances

Abstract

Talk to us

Similar Papers