Empirical evaluation and study of text stemming algorithms

Abdul Jabbar, Sajid Iqbal, Manzoor Ilahi Tamimy, Shafiq Hussain, Adnan Akhunzada

doi:10.1007/s10462-020-09828-3

Abstract

Text stemming is one of the basic preprocessing steps for Natural Language Processing applications; it transforms different word forms into a standard root form. For Arabic-script-based languages, adequate analysis of text by stemmers is challenging because of the large number of ambiguous structures in the language. In the literature, multiple performance evaluation metrics exist for stemmers, each describing performance from a particular aspect. In this work, we review and analyze text stemming evaluation methods in order to devise criteria for better measurement of stemmer performance. The role of different aspects of stemmer performance measurement, such as main features, merits, and shortcomings, is discussed using a resource-scarce language, namely Urdu. Through our experiments we conclude that the current evaluation metrics can only measure the average conflation of words, regardless of the correctness of the stem. Moreover, some evaluation metrics favor only certain types of languages. None of the existing evaluation metrics can perfectly measure stemmer performance for all kinds of languages. This study will help researchers evaluate their stemmers using the right methods.
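The point that conflation-based metrics ignore stem correctness can be illustrated with a minimal sketch. The abstract does not name a specific metric; the example below assumes the index compression factor, ICF = (N - S) / N, where N is the number of distinct words and S the number of distinct stems. The word list and both toy stemmers are hypothetical and chosen only for illustration, in English rather than Urdu.

```python
def index_compression_factor(words, stem):
    """ICF = (N - S) / N, with N distinct words and S distinct stems."""
    unique_words = set(words)
    unique_stems = {stem(w) for w in unique_words}
    return (len(unique_words) - len(unique_stems)) / len(unique_words)


def truncate_stemmer(word, keep=4):
    # Hypothetical toy stemmer: keeps only the first `keep` characters.
    return word[:keep]


def identity_stemmer(word):
    # Hypothetical baseline: performs no stemming at all.
    return word


# Illustrative word list (an assumption, not data from the paper).
words = [
    "connect", "connected", "connecting", "connection",
    "conned",  # unrelated to "connect", but shares the same 4-letter prefix
    "relate", "related", "relating",
]

icf_truncate = index_compression_factor(words, truncate_stemmer)
icf_identity = index_compression_factor(words, identity_stemmer)

print(f"truncate stemmer ICF: {icf_truncate:.2f}")
print(f"identity stemmer ICF: {icf_identity:.2f}")
```

In this sketch the truncating stemmer merges "conned" with the "connect" family, which is an over-stemming error, yet its ICF is higher than the baseline's. The score reflects only how much the vocabulary shrinks, not whether each stem is linguistically correct.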

Journal: Artificial Intelligence Review
Publication Date: Apr 15, 2020
Citations: 24
