Algorithmic stemmers or morphological analysis? An evaluation

Claire Fautsch,Jacques Savoy

doi:10.1002/asi.21093

Abstract

AbstractIt is important in information retrieval (IR), information extraction, or classification tasks that morphologically related forms are conflated under the same stem (using stemmer) or lemma (using morphological analyzer). To achieve this for the English language, algorithmic stemming or various morphological analysis approaches have been suggested. Based on Cross‐Language Evaluation Forum test collections containing 284 queries and various IR models, this article evaluates these word‐normalization proposals. Stemming improves the mean average precision significantly by around 7% while performance differences are not significant when comparing various algorithmic stemmers or algorithmic stemmers and morphological analysis. Accounting for thesaurus class numbers during indexing does not modify overall retrieval performances. Finally, we demonstrate that including a stop word list, even one containing only around 10 terms, might significantly improve retrieval performance, depending on the IR model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Algorithmic stemmers or morphological analysis? An evaluation

Abstract

Talk to us

Similar Papers

More From: Journal of the American Society for Information Science and Technology

Lead the way for us

Journal: Journal of the American Society for Information Science and Technology	Publication Date: May 6, 2009
Citations: 29

Similar Papers

BioMedBERT: A Pre-trained Biomedical Language Model for QA and IR
Souradip Chakraborty ... Thomas Wagner
-
Souradip Chakraborty, et. al.Souradip Chakraborty ... Thomas Wagner
01 Jan 2020
01 Jan 2020

Neural models for information retrieval without labeled data
Hamed Zamani
ACM SIGIR Forum | VOL. 53
Hamed ZamaniHamed Zamani
01 Dec 2019
ACM SIGIR Forum | VOL. 53

Generative user models for adaptive information retrieval
Y Motomura ... K Fujimoto
-
Y Motomura, et. al.Y Motomura ... K Fujimoto
08 Oct 2000
08 Oct 2000

IR meets NLP
Dmitrijs Milajevs ... Mehrnoosh Sadrzadeh
-
Dmitrijs Milajevs, et. al.Dmitrijs Milajevs ... Mehrnoosh Sadrzadeh
27 Sep 2015
27 Sep 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Algorithmic stemmers or morphological analysis? An evaluation

Abstract

Talk to us

Similar Papers

More From: Journal of the American Society for Information Science and Technology