Information Retrieval with Hindi, Bengali, and Marathi Languages: Evaluation and Analysis

Jacques Savoy,Mitra Akasereh,Ljiljana Dolamic

doi:10.1007/978-3-642-40087-2_30

Abstract

Our first objective in participating in FIRE evaluation campaigns is to analyze the retrieval effectiveness of various indexing and search strategies when dealing with corpora written in Hindi, Bengali and Marathi languages. As a second goal, we have developed new and more aggressive stemming strategies for both Marathi and Hindi languages during this second campaign. We have compared their retrieval effectiveness with both light stemming strategy and n-gram language-independent approach. As another language-independent indexing strategy, we have evaluated the trunc-n method in which the indexing term is formed by considering only the first n letters of each word. To evaluate these solutions we have used various IR models including models derived from Divergence from Randomness (DFR), Language Model (LM) as well as Okapi, or the classical tf idf vector-processing approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Information Retrieval with Hindi, Bengali, and Marathi Languages: Evaluation and Analysis

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Comparative Study of Indexing and Search Strategies for the Hindi, Marathi, and Bengali Languages
Ljiljana Dolamic ... Jacques Savoy
ACM Transactions on Asian Language Information Processing | VOL. 9
Ljiljana Dolamic, et. al.Ljiljana Dolamic ... Jacques Savoy
01 Sep 2010
ACM Transactions on Asian Language Information Processing | VOL. 9

Indexing and stemming approaches for the Czech language
Ljiljana Dolamic ... Jacques Savoy
Information Processing and Management | VOL. 45
Ljiljana Dolamic, et. al.Ljiljana Dolamic ... Jacques Savoy
16 Jul 2009
Information Processing and Management | VOL. 45

Persian Language, Is Stemming Efficient?
Ljiljana Dolamic ... Jacques Savoy
-
Ljiljana Dolamic, et. al.Ljiljana Dolamic ... Jacques Savoy
01 Jan 2009
01 Jan 2009

Indexing and searching strategies for the Russian language
Ljiljana Dolamic ... Jacques Savoy
Journal of the American Society for Information Science and Technology | VOL. 60
Ljiljana Dolamic, et. al.Ljiljana Dolamic ... Jacques Savoy
07 Aug 2009
Journal of the American Society for Information Science and Technology | VOL. 60

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Information Retrieval with Hindi, Bengali, and Marathi Languages: Evaluation and Analysis

Abstract

Talk to us

Similar Papers