Application of Latent Semantic Indexing for Hindi-English CLIR Irrespective of Context Similarity

A P Sivakumar, P Premchand, A Govardhan

doi:10.1007/978-3-642-22543-7_73

Abstract

Retrieving information across languages raises problems such as polysemy and synonymy, which Latent Semantic Indexing (LSI) techniques can help resolve. This paper uses the Singular Value Decomposition (SVD) step of LSI to achieve effective indexing for English and Hindi. A parallel corpus of Hindi and English documents is created and used for training and testing the system. Stop words are removed from the documents, followed by stemming and normalization, in order to reduce the feature space and capture relations between the two languages. Cosine similarity is then computed between the query document and the target document. Our experimental results show that LSI-based CLIR outperforms non-LSI-based retrieval, with retrieval success rates of 67% and 9% respectively.

Keywords: Latent semantic indexing, Cross language information retrieval, Indexing, Singular value decomposition
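
The pipeline outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy corpus, the stop lists, the rank `k`, and all function names are assumptions made for illustration, and stemming and normalization are left out for brevity. The sketch also assumes the common cross-language LSI setup in which each training document is an English text merged with its Hindi translation, and queries and documents are folded into the latent space before cosine similarity is computed.

```python
import numpy as np

# Hypothetical toy parallel corpus (not data from the paper).
PARALLEL = [
    ("the farmer grows rice", "किसान चावल उगाता है"),
    ("the farmer sells rice", "किसान चावल बेचता है"),
    ("the doctor treats the patient", "डॉक्टर मरीज का इलाज करता है"),
    ("the doctor visits the hospital", "डॉक्टर अस्पताल जाता है"),
]
STOP_WORDS = {"the", "है", "का"}  # tiny illustrative stop lists


def tokenize(text):
    """Whitespace tokenization plus stop-word removal (no stemming here)."""
    return [t for t in text.lower().split() if t not in STOP_WORDS]


def term_vector(text, vocab):
    """Raw term-frequency vector over the shared bilingual vocabulary."""
    v = np.zeros(len(vocab))
    for t in tokenize(text):
        if t in vocab:
            v[vocab[t]] += 1.0
    return v


def train_lsi(pairs, k):
    """Build a term-by-document matrix from merged pairs and truncate its SVD."""
    merged = [en + " " + hi for en, hi in pairs]
    terms = sorted({t for d in merged for t in tokenize(d)})
    vocab = {t: i for i, t in enumerate(terms)}
    A = np.column_stack([term_vector(d, vocab) for d in merged])
    U, s, _ = np.linalg.svd(A, full_matrices=False)
    return vocab, U[:, :k], s[:k]


def fold_in(text, vocab, U_k, s_k):
    """Project a monolingual text into the k-dimensional latent space."""
    return (term_vector(text, vocab) @ U_k) / s_k


def cosine(a, b):
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom else 0.0


def rank(query, docs, model):
    """Score documents against a query in latent space, best match first."""
    vocab, U_k, s_k = model
    q = fold_in(query, vocab, U_k, s_k)
    scored = [(cosine(q, fold_in(d, vocab, U_k, s_k)), d) for d in docs]
    return sorted(scored, reverse=True)


if __name__ == "__main__":
    model = train_lsi(PARALLEL, k=2)
    english_docs = [en for en, _ in PARALLEL]
    for score, doc in rank("चावल", english_docs, model):  # Hindi query: "rice"
        print(f"{score:.3f}  {doc}")
```

In this toy setting a Hindi query shares no surface terms with the English documents, so plain term matching gives a cosine of zero, while the latent space built from the merged pairs lets the query match the English documents about the same topic. This is only meant to show the mechanism behind the LSI versus non-LSI comparison, not to reproduce the reported figures.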

Similar Papers

Indian Languages IR using Latent Semantic Indexing
A.P Sivakumar ... A Govardhan
International Journal of Computer Science and Information Technology | VOL. 3
30 Aug 2011

Latent Semantic Analysis (LSA) based object recognition and clustering
Vinaykumar Hebballi ... Vidhu Rojit
01 Oct 2015

Implementation techniques for large-scale latent semantic indexing applications
Roger B Bradford
24 Oct 2011

Effective News Text Summarization Techniques
International Journal of Advanced Trends in Computer Science and Engineering | VOL. 12
15 Jun 2023
