Abstract

In the objective of a possible performance improvement of the Arabic information retrieval systems, we propose to introduce the latent semantic analysis method to cure the problems arising from the vector- space model. The present contribution describes how linguistic processing and weighting schemes could improve the LSA method, and the comparison between the vector-space model and LSA approach, which aim to reduce the index term number of an Arabic corpus specialized in the environment field. The results of our experiments show clearly a positive influence of the linguistic processing and weighting schemes, and LSA improvement compared to the vector-space model.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.