A semantic model of morphological information retrieval: A comparative accumulative analysis

Nabeel Z Tawfeeq,Wisam S Abed,Omar G Ghazal

doi:10.1109/aicis51645.2020.00011

Nabeel Z Tawfeeq, Wisam S Abed + Show 1 more

https://doi.org/10.1109/aicis51645.2020.00011

Copy DOI

Export

Save

Cite

Publication Date: Nov 1, 2020

Affiliation: University of Mosul

Abstract
Full-Text
Similar Papers

Abstract

Listen

The main function of information retrieval (IR) system is to obtain efficient and exactly a minimum subset of document that is related to user concern. Synonymy and polysemy act as a barrier for natural language processing algorithms due to overestimation and misrepresentation. The proposed model uses the implicit of higher rank structure in combing terms with document to optimize the identification of relevant document based on terms used in queries with an enhanced automatic indexing approach has been suggested. The study benefited from the use of Term Frequency Inverse Document Frequency (TF-IDF) method to assign weight for each term in the document. Each document is presented as a vector of weight in the space. Also, the user query is represented as vector of weight. Finally, a Singular Value Decomposition (SVD) approach has been used in which a huge weight of term-document matrix is factorized into collection of vectors for approximation of the original matrix. The cosine similarity is also used to determine the closed vector of document to the user query. In regard to English information retrieval, It was observed that TF-IDF showed higher performance before term percentage 0.3 while Latent Semantic Indexing (LSI) was more stable than TF-IDF, especially in terms of the use of word association.

Full Text