Abstract

Information Retrieval system retrieves relevant documents from large datasets. Automatic Query Expansion (AQE) is one of the approaches to enhance IR performance by adding additional terms to original query. The selection of suitable additional terms for AQE is a crucial task. Term weighting method is one of the ways to deal with such a problem. This paper presents a new term weighting based AQE approach to retrieve more relevant documents from data corpus. The proposed approach comprises of three major steps. First step determines the optimal weights of different IR evidences for different terms using Particle Swarm Optimization (PSO). Fuzzy logic technique is used to improve performance of PSO by controlling inertia and acceleration coefficients during the optimization. Co-occurrence score is introduced as new IR evidence in the proposed approach. Second step is focused on removal of noisy terms by using new combined semantic filtering method. Third step reweights the terms using Rocchio method. The proposed approach is compared with recently developed automatic query expansion approaches in terms of performance measures such as precision, recall, F-measure and MAP (Mean Average Precision). Three benchmark datasets CACM, CISI and TREC-3 are used to verify the results. The proposed approach is found better than other approaches according to results obtained for these benchmark datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.