Abstract

Genetic algorithms are usually used in information retrieval systems (IRs) to enhance the information retrieval process, and to increase the efficiency of the optimal information retrieval in order to meet the users' needs and help them find what they want exactly among the growing numbers of available information. The improvement of adaptive genetic algorithms helps to retrieve the information needed by the user accurately, reduces the retrieved relevant files and excludes irrelevant files. In this study, the researcher explored the problems embedded in this process, attempted to find solutions such as the way of choosing mutation probability and fitness function, and chose Cranfield English Corpus test collection on mathematics. Such collection was conducted by Cyrial Cleverdon and used at the University of Cranfield in 1960 containing 1400 documents, and 225 queries for simulation purposes. The researcher also used cosine similarity and jaccards to compute similarity between the query and documents, and used two proposed adaptive fitness function, mutation operators as well as adaptive crossover. The process aimed at evaluating the effectiveness of results according to the measures of precision and recall. Finally, the study concluded that we might have several improvements when using adaptive genetic algorithms. ďż˝

Highlights

  • Due to the increasing number of information and documents created by millions of authors and organizations on the Internet, information can be retrieved through using information retrieval system

  • Results showed some improvements in information retrieval system performance using adaptive genetic algorithms, through implementing some queries, using several methods in order to obtain relevant information, sorting such queries and ranking them depending on similarity measure [13]

  • Results showed that the adaptive genetic algorithm (AGA) is used in information retrieval system (IRs) using Vector Space Model (VSM) and cosine fitness function

Read more

Summary

INTRODUCTION

Due to the increasing number of information and documents created by millions of authors and organizations on the Internet, information can be retrieved through using information retrieval system. Results showed some improvements in information retrieval system performance using adaptive genetic algorithms, through implementing some queries, using several methods in order to obtain relevant information, sorting such queries and ranking them depending on similarity measure [13]. As it should be clear the study aims at investigating the information retrieval models. The researcher used two models: Vector Space Model and Extended Boolean Model to compute the similarity between the query and documents [5]. The corpus of the study consists of 1400 English documents on Mathematics and 255 queries to evaluate the effectiveness of the results according to the measures of precision and recall [4] [7]

INFORMATION RETRIEVAL
GENETIC ALGORITHM
Proposed Fitness Function
Equation of crossover probability
Adaptive mutation
Equation of mutation operator probability
LITERATURE REVIEW
Results
CONCLUSION
FUTURE WORK

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.