Applying Genetic Algorithms to Information Retrieval Using Vector Space Model

Laith Mohammad Qasim Abualigah,Essam S.Hanandeh

doi:10.5121/ijcsea.2015.5102

Abstract

Genetic algorithms are usually used in information retrieval systems (IRs) to enhance the information retrieval process, and to increase the efficiency of the optimal information retrieval in order to meet the users' needs and help them find what they want exactly among the growing numbers of available information. The improvement of adaptive genetic algorithms helps to retrieve the information needed by the user accurately, reduces the retrieved relevant files and excludes irrelevant files. In this study, the researcher explored the problems embedded in this process, attempted to find solutions such as the way of choosing mutation probability and fitness function, and chose Cranfield English Corpus test collection on mathematics. Such collection was conducted by Cyrial Cleverdon and used at the University of Cranfield in 1960 containing 1400 documents, and 225 queries for simulation purposes. The researcher also used cosine similarity and jaccards to compute similarity between the query and documents, and used two proposed adaptive fitness function, mutation operators as well as adaptive crossover. The process aimed at evaluating the effectiveness of results according to the measures of precision and recall. Finally, the study concluded that we might have several improvements when using adaptive genetic algorithms. �

Highlights

Due to the increasing number of information and documents created by millions of authors and organizations on the Internet, information can be retrieved through using information retrieval system
Results showed some improvements in information retrieval system performance using adaptive genetic algorithms, through implementing some queries, using several methods in order to obtain relevant information, sorting such queries and ranking them depending on similarity measure [13]
Results showed that the adaptive genetic algorithm (AGA) is used in information retrieval system (IRs) using Vector Space Model (VSM) and cosine fitness function

Summary

INTRODUCTION

Due to the increasing number of information and documents created by millions of authors and organizations on the Internet, information can be retrieved through using information retrieval system. Results showed some improvements in information retrieval system performance using adaptive genetic algorithms, through implementing some queries, using several methods in order to obtain relevant information, sorting such queries and ranking them depending on similarity measure [13]. As it should be clear the study aims at investigating the information retrieval models. The researcher used two models: Vector Space Model and Extended Boolean Model to compute the similarity between the query and documents [5]. The corpus of the study consists of 1400 English documents on Mathematics and 255 queries to evaluate the effectiveness of the results according to the measures of precision and recall [4] [7]

INFORMATION RETRIEVAL

GENETIC ALGORITHM

Proposed Fitness Function

Equation of crossover probability

Adaptive mutation

Equation of mutation operator probability

LITERATURE REVIEW

Results

CONCLUSION

FUTURE WORK

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Computer Science, Engineering and Applications	Publication Date: Feb 28, 2015
Citations: 74	License type: cc-by

R Discovery Prime

R Discovery Prime

Applying Genetic Algorithms to Information Retrieval Using Vector Space Model

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Computer Science, Engineering and Applications

Lead the way for us

Similar Papers

APPLYING GENETIC ALGORITHMS TO INFORMATION RETRIEVAL USING VECTOR SPACE MODEL

Zenodo (CERN European Organization for Nuclear Research) | VOL. -

23 Feb 2015
Zenodo (CERN European Organization for Nuclear Research) | VOL. -

Correction: Mahmood et al. Hard Real-Time Task Scheduling in Cloud Computing Using an Adaptive Genetic Algorithm. Computers 2017, 6, 15
Amjad Mahmood ... Rashed A Bahlool
Computers | VOL. 7
Amjad Mahmood, et. al.Amjad Mahmood ... Rashed A Bahlool
15 Jun 2018
Computers | VOL. 7

A Genetic Optimization Algorithm Based on Adaptive Dimensionality Reduction
Tai Kuang ... Minghai Xu
Mathematical Problems in Engineering | VOL. 2020
Tai Kuang, et. al.Tai Kuang ... Minghai Xu
11 May 2020
Mathematical Problems in Engineering | VOL. 2020

An adaptive genetic algorithm with diversity-guided mutation and its global convergence property
Mei-Yi Li ... Zi-Xing Cai
Journal of Central South University of Technology | VOL. 11
Mei-Yi Li, et. al.Mei-Yi Li ... Zi-Xing Cai
01 Sep 2004
Journal of Central South University of Technology | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Applying Genetic Algorithms to Information Retrieval Using Vector Space Model

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Computer Science, Engineering and Applications