Abstract

The Internet is arguably the most successful distributed computing system ever built. However, our capabilities for querying and manipulating data on the Internet remain rudimentary at best. User expectations have grown over the past few decades along with the volume of operational data, and users now expect deeper, more exact, and more detailed results. The quality of the results retrieved for a query always depends on how the data is stored and indexed. In information retrieval (IR) systems, tokenization is an integral step whose primary objective is to identify tokens and their counts. In this paper, we propose an effective tokenization approach based on a training vector, and our results demonstrate the efficiency and effectiveness of the proposed algorithm. Tokenizing documents helps satisfy the user's information need more precisely and sharply reduces the search space, and is therefore considered a part of information retrieval. Preprocessing of the input document is an integral part of tokenization: the document is preprocessed and its tokens are generated, and on the basis of these tokens a probabilistic IR model computes its scores over a reduced search space. The comparative analysis is based on two parameters: the number of tokens generated and the preprocessing time.
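To make the preprocessing step concrete, the sketch below shows a minimal tokenization and token-counting routine in Python. It is not the paper's training-vector-based algorithm; it only illustrates the baseline operation the abstract describes, splitting a document into tokens and recording each token's frequency, which a probabilistic IR model could then use for scoring. All function names here are illustrative assumptions.

```python
# Minimal tokenization and token-counting sketch (illustrative only; not the
# proposed training-vector-based approach from the paper).

import re
from collections import Counter

def tokenize(document: str) -> list[str]:
    """Lowercase the document and split it into simple word tokens."""
    return re.findall(r"[a-z0-9]+", document.lower())

def preprocess(document: str) -> Counter:
    """Return a term-frequency map (token -> count) for one document."""
    return Counter(tokenize(document))

if __name__ == "__main__":
    doc = "Tokenization splits a document into tokens; tokens drive IR scoring."
    counts = preprocess(doc)
    print(len(counts), "distinct tokens")   # number of tokens generated
    print(counts.most_common(3))            # sample token frequencies
```

The two quantities printed here correspond loosely to the evaluation parameters named in the abstract (number of tokens generated, and the preprocessing work needed to produce them).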
