EFFICACIOUS GEOSPATIAL INFORMATION RETRIEVAL USING DENSITY PROBABILISTIC DOCUMENT CORRELATION APPROACH

Uma Uma

doi:10.3844/jcssp.2013.83.93

Abstract

Information Retrieval (IR) is the task of finding information that satisfies the need expressed in a query. Ordinary text is comparatively easy to process, and many algorithms exist to retrieve it efficiently. Retrieving geospatial information is considerably more complex and requires additional operations, because geospatial data carry details that general data do not, such as location and direction. To handle geographical queries, we propose a Density Probabilistic Document Correlation (DPDC) approach. The approach first categorizes the geographical features in the text that satisfy the given query. Existing text classification techniques are unsuitable for geospatial text classification because of the exclusivity of geographical features. From the result of the DPDC approach we predict the overlap of the feature set for each document. Documents are then ranked on the basis of this overlap and document correlation, and the highest-scoring documents are retrieved as the most relevant. The experimental results show that the proposed method efficiently retrieves the list of relevant documents.

Highlights

For the past several years, geographical data has been useful for large spatial data sets
The documents most relevant to a user query appear at the top of the list, whereas irrelevant documents are eliminated by the Density Probabilistic Document Correlation (DPDC) approach and are not retrieved
Because of the complex nature of spatial data types and the correlations that exist among spatial data, information retrieval over spatial data is laborious

Summary

INTRODUCTION

Spatial data mining is the process of discovering interesting patterns that were previously unknown but are potentially useful. The correlations and relationships that exist among spatial data are frequently handled by data mining algorithms. Relevant documents are retrieved on the basis of the selected features. Users express their interest as queries to a component that performs the search operation. Stop words affect the retrieval process: they occur frequently in documents yet carry little meaning, and so distort the weighting step carried out in our Density Probabilistic Document Correlation (DPDC) approach. The preprocessed documents or data sets and the user query are passed to the DPDC component. To retrieve relevant documents, we use a ranking algorithm that determines the occurrence of trained features in each document. The documents most relevant to a user query appear at the top of the list, whereas irrelevant documents are eliminated by the DPDC approach and are not retrieved.
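The flow described above (stop-word removal, feature occurrence, overlap, scoring, ranking) can be illustrated with a minimal sketch. This is not the paper's DPDC formulation, whose equations are not reproduced on this page. The stop-word list, the geographic feature vocabulary, the use of relative frequency as the occurrence probability, the Jaccard-style overlap, and the function names `preprocess`, `feature_probabilities`, `score`, and `rank` are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical stop-word list and geographic feature vocabulary (not from the paper).
STOP_WORDS = {"the", "is", "of", "in", "a", "an", "and", "to", "on", "near"}
GEO_FEATURES = {"river", "lake", "mountain", "north", "south", "valley", "coast"}


def preprocess(text):
    """Split on whitespace, strip punctuation, lowercase, drop stop words."""
    tokens = (tok.strip(".,;:!?()\"'").lower() for tok in text.split())
    return [tok for tok in tokens if tok and tok not in STOP_WORDS]


def feature_probabilities(tokens):
    """Relative frequency of each geographic feature term among the tokens."""
    if not tokens:
        return {}
    counts = Counter(tok for tok in tokens if tok in GEO_FEATURES)
    return {term: n / len(tokens) for term, n in counts.items()}


def score(query, document):
    """Assumed score: Jaccard overlap of feature sets times the summed
    occurrence probability of the shared features in the document."""
    q_feats = set(feature_probabilities(preprocess(query)))
    d_probs = feature_probabilities(preprocess(document))
    union = q_feats | set(d_probs)
    if not union:
        return 0.0
    shared = q_feats & set(d_probs)
    overlap = len(shared) / len(union)
    return overlap * sum(d_probs[t] for t in shared)


def rank(query, documents):
    """Return (index, score) pairs by descending score; zero scores are dropped."""
    scored = [(i, score(query, doc)) for i, doc in enumerate(documents)]
    return sorted((p for p in scored if p[1] > 0), key=lambda p: p[1], reverse=True)


docs = [
    "The river flows north of the mountain.",
    "Stock prices rose sharply today.",
    "A lake in the valley near the river.",
]
ranked = rank("river north", docs)
```

In this toy run the second document shares no geographic feature with the query, so it receives a zero score and is left out of the result, mirroring the idea that irrelevant documents are not retrieved. Removing stop words before counting also keeps frequent but uninformative tokens from diluting the feature frequencies.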

RELATED WORK

PROPOSED METHODOLOGY

Feature Selection

Document Preprocessing

DPDC Approach

Estimate Probability of Feature Occurrence

Predict Feature Overlap

Estimate Document Weight

Determine Document Score

Rank and Retrieve the Documents

EXPERIMENTAL EVALUATION

Findings

CONCLUSION


Journal: Journal of Computer Science	Publication Date: Jan 1, 2013
Citations: 2	License type: cc-by


