Efficient large-scale distance-based join queries in spatialhadoop

Francisco García-García, Michael Vassilakopoulos, Luis Iribarne, Yannis Manolopoulos, Antonio Corral

doi:10.1007/s10707-017-0309-y

Open Access

https://doi.org/10.1007/s10707-017-0309-y

Abstract

Efficient processing of Distance-Based Join Queries (DBJQs) in spatial databases is of paramount importance in many application domains. The most representative and best-known DBJQs are the K Closest Pairs Query (KCPQ) and the ε Distance Join Query (εDJQ). These types of join queries are characterized by a number of desired pairs (K) or a distance threshold (ε) between the components of the pairs in the final result, over two spatial datasets. Both are expensive operations, since two spatial datasets are combined with additional constraints. Given the increasing volume of spatial data originating from multiple sources and stored in distributed servers, it is not always efficient to perform DBJQs on a centralized server. For this reason, this paper addresses the problem of computing DBJQs on big spatial datasets in SpatialHadoop, an extension of Hadoop that supports efficient processing of spatial queries in a cloud-based setting. We propose novel algorithms, based on plane-sweep, to perform efficient parallel DBJQs on large-scale spatial datasets in SpatialHadoop. We evaluate the performance of the proposed algorithms in several situations with large real-world as well as synthetic datasets. The experiments demonstrate the efficiency and scalability of our proposed methodologies.
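
To make the two query definitions concrete, the following is a minimal brute-force sketch in Python. It is not the plane-sweep or SpatialHadoop algorithm proposed in the paper; it only illustrates what each query returns. The function names `kcpq` and `edjq`, the use of Euclidean distance on 2D points, and the inclusive threshold (distance at most ε) are assumptions made for illustration.

```python
import heapq
from itertools import product
from math import dist  # Euclidean distance, Python 3.8+


def kcpq(P, Q, k):
    """Hypothetical helper: the k closest (distance, p, q) pairs, nearest first.

    Brute force over all |P| * |Q| pairs; ties are broken by tuple order.
    """
    pairs = ((dist(p, q), p, q) for p, q in product(P, Q))
    return heapq.nsmallest(k, pairs)


def edjq(P, Q, eps):
    """Hypothetical helper: every (p, q) pair with distance at most eps.

    The inclusive comparison (<=) is an assumption, not taken from the paper.
    """
    return [(p, q) for p, q in product(P, Q) if dist(p, q) <= eps]


# Two tiny example datasets (made up for illustration).
P = [(0.0, 0.0), (5.0, 5.0)]
Q = [(1.0, 0.0), (5.0, 6.0), (9.0, 9.0)]

closest_two = kcpq(P, Q, 2)
within_threshold = edjq(P, Q, 1.5)
```

Both helpers examine every pair of points from the two datasets, which is the cost the abstract describes as expensive and which motivates distributing the work across a cluster.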

Journal: GeoInformatica
Publication Date: Sep 20, 2017
Citations: 14
License type: cc-by-nc-nd

Similar Papers

Compact Data Structures for Efficient Processing of Distance-Based Join Queries
Guillermo De Bernardo ... Nieves R Brisaboa
19 Nov 2022

New plane-sweep algorithms for distance-based join queries in spatial databases
George Roumelis ... Antonio Corral
GeoInformatica | VOL. 20
27 Feb 2016

Efficient distance join query processing in distributed spatial data management systems
Francisco García-García ... Yannis Manolopoulos
Information Sciences | VOL. 512
14 Oct 2019

A performance comparison of distance-based query algorithms using R-trees in spatial databases
Antonio Corral ... J Almendrosjimenez
Information Sciences | VOL. 177
12 Jan 2007
