On Packing Very Large R-trees

Haoyu Tan,Huajian Mao,Wuman Luo,Lionel M Ni

doi:10.1109/mdm.2012.40

Abstract

Many emerging mobile applications require analyzing large spatial datasets. In these applications, efficient query processing relies on spatial access methods such as R-trees. For datasets that are fairly static, R-trees are often built as a data loading process using packing techniques. However, traditional R-tree packing algorithms can only run on a single machine and thereby cannot scale to very large datasets. In this paper, we design and implement a general framework for parallel Rtree packing using MapReduce. This framework sequentially packs each R-tree level from bottom up. For lower levels that have a large number of rectangles, we propose a partition based algorithm for parallel packing. We also discuss two spatial partitioning methods that can efficiently handle heavily skewed datasets. To evaluate the performance, we conducted extensive experiments using large real datasets. The size of the datasets is up to 100GB and the number of spatial objects is up to 2 billion. Besides range queries, k-nearest neighbor searches and spatial joins are also used for evaluation. To the best of our knowledge, it is the first work that evaluates the query performance of packed R-trees on such large datasets with spatial queries other than range queries. The results confirm the scalability of our proposed framework and parallel packing algorithms. It is also shown that our packed R-trees have good query performance and optimal space utilization.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On Packing Very Large R-trees

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Efficient query processing on large spatial databases: A performance study
George Roumelis ... Antonio Corral
Journal of Systems and Software | VOL. 132
George Roumelis, et. al.George Roumelis ... Antonio Corral
06 Jul 2017
Journal of Systems and Software | VOL. 132

Efficient evaluation of partially-dimensional range queries in large OLAP datasets
Yaokai Feng ... Akifumi Makinouchi
International Journal of Data Mining, Modelling and Management | VOL. 3
Yaokai Feng, et. al.Yaokai Feng ... Akifumi Makinouchi
01 Jan 2010
International Journal of Data Mining, Modelling and Management | VOL. 3

Range and region query processing in spatial databases

-

28 Feb 2017
28 Feb 2017

Web data retrieval: solving spatial range queries using k-nearest neighbor searches
Wan D Bae ... Cyrus Shahabi
GeoInformatica | VOL. 13
Wan D Bae, et. al.Wan D Bae ... Cyrus Shahabi
03 Oct 2008
GeoInformatica | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On Packing Very Large R-trees

Abstract

Talk to us

Similar Papers