Abstract

Modern geoinformation technologies for data collection and processing, such as laser scanning and photogrammetry, can generate point clouds containing billions of points. These clouds provide abundant information that can be used for many types of analysis. Owing to its characteristics, the point cloud is often regarded as a special type of geospatial data. Managing such volumes of data efficiently requires techniques based on a computer cluster. The Apache Spark framework has proven to be an efficient solution for processing large volumes of data. This paper thoroughly examines the representation of the point cloud data type using Apache Spark constructs. Common operations over point clouds, range queries and k-nearest neighbors (kNN) queries, are implemented using the Apache Spark DataFrame Application Programming Interface (API), which enabled the design of point-cloud-related user-defined types (UDTs) and user-defined functions (UDFs). The structuring of point clouds for efficient storage in Big Data key-value stores is also analyzed and described. The presented methods are compared to the PostgreSQL RDBMS, and the results are discussed.
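
As a rough illustration of the approach, the sketch below filters a point cloud with an axis-aligned range query expressed through the Spark DataFrame API and a UDF. This is a minimal sketch, not the paper's implementation; the column names (x, y, z), the box predicate, and the toy data are illustrative assumptions.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.{col, lit, udf}

object RangeQuerySketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("point-cloud-range-query")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Toy point cloud: one row per point, with x, y, z coordinates.
    val points = Seq(
      (0.5, 1.2, 10.3), (3.7, 2.1, 11.0), (7.9, 8.4, 9.8)
    ).toDF("x", "y", "z")

    // UDF testing whether a point lies inside an axis-aligned 2D box;
    // the paper wraps comparable logic in point-cloud-specific UDFs.
    val inBox = udf((x: Double, y: Double,
                     xMin: Double, yMin: Double,
                     xMax: Double, yMax: Double) =>
      x >= xMin && x <= xMax && y >= yMin && y <= yMax)

    // Range query: keep only the points inside the box (0,0)-(5,5).
    points
      .filter(inBox(col("x"), col("y"),
                    lit(0.0), lit(0.0), lit(5.0), lit(5.0)))
      .show()

    spark.stop()
  }
}
```

A full design along the paper's lines would presumably carry the point geometry in a single typed UDT column rather than separate coordinate columns, so the predicate logic moves into UDFs operating on that type.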

Highlights

  • The development of satellite remote sensing, global navigation systems, aerial photogrammetric cameras, sensor networks, radar remote sensing and laser scanning has contributed to an exponential increase in the amount of collected geospatial data [1]

  • This paper presents a method for point cloud management in a distributed environment based on the Apache Spark framework

  • Research on this topic is increasing every year, and software products for geospatial big data are developing at a fast pace


Introduction

The development of satellite remote sensing, global navigation systems, aerial photogrammetric cameras, sensor networks, radar remote sensing and laser scanning has contributed to an exponential increase in the amount of collected geospatial data [1]. The amount of collected data largely exceeds what can be stored on individual computers and requires storage on a computer cluster. Apache Spark introduces a new platform for distributed processing that runs on a cluster resource manager such as Mesos or YARN. It is designed to maintain the scalability and fault tolerance of MapReduce programs, and to support applications for which MapReduce was inefficient. This is achieved through its in-memory execution model.
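
The snippet below is a minimal Scala sketch of that in-memory execution model: a dataset is cached once and reused by several actions, avoiding the repeated disk I/O of a MapReduce-style pipeline. The input path, schema, and column names are hypothetical.

```scala
import org.apache.spark.sql.SparkSession

object InMemorySketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("in-memory-execution")
      .master("local[*]")   // on a cluster: YARN or Mesos instead
      .getOrCreate()

    // Hypothetical point-cloud file; path and schema are assumptions.
    val points = spark.read
      .option("header", "true")
      .option("inferSchema", "true")
      .csv("hdfs:///data/points.csv")
      .cache()   // keep partitions in executor memory after the first action

    // Both actions reuse the cached partitions; only the first one
    // actually reads the file from storage.
    println(s"total points: ${points.count()}")
    points.describe("x", "y", "z").show()

    spark.stop()
  }
}
```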

