USING SYSTEMS OF PARALLEL AND DISTRIBUTED DATA PROCESSING TO BUILD HYDROLOGICAL MODELS BASED ON REMOTE SENSING DATA

A A Kolesnikov,P M Kikin,E A Panidi,A G Rusina

doi:10.5194/isprs-archives-xliii-b4-2021-111-2021

Abstract

Abstract. The article describes the possibilities and advantages of using distributed systems in the processing and analysis of remote sensing data. The preparation and processing of various types of remote sensing data (multispectral satellite images, values of climatic indicators, elevation data), which will then be used to build a simulation model of a hydroelectric power plant, was chosen as the basic task for testing the chosen approach. The existing approaches with distributed processing of spatial data of various types (vector cartographic objects, raster data, point clouds, graphs) are analyzed. The description of the developed approach is given and the rationale for the choice of its components is made. The preprocessing operations that were performed on the used raster data are described. An approach to the problems of raster data segmentation based on libraries for distributed machine learning is considered. Comparison of the speed of working with data for various algorithms of machine learning and processing is given.

Highlights

Geospatial and remote sensing data, due to their very large volume, variety and speed of updating, are one of the main elements of the big data concept
Traditional approaches use the power of computing stations to process data, but at the same time they can only scale vertically and at some point physically cannot cope with the continuous growth of the volume of processed data [7-9]. This problem is most often solved with the help of parallel and distributed processing technologies, which implement the simultaneous processing of each of the parts of the entire data set on a separate node and the combination of intermediate results into the final one [11–13]
The problems facing the authors of the article of predicting spread of tropical diseases, building simulation models of hydroelectric power plants, building databases of natural resource potential require the processing of large volumes of constantly updated remote sensing data on the territory of individual regions and countries in general

Summary

Introduction

Geospatial and remote sensing data, due to their very large volume, variety and speed of updating, are one of the main elements of the big data concept. Traditional approaches use the power of computing stations to process data, but at the same time they can only scale vertically (which is always costly and the capabilities are severely limited by the hardware platform) and at some point physically cannot cope with the continuous growth of the volume of processed data [7-9]. This problem is most often solved with the help of parallel and distributed processing technologies, which implement the simultaneous processing of each of the parts of the entire data set on a separate node and the combination of intermediate results into the final one [11–13]. To do this, it was necessary to analyze the existing open-source software for distributed processing of spatial data, determine their features, advantages, disadvantages, and evaluate the effectiveness of their application on certain datasets

Objectives

Methods

Results

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

USING SYSTEMS OF PARALLEL AND DISTRIBUTED DATA PROCESSING TO BUILD HYDROLOGICAL MODELS BASED ON REMOTE SENSING DATA

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences

Lead the way for us

Journal: The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences	Publication Date: Jun 30, 2021
License type: CC BY 4.0

Similar Papers

Machine learning algorithms for Big Data analytics including deep learning
Shaveta Malik ... Rohit Sahoo
-
Shaveta Malik, et. al.Shaveta Malik ... Rohit Sahoo
24 Aug 2022
24 Aug 2022

Machine and deep learning algorithms for classifying different types of dementia: A literature review
Masoud Noroozi ... Niloofar Deravi
Applied Neuropsychology: Adult | VOL. ahead-of-print
Masoud Noroozi, et. al.Masoud Noroozi ... Niloofar Deravi
31 Jul 2024
Applied Neuropsychology: Adult | VOL. ahead-of-print

Confirming the statistically significant superiority of tree-based machine learning algorithms over their counterparts for tabular data.
Haohui Lu ... Shahadat Uddin
PLOS ONE | VOL. 19
Haohui Lu, et. al.Haohui Lu ... Shahadat Uddin
18 Apr 2024
PLOS ONE | VOL. 19

Investigating Machine Learning as a Basis for Asteroid Taxnomies in the 3-Micron Spectral Region
Matthew Richardson ... Andrew Rivkin
-
Matthew Richardson, et. al.Matthew Richardson ... Andrew Rivkin
08 Oct 2020
08 Oct 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

USING SYSTEMS OF PARALLEL AND DISTRIBUTED DATA PROCESSING TO BUILD HYDROLOGICAL MODELS BASED ON REMOTE SENSING DATA

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences