Abstract

Historical earth observation (EO) data have played an important role in long-term scientific and environmental monitoring. The effective organization of large-scale and long-term remote-sensing data to achieve efficient retrieval and access has become one of the important issues. However, inherent big data characteristics, such as a large scale, and asymmetrical temporal and spatial distributions, have caused problems with the efficiency of data retrieval and access. Therefore, this study proposes an efficient data organization method for use in a cloud-computing environment that has two aims. First, it addresses the problem of low retrieval efficiency. An asymmetrical index model for the image metadata is constructed that is based on a unified spatio-temporal grid coding; a prepartitioning mechanism under the HBase architecture is established to realize the uniform storage of the metadata with an asymmetrical spatiotemporal distribution and to avoid retrieval efficiency bottlenecks caused by a load imbalance. Second, it addresses low access efficiency. By dividing the remote-sensing image into tiles, a unified spatio-temporal code is established for each tile, and a consistent hash operation is performed; tiles with similar hash values are stored in the same or adjacent Hadoop Distributed File System nodes. In this way, tiles with temporal or spatial correlations can be gathered and stored, and lots of disk seeks can be avoided during retrieval, thereby greatly improving the data access efficiency. Comparative experiments showed that the data organization method can effectively improve the retrieval and access efficiencies of large-scale and long time-series remote-sensing data in a cloud-computing environment.

Highlights

  • L Arge-scale and long time-series remote-sensing data will inevitably produce repeated observations of the same area, providing sufficient and usable data sources for the detection of land cover changes [1], urban expansion analysis [2], and environmental pollution prevention and control [3]

  • The high-efficiency organization model of large-scale and long-term remote-sensing data in a cloud-computing environment proposed in this study eliminates the metadata differences between different satellite sensors by constructing a unified metadata field, eliminates the differences in the spatial reference and storage format of multi-source remote-sensing data through DataCube physical segmentation and format conversion, constructs a GeoSOT-ST-based asymmetrical spatiotemporal index to optimize the metadata storage strategy in HBase, and realize a distributed storage optimization of the remote-sensing tiles based on a consistent hash coding of the spatiotemporal information

  • Through the massive metadata retrieval experiment in HBase and the remote-sensing tile reading and writing experiments in HDFS, it was demonstrated that the proposed data organization method can effectively improve the retrieval and access efficiency of large-scale and long-term remote-sensing data

Read more

Summary

Introduction

L Arge-scale and long time-series remote-sensing data will inevitably produce repeated observations of the same area, providing sufficient and usable data sources for the detection of land cover changes [1], urban expansion analysis [2], and environmental pollution prevention and control [3]. The massive, multi-source, heterogeneous, and long time-series characteristics of remote-sensing data bring great challenges to data organization and management, such as making retrieval and access efficiency low, and not convenient to comprehensively and in-depth analyze the hidden information from multiple dimensions and multiple angles. In terms of the above problems, some scholars have proposed carrying out geometric and radiation normalization on (Corresponding author: Lizhe Wang) J. Liu are with the School of Computer Science, China University of Geosciences, Wuhan, 430074, China and with Hubei Key Laboratory of Intelligent Geo-Information Processing, China University of Geosciences, Wuhan 430074, China

Objectives
Methods
Findings
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call