Abstract

Stereo matching of high-resolution satellite images (HRSI) is still a fundamental but challenging task in the field of photogrammetry and remote sensing. Recently, deep learning (DL) methods have demonstrated the potential for stereo matching on public benchmark datasets. Currently, mainstream stereo matching models depend on quantities of training data with ground truth. However, datasets for stereo matching of satellite images are scarce, which profoundly blocks the application of DL in this field. To facilitate further research, this paper publishes a large-scale dataset, termed WHU-Stereo, for stereo matching DL network training and testing. This dataset is created by using airborne LiDAR point clouds and high-resolution stereo imageries taken from the Chinese GaoFen-7 satellite (GF-7). Occlusions should be seriously considered in the generation of ground-truth disparities. We propose an occlusion removal technique, which can adapt to point clouds with different densities and shows high potential in training data preparation. The WHU-Stereo dataset contains more than 1700 epipolar rectified image pairs, which cover six areas in China and includes various kinds of landscapes. We have assessed the accuracy of ground-truth disparity maps, and it is shown that our dataset achieves comparable precision compared with existing state-of-the-art stereo matching datasets. To verify its feasibility, in experiments, the hand-crafted SGM algorithm and recent DL networks have been tested on the dataset. Experimental results show that the WHU-Stereo dataset can serve as a challenging benchmark for stereo matching of high-resolution satellite images and performance evaluation of deep learning models. Our dataset is available at https://github.com/Sheng029/WHU-Stereo.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call