Stereo Image Pairs Research Articles

With the emergence of the Smart City concept, rapid progress in urban three-dimensional (3D) reconstruction has become imperative. Although current 3D reconstruction techniques can already generate products such as Digital Surface Models (DSM), challenges persist in accurately reconstructing shadowed areas, handling occlusions, and addressing low-texture areas in very-high-resolution remote sensing images. These challenges make it difficult for existing stereo matching methods to compute satisfactory disparity maps, which reduces the accuracy of 3D reconstruction. The issue is particularly pronounced in urban scenes, which contain numerous super high-rise and densely distributed buildings; these produce large disparity values and occluded regions in stereo image pairs, and in turn a large number of mismatched points in the resulting disparity map. In response to these challenges, this paper proposes a method to refine the disparity in urban scenes based on open-source GIS data. First, we register the GIS data with the epipolar-rectified images, since non-negligible geolocation errors always exist between them. Specifically, buildings of different heights exhibit different offsets during GIS data registration; thus, we perform multi-modal matching for each building and merge the results into the final building mask. Subsequently, a two-layer optimization process, comprising global and local optimization, is applied to the initial disparity map based on the building mask. Finally, we perform a post-correction on the building facades to obtain the final refined disparity map, which can be employed for high-precision 3D reconstruction. Experimental results on SuperView-1, GaoFen-7, and GeoEye satellite images show that the proposed method can correct the occluded and mismatched areas in the initial disparity map generated by both hand-crafted and deep-learning stereo matching methods. The DSM generated from the refined disparity reduces the average height error from 2.2 m to 1.6 m, which demonstrates superior performance compared with other disparity refinement methods. Furthermore, the proposed method improves the integrity of the target structure and yields steeper building facades and complete roofs, which are conducive to subsequent 3D model generation.
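The idea of using a building mask to repair a disparity map can be illustrated with a deliberately simplified sketch. This is not the two-layer optimization described in the abstract. It assumes flat roofs and simply replaces unreliable disparities inside each building footprint with the median of that building's reliable disparities. The function name `fill_building_disparity`, the label convention (0 for background), and the `valid` mask are all hypothetical choices made for illustration.

```python
import numpy as np


def fill_building_disparity(disparity, building_labels, valid):
    """Fill unreliable disparities inside each building with a per-building median.

    disparity       : 2D float array, initial disparity map
    building_labels : 2D int array, 0 = background, k > 0 = building id (assumed convention)
    valid           : 2D bool array, True where the initial disparity is trusted
    """
    disparity = np.asarray(disparity, dtype=np.float64)
    building_labels = np.asarray(building_labels)
    valid = np.asarray(valid, dtype=bool)

    refined = disparity.copy()
    for label in np.unique(building_labels):
        if label == 0:
            continue  # leave background pixels untouched
        inside = building_labels == label
        trusted = inside & valid
        if not trusted.any():
            continue  # nothing reliable to propagate for this building
        # Flat-roof assumption: one representative disparity per building.
        refined[inside & ~valid] = np.median(disparity[trusted])
    return refined


if __name__ == "__main__":
    disp = np.array([[10.0, 11.0, 2.0],
                     [12.0, -1.0, 2.0]])
    labels = np.array([[1, 1, 0],
                       [1, 1, 0]])
    ok = np.array([[True, True, True],
                   [True, False, True]])
    print(fill_building_disparity(disp, labels, ok))
```

A per-building median is robust to a few outliers but cannot represent sloped roofs or facades, which is one reason a real refinement method needs the additional optimization and facade correction steps the abstract mentions.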

With the development of remote sensing satellite technology for Earth observation, remote sensing stereo images have been used for three-dimensional reconstruction in various fields, such as urban planning and construction. However, remote sensing images often contain noise, occluded regions, untextured areas, and repeated textures, which can reduce the accuracy of stereo matching and degrade the quality of 3D reconstruction results. To reduce the impact of complex scenes in remote sensing images on stereo matching and to ensure both speed and accuracy, we propose a new end-to-end stereo matching network based on convolutional neural networks (CNNs). The proposed network learns features at different scales from the original images and constructs cost volumes at multiple scales to obtain richer scale information. Additionally, when constructing the cost volume, we include negative disparities, because remote sensing stereo image pairs commonly contain both negative and non-negative disparities. For cost aggregation, we employ a 3D convolution-based encoder–decoder structure that allows the network to adaptively aggregate information. Before feature aggregation, we also introduce an attention module to retain more valuable feature information, enhance feature representation, and obtain a higher-quality disparity map. Trained on the publicly available US3D dataset, the network achieves an end-point error (EPE) of 1.115 pixels and an error pixel ratio (D1) of 5.32% on the test dataset, with an inference time of 92 ms. Compared with existing state-of-the-art models, our model achieves higher accuracy, and the network is beneficial for the three-dimensional reconstruction of remote sensing images.
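The two reported metrics can be computed as in the minimal sketch below. EPE is the mean absolute disparity error over valid pixels. For D1, the sketch assumes the error pixel ratio is the fraction of valid pixels whose absolute error exceeds 3 pixels; the abstract does not state the threshold, and some benchmarks add a relative-error condition, so `threshold` is left as a parameter. The function name `epe_and_d1` is hypothetical.

```python
import numpy as np


def epe_and_d1(pred, gt, valid=None, threshold=3.0):
    """Return (EPE in pixels, D1 as a fraction) for a predicted disparity map.

    pred, gt  : arrays of the same shape; disparities may be negative
    valid     : optional bool mask of pixels with ground truth
                (defaults to finite ground-truth values)
    threshold : error in pixels above which a pixel counts as wrong
                (3.0 is an assumption, not taken from the abstract)
    """
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    if valid is None:
        valid = np.isfinite(gt)
    err = np.abs(pred - gt)[valid]          # absolute error on valid pixels only
    epe = float(err.mean())                 # end-point error
    d1 = float((err > threshold).mean())    # share of pixels over the threshold
    return epe, d1


if __name__ == "__main__":
    pred = np.array([[-5.0, 2.0], [10.0, 0.0]])
    gt = np.array([[-4.0, 2.0], [6.0, 0.0]])
    epe, d1 = epe_and_d1(pred, gt)
    print(f"EPE = {epe:.3f} px, D1 = {100 * d1:.2f}%")
```

Because both metrics use the absolute difference, they apply unchanged to image pairs that mix negative and non-negative disparities.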

Articles published on Stereo Image Pairs

360° Stereo Image Composition With Depth Adaption.

Self‐supervised binocular depth estimation algorithm with self‐rectification for autonomous driving

Refining disparity maps using deep learning and edge-aware smoothing filter

CVGSR: Stereo image Super-Resolution with Cross-View guidance

A road surface reconstruction dataset for autonomous driving

Stereo matching from monocular images using feature consistency

A dense matching method for remote sensing images fused with CPS denoising

Observation of Chiral Channels in Helical Covalent Organic Frameworks.

Modeling Stereo-Confidence out of the End-to-End Stereo-Matching Network via Disparity Plane Sweep

Irregular boundaries stereo images dataset creating using depth estimation model

Bridging local and global representations for self-supervised monocular depth estimation

ERROR ANALYSIS OF VISUAL ODOMETRY FOR A SMALL SIZE UNMANNED AERIAL VEHICLE

Deformation mechanisms and their role in the lack of ductility in the refractory-based high entropy alloy AlMo0.5NbTa0.5TiZr

End-to-End Edge-Guided Multi-Scale Matching Network for Optical Satellite Stereo Image Pairs

Multi-Scale Interaction Network for Low-Light Stereo Image Enhancement

Disparity Refinement for Stereo Matching of High-Resolution Remote Sensing Images Based on GIS Data

Stereo Matching Method for Remote Sensing Images Based on Attention and Scale Fusion

Disparity Computation With Low Intensity Quantization on Stereo Image Pairs

Dense Matching With Optimized Penalty and Interpolation for High-Resolution Optical Stereo Image Pairs

Stereo Superpixel Segmentation via Decoupled Dynamic Spatial-Embedding Fusion Network
