Untextured Areas Research Articles

With the development of remote sensing satellite technology for Earth observation, remote sensing stereo images have been used for three-dimensional reconstruction in various fields, such as urban planning and construction. However, remote sensing images often contain noise, occluded regions, untextured areas, and repeated textures, which can lead to reduced accuracy in stereo matching and affect the quality of 3D reconstruction results. To reduce the impact of complex scenes in remote sensing images on stereo matching and to ensure both speed and accuracy, we propose a new end-to-end stereo matching network based on convolutional neural networks (CNNs). The proposed stereo matching network can learn features at different scales from the original images and construct cost volumes with varying scales to obtain richer scale information. Additionally, when constructing the cost volume, we introduce negative disparity to adapt to the common occurrence of both negative and non-negative disparities in remote sensing stereo image pairs. For cost aggregation, we employ a 3D convolution-based encoder–decoder structure that allows the network to adaptively aggregate information. Before feature aggregation, we also introduce an attention module to retain more valuable feature information, enhance feature representation, and obtain a higher-quality disparity map. By training on the publicly available US3D dataset, we obtain an accuracy of 1.115 pixels in end-point error (EPE) and 5.32% in the error pixel ratio (D1) on the test dataset, and the inference speed is 92 ms. Comparing our model with existing state-of-the-art models, we achieve higher accuracy, and the network is beneficial for the three-dimensional reconstruction of remote sensing images.

Read full abstract

Stereo matching is the key technology in stereo vision. Given a pair of rectified images, stereo matching determines correspondences between the pair images and estimate depth by obtaining disparity between corresponding pixels. The current work has shown that depth estimation from a stereo pair of images can be formulated as a supervised learning task with an end‐to‐end frame based on convolutional neural networks (CNNs). However, 3D CNN puts a great burden on memory storage and computation, which further leads to the significantly increased computation time. To alleviate this issue, atrous convolution was proposed to reduce the number of convolutional operations via a relatively sparse receptive field. However, this sparse receptive field makes it difficult to find reliable corresponding points in fuzzy areas, e.g., occluded areas and untextured areas, owing to the loss of rich contextual information. To address this problem, we propose the Group‐based Atrous Convolution Spatial Pyramid Pooling (GASPP) to robustly segment objects at multiple scales with affordable computing resources. The main feature of the GASPP module is to set convolutional layers with continuous dilation rate in each group, so that it can reduce the impact of holes introduced by atrous convolution on network performance. Moreover, we introduce a tailored cascade cost volume in a pyramid form to reduce memory, so as to meet real‐time performance. The group‐based atrous convolution stereo matching network is evaluated on the street scene benchmark KITTI 2015 and Scene Flow and achieves state‐of‐the‐art performance.

Read full abstract

Untextured Areas Research Articles

Related Topics

Articles published on Untextured Areas

Multiple prior representation learning for self-supervised monocular depth estimation via hybrid transformer

Stereo Matching Method for Remote Sensing Images Based on Attention and Scale Fusion

IMPROVING PAIRWISE DSM WITH 3SGM: A SEMANTIC SEGMENTATION FOR SGM USING AN AUTOMATICALLY REFINED NEURAL NETWORK

Instant Panoramic Texture Mapping with Semantic Object Matching for Large-Scale Urban Scene Reproduction.

Group‐Based Atrous Convolution Stereo Matching Network

Semi-dense and robust image registration by shift adapted weighted aggregation and variational completion

Stereo Correspondence with Occlusion Handling in a Symmetric Patch-Based Graph-Cuts Model

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Untextured Areas Research Articles

Related Topics

Articles published on Untextured Areas

Multiple prior representation learning for self-supervised monocular depth estimation via hybrid transformer

Stereo Matching Method for Remote Sensing Images Based on Attention and Scale Fusion

IMPROVING PAIRWISE DSM WITH 3SGM: A SEMANTIC SEGMENTATION FOR SGM USING AN AUTOMATICALLY REFINED NEURAL NETWORK

Instant Panoramic Texture Mapping with Semantic Object Matching for Large-Scale Urban Scene Reproduction.

Group‐Based Atrous Convolution Stereo Matching Network

Semi-dense and robust image registration by shift adapted weighted aggregation and variational completion

Stereo Correspondence with Occlusion Handling in a Symmetric Patch-Based Graph-Cuts Model