Accurate Disparity Estimation Research Articles

AbstractRecent studies have demonstrated that deep learning‐based stereo matching methods (DLSMs) can far exceed conventional ones on most benchmark datasets by both improving visual performance and decreasing the mismatching rate. However, applying DLSMs on high‐resolution satellite stereos with broad image coverage and wide terrain variety is still challenging. First, the broad coverage of satellite stereos brings a wide disparity range, while DLSMs are limited to a narrow disparity range in most cases, resulting in incorrect disparity estimation in areas with contradictory disparity ranges. Second, high‐resolution satellite stereos always comprise various terrain types, which is more complicated than carefully prepared datasets. Thus, the performance of DLSMs on satellite stereos is unstable, especially for intractable regions such as texture‐less and occluded regions. Third, generating DSMs requires occlusion‐aware disparity maps, while traditional occlusion detection methods are not always applicable for DLSMs with continuous disparity. To tackle these problems, this paper proposes a novel DLSM‐based DSM generation workflow. The workflow comprises three steps: pre‐processing, disparity estimation and post‐processing. The pre‐processing step introduces low‐resolution terrain to shift unmatched disparity ranges into a fixed scope and crops satellite stereos to regular patches. The disparity estimation step proposes a hybrid feature fusion network (HF2Net) to improve the matching performance. In detail, HF2Net designs a cross‐scale feature extractor (CSF) and a multi‐scale cost filter. The feature extractor differentiates structural‐context features in complex scenes and thus enhances HF2Net's robustness to satellite stereos, especially on intractable regions. The cost filter filters out most matching errors to ensure accurate disparity estimation. The post‐processing step generates initial DSM patches with estimated disparity maps and then refines them for the final large‐scale DSMs. Primary experiments on the public US3D dataset showed better accuracy than state‐of‐the‐art methods, indicating HF2Net's superiority. We then created a self‐made Gaofen‐7 dataset to train HF2Net and conducted DSM generation experiments on two Gaofen‐7 stereos to further demonstrate the effectiveness and practical capability of the proposed workflow.

Read full abstract

3D object detection is an essential task in autonomous driving and robotics. Though great progress has been made, challenges remain in estimating 3D pose for distant and occluded objects. In this paper, we present a novel framework named ZoomNet for stereo imagery-based 3D detection. The pipeline of ZoomNet begins with an ordinary 2D object detection model which is used to obtain pairs of left-right bounding boxes. To further exploit the abundant texture cues in rgb images for more accurate disparity estimation, we introduce a conceptually straight-forward module – adaptive zooming, which simultaneously resizes 2D instance bounding boxes to a unified resolution and adjusts the camera intrinsic parameters accordingly. In this way, we are able to estimate higher-quality disparity maps from the resized box images then construct dense point clouds for both nearby and distant objects. Moreover, we introduce to learn part locations as complementary features to improve the resistance against occlusion and put forward the 3D fitting score to better estimate the 3D detection quality. Extensive experiments on the popular KITTI 3D detection dataset indicate ZoomNet surpasses all previous state-of-the-art methods by large margins (improved by 9.4% on APbv (IoU=0.7) over pseudo-LiDAR). Ablation study also demonstrates that our adaptive zooming strategy brings an improvement of over 10% on AP3d (IoU=0.7). In addition, since the official KITTI benchmark lacks fine-grained annotations like pixel-wise part locations, we also present our KFG dataset by augmenting KITTI with detailed instance-wise annotations including pixel-wise part location, pixel-wise disparity, etc.. Both the KFG dataset and our codes will be publicly available at https://github.com/detectRecog/ZoomNet.

Read full abstract

Accurate Disparity Estimation Research Articles

Articles published on Accurate Disparity Estimation

OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation.

Digital surface model generation from high‐resolution satellite stereos based on hybrid feature fusion network

Bidirectional Semi-supervised Dual-branch CNN for Robust 3D Reconstruction of Stereo Endoscopic Images via Adaptive Cross and Parallel Supervisions.

Light-field spectral decomposition with a spatial–angular consistency prior for disparity estimation

CGFNet: 3D Convolution Guided and Multi-scale Volume Fusion Network for fast and robust stereo matching

Multi-scale graph neural network for global stereo matching

Adaptive Recurrent Iterative Updating Stereo Matching Network

A Confidence-Aware Cascade Network for Multi-Scale Stereo Matching of Very-High-Resolution Remote Sensing Images

Sparse LIDAR Measurement Fusion with Joint Updating Cost for Fast Stereo Matching

Deep Event Stereo Leveraged by Event-to-Image Translation

Shape Prior Guided Instance Disparity Estimation for 3D Object Detection.

ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection

A Novel Stereo Matching Algorithm for Digital Surface Model (DSM) Generation in Water Areas

Fast and Accurate 3D Measurement Based on Light-Field Camera and Deep Learning.

A Low-Cost Real-Time Embedded Stereo Vision System for Accurate Disparity Estimation Based on Guided Image Filtering

Accurate disparity estimation in light field using ground control points

Content-Based Guided Image Filtering, Weighted Semi-Global Optimization, and Efficient Disparity Refinement for Fast and Accurate Disparity Estimation

PM-PM: PatchMatch with Potts Model for object segmentation and stereo matching.

Cross-Based Local Stereo Matching Using Orthogonal Integral Images

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Accurate Disparity Estimation Research Articles

Articles published on Accurate Disparity Estimation

OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation.

Digital surface model generation from high‐resolution satellite stereos based on hybrid feature fusion network

Bidirectional Semi-supervised Dual-branch CNN for Robust 3D Reconstruction of Stereo Endoscopic Images via Adaptive Cross and Parallel Supervisions.

Light-field spectral decomposition with a spatial–angular consistency prior for disparity estimation

CGFNet: 3D Convolution Guided and Multi-scale Volume Fusion Network for fast and robust stereo matching

Multi-scale graph neural network for global stereo matching

Adaptive Recurrent Iterative Updating Stereo Matching Network

A Confidence-Aware Cascade Network for Multi-Scale Stereo Matching of Very-High-Resolution Remote Sensing Images

Sparse LIDAR Measurement Fusion with Joint Updating Cost for Fast Stereo Matching

Deep Event Stereo Leveraged by Event-to-Image Translation

Shape Prior Guided Instance Disparity Estimation for 3D Object Detection.

ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection

A Novel Stereo Matching Algorithm for Digital Surface Model (DSM) Generation in Water Areas

Fast and Accurate 3D Measurement Based on Light-Field Camera and Deep Learning.

A Low-Cost Real-Time Embedded Stereo Vision System for Accurate Disparity Estimation Based on Guided Image Filtering

Accurate disparity estimation in light field using ground control points

Content-Based Guided Image Filtering, Weighted Semi-Global Optimization, and Efficient Disparity Refinement for Fast and Accurate Disparity Estimation

PM-PM: PatchMatch with Potts Model for object segmentation and stereo matching.

Cross-Based Local Stereo Matching Using Orthogonal Integral Images