Quality Of Depth Map Research Articles

Depth images and thermal images carry spatial geometry and surface temperature information, respectively, which complements the RGB modality. However, depth and thermal images are often unreliable in challenging scenarios, which degrades the performance of two-modal salient object detection (SOD). Meanwhile, some researchers have turned to the triple-modal SOD task, namely visible-depth-thermal (VDT) SOD, which explores the complementarity of the RGB, depth, and thermal images. However, existing triple-modal SOD methods do not perceive the quality of depth maps and thermal images, so their performance degrades on scenes with low-quality depth and thermal images. Therefore, in this paper, we propose a quality-aware selective fusion network (QSF-Net) for VDT salient object detection. It contains three subnets: the initial feature extraction subnet, the quality-aware region selection subnet, and the region-guided selective fusion subnet. First, besides extracting features, the initial feature extraction subnet generates a preliminary prediction map from each modality via a shrinkage pyramid architecture equipped with the multi-scale fusion (MSF) module. Then, we design the weakly-supervised quality-aware region selection subnet to generate the quality-aware maps. Concretely, we first find the high-quality and low-quality regions by using the preliminary predictions; these regions constitute the pseudo label used to train this subnet. Finally, the region-guided selective fusion subnet purifies the initial features under the guidance of the quality-aware maps, then fuses the triple-modal features and refines the edge details of prediction maps through the intra-modality and inter-modality attention (IIA) module and the edge refinement (ER) module, respectively. Extensive experiments are performed on the VDT-2048 dataset, and the results show that our saliency model consistently outperforms 13 state-of-the-art methods by a large margin. Our code and results are available at https://github.com/Lx-Bao/QSFNet.
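
The abstract does not say how the high-quality and low-quality regions are derived from the preliminary predictions. As a rough illustration only, the sketch below assumes one plausible rule: a pixel counts as high quality for a modality when that modality's preliminary prediction is close to the ground-truth saliency mask, low quality when it is far from it, and is ignored otherwise. The function name `quality_pseudo_label` and both thresholds are hypothetical and are not taken from QSF-Net.

```python
import numpy as np

def quality_pseudo_label(pred, gt, hi_thresh=0.2, lo_thresh=0.8):
    """Build a per-pixel quality pseudo label for one modality.

    ASSUMPTION (not from the paper): quality is judged by the absolute
    error between the preliminary prediction and the ground-truth mask.
      1  -> high-quality pixel (error <= hi_thresh)
      0  -> low-quality pixel  (error >= lo_thresh)
     -1  -> ambiguous pixel, to be ignored during training
    """
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    if pred.shape != gt.shape:
        raise ValueError("pred and gt must have the same shape")

    err = np.abs(pred - gt)
    label = np.full(pred.shape, -1, dtype=np.int8)  # start as 'ignore'
    label[err <= hi_thresh] = 1
    label[err >= lo_thresh] = 0
    return label

# Toy 2x2 example: saliency probabilities from one modality vs. the mask.
pred = [[0.9, 0.1],
        [0.5, 0.05]]
gt = [[1, 1],
      [1, 0]]
label = quality_pseudo_label(pred, gt)
```

In this toy case the top-left and bottom-right pixels are marked high quality, the top-right pixel low quality, and the bottom-left pixel is left out of the pseudo label.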

Using joint depth estimation and target detection networks to locate targets in the UAV field of view is a novel application in depth estimation research. Ultra-low altitude oblique photographic images contain more depth variation and more low-texture regions than autonomous driving scenes, which makes it harder to train a good depth estimation network on them. This study investigates unsupervised monocular depth estimation for ultra-low altitude oblique photography images, with the aim of letting subsequent advanced vision tasks benefit from better depth estimates in complex scenes. The lack of effective back-projection directionality in training using adjacent frames is attributed to the extensive low-textured areas in the training data for complex ultra-low altitude oblique photography. To deal with this problem, we propose a self-supervised scene-aware refinement learning architecture from the perspective of enhancing feature perception. The architecture consists of a multi-resolution feature fusion depth network and a perceptual refinement network (PRNet), together with a pose network; it enhances regional differences in complex environments from a refined feature context perspective to obtain higher quality depth maps. We rethink the problem of depth information recovery and design the edge information aggregation (EIA) module, which is placed in the decoder to refine the depth detail representation of local regions. We design several loss terms to constrain the training of the network and improve the quality of depth estimation. Our method is compared with six state-of-the-art self-supervised monocular depth estimation methods on three datasets (UAVid 2020, WildUAV, UAV ula). The experimental results demonstrate that our model achieves the best performance in most scenarios. The code and the private dataset (UAV ula) are publicly available at https://github.com/takisu0916/MRFEDepth.
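
The point about low-textured areas giving adjacent-frame training little direction can be seen in a toy setting. The sketch below is not the paper's loss. It assumes a 1D image row, a circular pixel shift standing in for view warping, and a plain mean absolute photometric error; all names are hypothetical. On a textured row the error is zero only at the true shift, while on a flat row it is zero for every candidate shift, so the error says nothing about which shift (and hence which depth) is right.

```python
import numpy as np

def photometric_error(target, source, shift):
    """Mean absolute difference between `target` and `source` shifted by
    `shift` pixels. A circular shift is a simplifying stand-in for the
    depth- and pose-based warping used in real self-supervised training."""
    warped = np.roll(source, shift)
    return float(np.mean(np.abs(target - warped)))

def error_profile(target, source, shifts):
    """Photometric error for each candidate shift."""
    return [photometric_error(target, source, s) for s in shifts]

true_shift = 2
shifts = list(range(5))

# Textured row: intensities vary from pixel to pixel.
textured_src = np.array([0.0, 1.0, 4.0, 2.0, 7.0, 3.0, 9.0, 5.0])
textured_tgt = np.roll(textured_src, true_shift)

# Low-texture row: constant intensity.
flat_src = np.full(8, 0.5)
flat_tgt = np.roll(flat_src, true_shift)

textured_profile = error_profile(textured_tgt, textured_src, shifts)
flat_profile = error_profile(flat_tgt, flat_src, shifts)

# The textured profile has a unique minimum at the true shift;
# the flat profile is identical for every shift.
best_textured_shift = shifts[int(np.argmin(textured_profile))]
```

Here `best_textured_shift` recovers the true shift of 2, whereas every entry of `flat_profile` is zero and no shift is preferred.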

Articles published on Quality Of Depth Map

On Alpha-Expansion-Based Graph-Cut Optimization for Decoder-Side Depth Estimation

Multi-view depth estimation based on multi-feature aggregation for 3D reconstruction

Unsupervised Domain Adaptation Depth Estimation Based on Self-attention Mechanism and Edge Consistency Constraints

Lifelong-MonoDepth: Lifelong Learning for Multidomain Monocular Metric Depth Estimation

Quality-Aware Selective Fusion Network for V-D-T Salient Object Detection

Depth map super-resolution via learned nonlocal model and enhanced local regularization

Scene-aware refinement network for unsupervised monocular depth estimation in ultra-low altitude oblique photography of UAV

Multiscale and multidirection depth map super resolution with semantic inference

Immersive Video Postprocessing for Efficient Video Coding

Polarimetric monocular leaf normal estimation model for plant phenotyping

DDL-MVS: Depth Discontinuity Learning for Multi-View Stereo Networks

EGA-Net: Edge feature enhancement and global information attention network for RGB-D salient object detection

Depth Map Prediction of Occluded Objects Using Structure Tensor with Gain Regularization

3D Vision Using Multiple Structured Light-Based Kinect Depth Cameras

Dynamic Knowledge Distillation with Noise Elimination for RGB-D Salient Object Detection

Monocular Depth Prediction in Photogrammetric Applications

OutCast: Outdoor Single‐image Relighting with Cast Shadows

Energy minimization for image focus volume in shape from focus

PDR-Net: Progressive depth reconstruction network for color guided depth map super-resolution

RGBD salient object detection based on depth feature enhancement
