Unsupervised Monocular Training Method for Depth Estimation Using Statistical Masks

Xiangtong Wang,Peng Cheng,Wei Li,Binbin Liang,Menglong Yang

doi:10.1109/access.2020.3032582

Abstract

Recently, unsupervised monocular training methods based on convolutional neural networks have already shown surprisingly progress in improving the accuracy of depth estimation. However, the performance of these methods suffers deeply from problematic pixels such as occluded pixels, low-texture pixels, and so on. In this paper, we introduce a method to a mask by the statistic of error maps for segmenting the problematic pixels. Different from the conventional methods which use additional segmentation networks to classify problematic pixels, we use a multi-task learning architecture to generate identical mask, mean mask, and variance mask for filtering the problematic pixels. Experimental results show that our proposed method has satisfactory performance compared with other relative methods on the KITTI dataset. Moreover, we also apply our method to the UAV dataset VisDrone, and the results also indicate the effectiveness of the method in detecting moving objects.

Highlights

I FERRING the accurate depth information from a single image has potential applications in 3D reconstruction, robotics, scene understanding, etc
Impressive progress has been made in improving the performance of monocular depth estimation through a color image by training a deep network
Since our statistical masks are mainly obtained on error maps, we introduce the concept of error vectors

Summary

Introduction

I FERRING the accurate depth information from a single image has potential applications in 3D reconstruction, robotics, scene understanding, etc. Unlike the stereo vision methods which can infer disparity from more images in different viewpoints, monocular depth estimation is an ill-posed and inherently ambiguous problem [1]. Impressive progress has been made in improving the performance of monocular depth estimation through a color image by training a deep network. Several self-supervised approaches have been proposed to train monocular depth estimation models using only synchronized stereo pairs [2] or monocular video [3]. Monocular video is an attractive alternative to stereobased supervision due to its more accessible training data. A pose estimation network is necessary to train the depth estimation model and to constitute the minimum learning framework for monocular training methods. The bottleneck of the unsupervised monocular training methods is very obvious: if the depth map of the target frame is well estimated, the most of pixels in the target frame will be better matched in the synthesized frame after the warp, but there are still a large number of pixels that

Objectives

Methods

Findings

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 23	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Unsupervised Monocular Training Method for Depth Estimation Using Statistical Masks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Multiscale space-time-frequency feature-guided multitask learning CNN for motor imagery EEG classification
Xiuling Liu ... Peng Xiong
Journal of Neural Engineering | VOL. 18
Xiuling Liu, et. al.Xiuling Liu ... Peng Xiong
24 Feb 2021
Journal of Neural Engineering | VOL. 18

Gated ensemble of spatio-temporal mixture of experts for multi-task learning in ride-hailing system
Md Hishamur Rahman ... Dongjie Wang
Multimodal Transportation | VOL. 3
Md Hishamur Rahman, et. al.Md Hishamur Rahman ... Dongjie Wang
22 Aug 2024
Multimodal Transportation | VOL. 3

Scalable Object Detection Using Deep but Lightweight CNN with Features Fusion
Qiaosong Chen ... Xin Deng
-
Qiaosong Chen, et. al.Qiaosong Chen ... Xin Deng
01 Jan 2017
01 Jan 2017

Cross‐modal Learning of Visual Categories using Different Levels of Supervision
...
-
, et. al. ...
01 Jan 2007
01 Jan 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Unsupervised Monocular Training Method for Depth Estimation Using Statistical Masks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access