Abstract

Monocular depth estimation is a fundamental task in machine vision, and its performance has improved greatly in recent years. However, most depth estimation networks rely on very deep backbones to extract features, which causes a large amount of information to be lost; the loss of object information during encoding and decoding is particularly severe. As a result, the estimated depth maps lack object structure detail and have blurred edges. In complex indoor environments, which are the focus of this paper, the consequences of this information loss are especially serious. To address this problem, we propose a dense feature fusion network that uses a feature pyramid to aggregate features of various scales. Furthermore, to fuse decoded object contour information with depth information more effectively, we propose an adaptive depth fusion module that allows the network to fuse depth maps of various scales adaptively and thereby increase the object information in the predicted depth map. Unlike previous work that predicts depth maps with a U-Net architecture, our depth map is predicted by fusing multi-scale depth maps, each of which has its own characteristics. By fusing them, we can estimate depth maps that not only contain accurate depth values but also retain rich object contours and structure detail. Experiments indicate that the proposed model predicts depth maps with more object information than prior work while achieving competitive accuracy, and it reaches state-of-the-art edge accuracy on the NYU Depth V2 dataset compared with other contemporary techniques.
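The abstract does not spell out how the feature pyramid aggregates features of different scales. The PyTorch sketch below illustrates one common way such aggregation can be done (lateral 1x1 convolutions, upsampling to a shared resolution, summation, and a merging 3x3 convolution); the module name, channel widths, and structure are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FeaturePyramidFusion(nn.Module):
    """Illustrative sketch (assumed design): project encoder features of
    several scales to a common channel width, upsample them to the largest
    resolution, and merge them with a 3x3 convolution."""

    def __init__(self, in_channels=(64, 128, 256, 512), out_channels=64):
        super().__init__()
        # 1x1 convolutions bring every scale to the same channel count
        self.lateral = nn.ModuleList(
            [nn.Conv2d(c, out_channels, kernel_size=1) for c in in_channels]
        )
        # 3x3 convolution merges the summed pyramid features
        self.merge = nn.Conv2d(out_channels, out_channels, kernel_size=3, padding=1)

    def forward(self, features):
        # features: list of tensors, highest resolution first
        target_size = features[0].shape[-2:]
        fused = 0
        for feat, lat in zip(features, self.lateral):
            x = lat(feat)
            if x.shape[-2:] != target_size:
                x = F.interpolate(x, size=target_size,
                                  mode='bilinear', align_corners=False)
            fused = fused + x
        return self.merge(fused)
```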

Highlights

  • Depth estimation is a fundamental problem in computer vision, applied to robot navigation, augmented reality, 3D reconstruction, autonomous driving, and other fields

  • Most depth estimation models are built on very deep neural networks to extract image features and achieve good performance, but the feature maps obtained after many convolutions lose a great deal of information, especially object information, so small objects and object structure detail are missing from the feature maps

  • U-Net performs well in many vision tasks, but its gradual decoding makes it poor at multi-scale feature fusion. To address these problems, we propose a network that estimates depth from a single image by fusing multi-scale depth maps


Summary

INTRODUCTION

Depth estimation is a fundamental problem in computer vision with applications in robot navigation, augmented reality, 3D reconstruction, autonomous driving, and other fields. Most depth estimation models are built on very deep neural networks to extract image features and achieve good performance, but the feature maps obtained after many convolutions lose a great deal of information, especially object information, so small objects and object structure detail are missing from the feature maps. For indoor scenes containing many objects, the impact of this information loss is serious. To deal with this problem, some previous works [7], [8] introduce skip connections to add low-scale features to the decoder module. Although U-Net performs well in many vision tasks, its gradual decoding makes it poor at multi-scale feature fusion. To address these problems, we propose a network that estimates depth from a single image by fusing multi-scale depth maps: coarse depth maps are estimated at various scales and combined by weighted summation, yielding a depth map with both high accuracy and rich scene information. Extensive experiments show that our predicted depth maps contain more object information and have clearer edges than those of previous works, with competitive depth accuracy on the NYU-Depth V2 dataset.
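To make the weighted summation of multi-scale depth maps concrete, the following minimal sketch upsamples coarse predictions to full resolution and blends them with per-pixel softmax weights. The module name, the small convolutional weight head, and the choice of per-pixel rather than scalar weights are assumptions for illustration, not the paper's exact adaptive depth fusion formulation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AdaptiveDepthFusion(nn.Module):
    """Illustrative sketch (assumed design): upsample multi-scale depth
    predictions to full resolution and combine them with per-pixel
    softmax weights."""

    def __init__(self, num_scales=4):
        super().__init__()
        # Predict one weight map per input depth map from the stacked depths
        self.weight_head = nn.Conv2d(num_scales, num_scales,
                                     kernel_size=3, padding=1)

    def forward(self, depth_maps, output_size):
        # depth_maps: list of (B, 1, h_i, w_i) coarse predictions at different scales
        upsampled = [
            F.interpolate(d, size=output_size,
                          mode='bilinear', align_corners=False)
            for d in depth_maps
        ]
        stacked = torch.cat(upsampled, dim=1)                  # (B, S, H, W)
        weights = torch.softmax(self.weight_head(stacked), dim=1)
        fused = (weights * stacked).sum(dim=1, keepdim=True)   # weighted summation
        return fused

# Example usage with random tensors (shapes are illustrative)
if __name__ == "__main__":
    maps = [torch.rand(1, 1, 240 // s, 320 // s) for s in (1, 2, 4, 8)]
    fusion = AdaptiveDepthFusion(num_scales=4)
    depth = fusion(maps, output_size=(240, 320))
    print(depth.shape)  # torch.Size([1, 1, 240, 320])
```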
