An Unsupervised Monocular Image Depth Prediction Algorithm Based on Multiple Loss Deep Learning

Xiaojiao Tang, Lifang Chen

doi:10.1109/access.2019.2951035

Abstract

To improve prediction accuracy while keeping execution time low during image depth map generation, we investigate unsupervised monocular image depth prediction. In this paper, an unsupervised monocular image depth prediction method based on multiple loss deep learning is designed, with two main contributions. First, a monocular image depth estimation algorithm based on multi-scale feature extraction is proposed, which consists of two parts: a feature extraction network and a deconvolution prediction network. The feature extraction network extracts image features at different levels of the network and feeds the acquired multi-scale features into the deconvolution layers without changing the image resolution. After training, the network predicts the left and right disparity maps. Second, we provide a new multiple loss function that uses the asymmetric parameters of the training model and the epipolar geometry constraint. The Multi-Scale Structural Similarity Index (MS-SSIM) and the L1 norm are combined as the image reconstruction loss, and the left-right disparity consistency and the flipped left-right disparity consistency are incorporated into the training loss of the network model. The simulation results show that this method effectively improves prediction accuracy, particularly for complex images containing mirrors, transparent objects, and shadows. We further evaluate our method on the KITTI dataset, where it achieves end-to-end results that even exceed those of a supervised method.
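
The reconstruction loss described above can be illustrated with a minimal sketch. It is not the authors' implementation: it uses a single-scale SSIM with a 3x3 box window in place of MS-SSIM, operates on grayscale images with values in [0, 1], and the weight `alpha=0.85` and all function names are assumptions made for illustration only.

```python
import numpy as np

def box_filter3(img):
    """Mean over every 3x3 window of a 2-D array (valid region only)."""
    h, w = img.shape
    out = np.zeros((h - 2, w - 2), dtype=np.float64)
    for dy in range(3):
        for dx in range(3):
            out += img[dy:dy + h - 2, dx:dx + w - 2]
    return out / 9.0

def ssim_map(x, y, c1=0.01 ** 2, c2=0.03 ** 2):
    """Single-scale SSIM per window (a simplification of MS-SSIM)."""
    mu_x, mu_y = box_filter3(x), box_filter3(y)
    var_x = box_filter3(x * x) - mu_x ** 2
    var_y = box_filter3(y * y) - mu_y ** 2
    cov = box_filter3(x * y) - mu_x * mu_y
    num = (2 * mu_x * mu_y + c1) * (2 * cov + c2)
    den = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return num / den

def reconstruction_loss(target, recon, alpha=0.85):
    """Weighted sum of an SSIM term and an L1 term.

    alpha is an assumed weight, not a value taken from the paper.
    """
    ssim_term = np.mean(np.clip((1.0 - ssim_map(target, recon)) / 2.0, 0.0, 1.0))
    l1_term = np.mean(np.abs(target - recon))
    return float(alpha * ssim_term + (1.0 - alpha) * l1_term)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    left = rng.random((8, 8))        # stand-in for the input left image
    recon = rng.random((8, 8))       # stand-in for the image rebuilt from a disparity map
    print(reconstruction_loss(left, left))   # a perfect reconstruction gives zero loss
    print(reconstruction_loss(left, recon))  # an imperfect reconstruction gives a positive loss
```

In a training setup the reconstructed image would come from warping the opposite view with the predicted disparity; here random arrays stand in for both images so the sketch stays self-contained.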

Highlights

As a fundamental problem of computer vision, image depth estimation has received significant attention in both industry and academia.
We propose an unsupervised monocular image depth prediction algorithm, and the simulation results show that the method improves the accuracy of image depth prediction.
We propose an unsupervised monocular image depth prediction algorithm based on multiple loss deep learning.

Summary

INTRODUCTION

As a fundamental problem of computer vision, image depth estimation has received significant attention in both industry and academia. To solve this problem, many studies have proposed monocular depth estimation algorithms based on supervised learning [5], [6]. These methods train a convolutional neural network (CNN) directly on a large amount of ground truth depth data, and the trained model directly predicts the depth of each pixel in the image. We propose an unsupervised monocular image depth prediction algorithm based on multiple loss deep learning. This network architecture can obtain left and right disparity maps without ground truth depth. The proposed monocular image depth estimation architecture is a CNN based on ResNet-50; in the architecture figure, the blocks C (yellow), P (red), d (blue), and b (purple) correspond to convolution, max pooling, disparity maps, and blocks, respectively. We incorporate the flipped left-right disparity consistency into the training loss function of the network model, so that the postprocessing step is integrated directly into our network. This significantly reduces the testing time. In (6), $x_f$ denotes the pixel coordinates in the flipped input left image, $d^{l_f}_{x_f}$ is the disparity map of the flipped left image, and $d^{fl}_{x_f}$ is the horizontally flipped version of the left disparity map $d^{l}_{x}$; $d^{fl}_{x_f}$ and $d^{l_f}_{x_f}$ are theoretically equal.
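
The flipped left-right disparity consistency can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's equation (6): the L1 penalty, the function names, and the two toy predictors are hypothetical, and a real predictor would be the trained network.

```python
import numpy as np

def flip_consistency_loss(predict_left_disparity, left_image):
    """Mean absolute difference between the disparity predicted for the
    horizontally flipped image and the flipped disparity of the original.

    predict_left_disparity is a hypothetical callable: image -> disparity map.
    """
    d_left = predict_left_disparity(left_image)            # disparity of the left image
    flipped_image = left_image[:, ::-1]                    # horizontally flipped input
    d_of_flipped = predict_left_disparity(flipped_image)   # disparity of the flipped image
    d_left_flipped = d_left[:, ::-1]                       # flipped disparity of the original
    return float(np.mean(np.abs(d_of_flipped - d_left_flipped)))

def equivariant_predictor(img):
    """Toy predictor that commutes with flipping (pointwise scaling)."""
    return 0.1 * img

def biased_predictor(img):
    """Toy predictor with a column-dependent bias, so flipping does not commute."""
    ramp = np.linspace(0.0, 1.0, img.shape[1])
    return 0.1 * img + 0.05 * ramp

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image = rng.random((4, 6))
    print(flip_consistency_loss(equivariant_predictor, image))  # zero: both maps agree
    print(flip_consistency_loss(biased_predictor, image))       # positive: the maps disagree
```

Penalizing this difference during training pushes the network toward predictions that agree under flipping, which is the property that lets a flip-based postprocessing step be dropped at test time.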

EXPERIMENT

Findings

CONCLUSION

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 4	License type: CC BY 4.0
