Temporally Consistent Depth Map Prediction Using Deep Convolutional Neural Network and Spatial-Temporal Conditional Random Field

Xu-Ran Zhao,Xun Wang,Qi-Chao Chen

doi:10.1007/s11390-017-1735-x

Abstract

Deep convolutional neural networks (DCNNs) based methods recently keep setting new records on the tasks of predicting depth maps from monocular images. When dealing with video-based applications such as 2D (2-dimensional) to 3D (3-dimensional) video conversion, however, these approaches tend to produce temporally inconsistent depth maps, since their CNN models are optimized over single frames. In this paper, we address this problem by introducing a novel spatial-temporal conditional random fields (CRF) model into the DCNN architecture, which is able to enforce temporal consistency between depth map estimations over consecutive video frames. In our approach, temporally consistent superpixel (TSP) is first applied to an image sequence to establish the correspondence of targets in consecutive frames. A DCNN is then used to regress the depth value of each temporal superpixel, followed by a spatial-temporal CRF layer to model the relationship of the estimated depths in both spatial and temporal domains. The parameters in both DCNN and CRF models are jointly optimized with back propagation. Experimental results show that our approach not only is able to significantly enhance the temporal consistency of estimated depth maps over existing single-frame-based approaches, but also improves the depth estimation accuracy in terms of various evaluation metrics.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Temporally Consistent Depth Map Prediction Using Deep Convolutional Neural Network and Spatial-Temporal Conditional Random Field

Abstract

Talk to us

Similar Papers

More From: Journal of Computer Science and Technology

Lead the way for us

Journal: Journal of Computer Science and Technology	Publication Date: May 1, 2017
Citations: 2

Similar Papers

Decision letter: Graphical-model framework for automated annotation of cell identities in dense cellular images
Ronald L Calabrese
-
Ronald L CalabreseRonald L Calabrese
24 Aug 2020
24 Aug 2020

PATH-13. LEARNED RESIZING WITH EFFICIENT TRAINING (LRET) FACILITATES IMPROVED PERFORMANCE OF LARGE-SCALE BRAIN TUMOR HISTOLOGY IMAGE CLASSIFICATION MODELS
Brent A Orr ... Quyhn T Tran
Neuro-Oncology | VOL. 26
Brent A Orr, et. al.Brent A Orr ... Quyhn T Tran
18 Jun 2024
Neuro-Oncology | VOL. 26

Fast video super resolution using deep convolutional networks
K Chaitanya Pavan Tanay ... P K Baruah
-
K Chaitanya Pavan Tanay, et. al.K Chaitanya Pavan Tanay ... P K Baruah
01 Mar 2017
01 Mar 2017

Video Classification via Weakly Supervised Sequence Modeling
Jingjing Liu ... Dimitris N Metaxas
Computer Vision and Image Understanding | VOL. 152
Jingjing Liu, et. al.Jingjing Liu ... Dimitris N Metaxas
10 Nov 2015
Computer Vision and Image Understanding | VOL. 152

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Temporally Consistent Depth Map Prediction Using Deep Convolutional Neural Network and Spatial-Temporal Conditional Random Field

Abstract

Talk to us

Similar Papers

More From: Journal of Computer Science and Technology