Learning Effective Geometry Representation from Videos for Self-Supervised Monocular Depth Estimation

Hailiang Zhao,Chonghao Zhang,Yongyi Kong,Haoji Zhang,Jiansen Zhao

doi:10.3390/ijgi13060193

Abstract

Recent studies on self-supervised monocular depth estimation have achieved promising results, which are mainly based on the joint optimization of depth and pose estimation via high-level photometric loss. However, how to learn the latent and beneficial task-specific geometry representation from videos is still far from being explored. To tackle this issue, we propose two novel schemes to learn more effective representation from monocular videos: (i) an Inter-task Attention Model (IAM) to learn the geometric correlation representation between the depth and pose learning networks to make structure and motion information mutually beneficial; (ii) a Spatial-Temporal Memory Module (STMM) to exploit long-range geometric context representation among consecutive frames both spatially and temporally. Systematic ablation studies are conducted to demonstrate the effectiveness of each component. Evaluations on KITTI show that our method outperforms current state-of-the-art techniques.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning Effective Geometry Representation from Videos for Self-Supervised Monocular Depth Estimation

Abstract

Talk to us

Similar Papers

More From: ISPRS International Journal of Geo-Information

Lead the way for us

Journal: ISPRS International Journal of Geo-Information	Publication Date: Jun 11, 2024
License type: CC BY 4.0

Similar Papers

Geometric Constraints for Self-supervised Monocular Depth Estimation on Laparoscopic Images with Dual-task Consistency
Wenda Li ... Yuichiro Hayashi
-
Wenda Li, et. al.Wenda Li ... Yuichiro Hayashi
01 Jan 2021
01 Jan 2021

Self-supervised monocular depth estimation from oblique UAV videos
Logambal Madhuanand ... Michael Ying Yang
ISPRS Journal of Photogrammetry and Remote Sensing | VOL. 176
Logambal Madhuanand, et. al.Logambal Madhuanand ... Michael Ying Yang
13 Apr 2021
ISPRS Journal of Photogrammetry and Remote Sensing | VOL. 176

MDSNet: self-supervised monocular depth estimation for video sequences using self-attention and threshold mask
Jiaqi Zhao ... Wang Zhang
Journal of Electronic Imaging | VOL. 31
Jiaqi Zhao, et. al.Jiaqi Zhao ... Wang Zhang
14 Sep 2022
Journal of Electronic Imaging | VOL. 31

Self-supervised monocular depth estimation on water scenes via specular reflection prior
Zhengyang Lu ... Ying Chen
Digital Signal Processing | VOL. 149
Zhengyang Lu, et. al.Zhengyang Lu ... Ying Chen
03 Apr 2024
Digital Signal Processing | VOL. 149

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning Effective Geometry Representation from Videos for Self-Supervised Monocular Depth Estimation

Abstract

Talk to us

Similar Papers

More From: ISPRS International Journal of Geo-Information