Generalizable stereo depth estimation with masked image modelling.

Samyakh Tukra,Stamatia Giannarou,Chi Xu,Haozheng Xu

doi:10.1049/htl2.12067

Samyakh Tukra, Stamatia Giannarou + Show 2 more

Open Access

https://doi.org/10.1049/htl2.12067

Copy DOI

Journal: Healthcare technology letters	Publication Date: Dec 23, 2023
Citations: 1	License type: CC BY 4.0

Affiliation: Imperial College London

Abstract

Generalizable and accurate stereo depth estimation is vital for 3D reconstruction, especially in surgery. Supervised learning methods obtain best performance however, limited ground truth data for surgical scenes limits generalizability. Self-supervised methods don't need ground truth, but suffer from scale ambiguity and incorrect disparity prediction due to inconsistency of photometric loss. This work proposes a two-phase training procedure that is generalizable and retains the high performance of supervised methods. It entails: (1) performing self-supervised representation learning of left and right views via masked image modelling (MIM) to learn generalizable semantic stereo features (2) utilizing the MIM pre-trained model to learn robust depth representation via supervised learning for disparity estimation on synthetic data only. To improve stereo representations learnt via MIM, perceptual loss terms are introduced, which improve the model's stereo representations learnt by explicitly encouraging the learning of higher scene-level features. Qualitative and quantitative performance evaluation on surgical and natural scenes shows that the approach achieves sub-millimetre accuracy and lowest errors respectively, setting a new state-of-the-art. Despite not training on surgical nor natural scene data for disparityestimation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Generalizable stereo depth estimation with masked image modelling.

Abstract

Talk to us

Similar Papers

More From: Healthcare technology letters

Lead the way for us

Similar Papers

Stereo Depth Estimation via Self-supervised Contrastive Representation Learning
Samyakh Tukra ... Stamatia Giannarou
-
Samyakh Tukra, et. al.Samyakh Tukra ... Stamatia Giannarou
01 Jan 2021
01 Jan 2021

Graph Barlow Twins: A self-supervised representation learning framework for graphs
Piotr Bielak ... Nitesh V Chawla
Knowledge-Based Systems | VOL. 256
Piotr Bielak, et. al.Piotr Bielak ... Nitesh V Chawla
17 Aug 2022
Knowledge-Based Systems | VOL. 256

Visual Representation Learning with Minimal Supervision

-

24 Feb 2021
24 Feb 2021

Generalized self-supervised contrastive learning with bregman divergence for image recognition
Zhiyuan Li ... Anca Ralescu
Pattern Recognition Letters | VOL. 171
Zhiyuan Li, et. al.Zhiyuan Li ... Anca Ralescu
22 May 2023
Pattern Recognition Letters | VOL. 171

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Generalizable stereo depth estimation with masked image modelling.

Abstract

Talk to us

Similar Papers

More From: Healthcare technology letters