HAPNet: hierarchically aggregated pyramid network for real-time stereo matching

Patrick Brandao,Dimitris Psychogyios,Evangelos Mazomenos,Danail Stoyanov,Mirek Janatka

doi:10.1080/21681163.2020.1835561

Abstract

ABSTRACT Recovering the 3D shape of the surgical site is crucial for multiple computer-assisted interventions. Stereo endoscopes can be used to compute 3D depth but computational stereo is a challenging, non-convex and inherently discontinuous optimisation problem. In this paper, we propose a deep learning architecture which avoids the explicit construction of a cost volume of similarity which is one of the most computationally costly blocks of stereo algorithms. This makes training our network significantly more efficient and avoids the needs for large memory allocation. Our method performs well, especially around regions comprising multiple discontinuities around surgical instrumentation or around complex small structures and instruments. The method compares well to the state-of-the-art techniques while taking a different methodological angle to computational stereo problem in surgical video.

Full Text