Camera Motion Estimation Research Articles

Video streams are used to guide minimally-invasive surgery and diagnosis in a wide range of procedures, and many computer-assisted techniques have been developed to analyse them automatically. These approaches can provide additional information to the surgeon, such as lesion detection, instrument navigation, or anatomy 3D shape modelling. However, the image features needed to recognise these patterns are not always reliably detected due to the presence of irregular light patterns such as specular highlight reflections. In this paper, we aim to remove specular highlights from endoscopic videos using machine learning. We propose using a temporal generative adversarial network (GAN) to inpaint the hidden anatomy under specularities, inferring its appearance both spatially and from neighbouring frames in which the specularities are not present in the same location. This is achieved using in-vivo data from gastric endoscopy (Hyper Kvasir) in a fully unsupervised manner that relies on the automatic detection of specular highlights. System evaluations show significant improvements over other methods through direct comparison and ablation studies that demonstrate the importance of the network’s temporal and transfer learning components. The generalisability of our system to different surgical setups and procedures was also evaluated qualitatively on in-vivo data of gastric endoscopy and ex-vivo porcine data (SERV-CT, SCARED). We also assess the effect of our method, in comparison to other methods, on computer vision tasks that underpin 3D reconstruction and camera motion estimation, namely stereo disparity, optical flow, and sparse point feature matching. These are evaluated quantitatively and qualitatively in a novel comprehensive analysis, and the results show a positive effect of our specular inpainting method on these tasks. Our code and dataset are made available at https://github.com/endomapper/Endo-STTN.

Articles published on Camera Motion Estimation

RGB-D visual odometry by constructing and matching features at superpixel level

Unsupervised Monocular Depth Estimation With Channel and Spatial Attention

Towards explainable artificial intelligence in deep vision-based odometry

Deep Scene Flow Learning: From 2D Images to 3D Point Clouds

A Temporal Learning Approach to Inpainting Endoscopic Specularities and Its Effect on Image Correspondence

SPSVO: a self-supervised surgical perception stereo visual odometer for endoscopy

Rethinking two-dimensional camera motion estimation assessment for digital video stabilization: A camera motion field-based metric

The Right Spin: Learning Object Motion from Rotation-Compensated Flow Fields

PointSLOT: Real-Time Simultaneous Localization and Object Tracking for Dynamic Environment

Wheeled Robot Visual Odometer Based on Two-dimensional Iterative Closest Point Algorithm

A Pose-Only Solution to Visual Reconstruction and Navigation

Enhancing Conventional Geometry-Based Visual Odometry Pipeline Through Integration of Deep Descriptors

SimVODIS++: Neural Semantic Visual Odometry in Dynamic Environments

Single-pass inline pipeline 3D reconstruction using depth camera array

Deep homography estimation in dynamic surgical scenes for laparoscopic camera motion extraction

Accuracy and Speed Improvement of Event Camera Motion Estimation Using a Bird's-Eye View Transformation

Camera Motion Estimation from Image Sequence in Pipe and Construction of 3D Point Cloud of Pipe

IMU-Assisted Online Video Background Identification

3D ego-Motion Estimation Using low-Cost mmWave Radars via Radar Velocity Factor for Pose-Graph SLAM

Depth-based branching level estimation for bronchoscopic navigation
