3D human pose estimation in multi-view operating room videos using differentiable camera projections

Beerend G.A. Gerats, Jelmer M. Wolterink, Ivo A.M.J. Broeders

doi:10.1080/21681163.2022.2155580

Abstract

3D human pose estimation in multi-view operating room (OR) videos supports person tracking and action recognition. However, the surgical environment makes poses hard to find because of sterile clothing, frequent occlusions and limited public data. Methods designed specifically for the OR generally fuse poses detected in multiple camera views. Typically, a 2D pose estimator such as a convolutional neural network (CNN) detects joint locations in each view; the detected joint locations are then projected to 3D and fused over all camera views. However, accurate detection in 2D does not guarantee accurate localisation in 3D space. In this work, we propose to optimise directly for localisation in 3D by training 2D CNNs end-to-end with a 3D loss that is backpropagated through each camera’s projection parameters. Using videos from the MVOR dataset, we show that this end-to-end approach outperforms optimisation in 2D space.
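
The gap between 2D and 3D accuracy described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the two camera matrices, the joint position, the 5-pixel errors and all function names are assumptions chosen for illustration, and linear least-squares triangulation stands in for whatever fusion step the paper uses. A central finite difference stands in for backpropagation, only to show that a 3D loss yields a gradient with respect to 2D detections once it is routed through the camera projection parameters.

```python
# Illustrative sketch only: cameras, point and names are hypothetical.

def project(P, X):
    """Project a 3D point X with a 3x4 camera matrix P to pixels (u, v)."""
    Xh = (*X, 1.0)
    h = [sum(P[r][c] * Xh[c] for c in range(4)) for r in range(3)]
    return (h[0] / h[2], h[1] / h[2])

def solve3(M, b):
    """Solve the 3x3 linear system M x = b with Cramer's rule."""
    def det(m):
        return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
                - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
                + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))
    d = det(M)
    return [det([[b[i] if j == k else M[i][j] for j in range(3)]
                 for i in range(3)]) / d for k in range(3)]

def triangulate(Ps, uvs):
    """Linear least-squares triangulation of one joint from several views."""
    rows = []
    for P, (u, v) in zip(Ps, uvs):
        rows.append([u * P[2][c] - P[0][c] for c in range(4)])
        rows.append([v * P[2][c] - P[1][c] for c in range(4)])
    # Normal equations A^T A x = A^T b, with A = rows[:, :3], b = -rows[:, 3]
    AtA = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    Atb = [sum(-r[i] * r[3] for r in rows) for i in range(3)]
    return solve3(AtA, Atb)

def loss_3d(Ps, uvs, X_true):
    """Euclidean distance between the triangulated and the true joint."""
    X_hat = triangulate(Ps, uvs)
    return sum((a - b) ** 2 for a, b in zip(X_hat, X_true)) ** 0.5

def finite_diff_grad(Ps, uvs, X_true, view, eps=1e-4):
    """Central finite-difference gradient of the 3D loss w.r.t. one view's (u, v).
    A stand-in for autodiff; an end-to-end model would backpropagate instead."""
    grad = []
    for axis in range(2):
        hi = [list(uv) for uv in uvs]
        lo = [list(uv) for uv in uvs]
        hi[view][axis] += eps
        lo[view][axis] -= eps
        grad.append((loss_3d(Ps, hi, X_true) - loss_3d(Ps, lo, X_true)) / (2 * eps))
    return grad

# Two hypothetical cameras: same intrinsics, second one shifted 1 m along x.
f, cx, cy, baseline = 500.0, 320.0, 240.0, 1.0
Ps = [
    [[f, 0.0, cx, 0.0], [0.0, f, cy, 0.0], [0.0, 0.0, 1.0, 0.0]],
    [[f, 0.0, cx, -f * baseline], [0.0, f, cy, 0.0], [0.0, 0.0, 1.0, 0.0]],
]
X_true = (0.2, -0.1, 3.0)                   # hypothetical joint position in metres
uvs = [project(P, X_true) for P in Ps]      # perfect 2D detections

# The same 5-pixel 2D error in view 0, applied along u and then along v.
u1, v1 = uvs[0]
noisy_u = [(u1 + 5.0, v1), uvs[1]]
noisy_v = [(u1, v1 + 5.0), uvs[1]]
err_u = loss_3d(Ps, noisy_u, X_true)
err_v = loss_3d(Ps, noisy_v, X_true)
grad = finite_diff_grad(Ps, noisy_u, X_true, view=0)

print("3D error for a 5 px error along u:", err_u)
print("3D error for a 5 px error along v:", err_v)
print("d(3D loss)/d(u, v) in view 0:", grad)
```

In this assumed setup the two detections have identical 2D error, yet the error along the baseline direction shifts the estimated depth and produces a much larger 3D error. A loss defined in 2D cannot distinguish the two cases, whereas a loss defined in 3D and propagated through the projection matrices can.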

Journal: Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization
Publication Date: Dec 19, 2022
Citations: 3
License type: open-access

Similar Papers

A Multi-Task Neural Network for Action Recognition with 3D Key-Points
Rongxiao Tang ... Luyang Wang
10 Jan 2021

An Image Cues Coding Approach for 3D Human Pose Estimation
Meng Xing ... Zhiyong Feng
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 15
30 Nov 2019

Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo
Jiahao Lin ... Gim Hee Lee
01 Jun 2021

Smart-VPoseNet: 3D human pose estimation models and methods based on multi-view discriminant network
Hao Wang ... Minghui Sun
Knowledge-Based Systems | VOL. 239
24 Dec 2021
