Abstract

Recognising workflow phases from endoscopic surgical videos is crucial for deriving indicators that convey the quality, efficiency, and outcome of a surgery, and for offering insights into surgical team skills. Additionally, workflow information is used to organise large surgical video libraries for training purposes. In this paper, we explore different deep networks that capture spatial and temporal information from surgical videos for surgical workflow recognition. The approach combines two networks: the first extracts features from video snippets, and the second performs action segmentation, identifying the different parts of the surgical workflow by analysing the extracted features. This work focuses on proposing, comparing, and analysing different design choices, including fully convolutional, fully transformer, and hybrid models that use transformers in conjunction with convolutions. We evaluate the methods on a large dataset of endoscopic surgical videos acquired during Gastric Bypass surgery. Both our proposed fully transformer method and our fully convolutional approach achieve state-of-the-art results. By integrating transformers and convolutions, our hybrid model achieves 93% frame-level accuracy and a segmental edit distance score of 85. This demonstrates the potential of hybrid models that employ both transformers and convolutions for accurate surgical workflow recognition.
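To make the two-stage design concrete, the sketch below shows one plausible way to wire a hybrid temporal model over pre-extracted snippet features: dilated temporal convolutions for local context followed by self-attention for global context, with a per-frame phase classifier. Class names, layer sizes, and the exact way convolutions and attention are combined are assumptions for illustration only; the paper's actual architectures may differ.

```python
# Hypothetical sketch of the two-stage pipeline: snippet features in,
# per-frame surgical phase logits out. All names and sizes are assumed.
import torch
import torch.nn as nn


class DilatedConvBlock(nn.Module):
    """Residual temporal convolution with a given dilation (TCN-style)."""
    def __init__(self, dim, dilation):
        super().__init__()
        self.conv = nn.Conv1d(dim, dim, kernel_size=3,
                              padding=dilation, dilation=dilation)
        self.relu = nn.ReLU()

    def forward(self, x):                      # x: (batch, dim, time)
        return x + self.relu(self.conv(x))


class HybridSegmenter(nn.Module):
    """Hybrid action-segmentation head: dilated convolutions followed by
    self-attention, predicting a phase label for every time step."""
    def __init__(self, feat_dim=2048, hidden_dim=64, num_phases=8, num_layers=4):
        super().__init__()
        self.proj = nn.Conv1d(feat_dim, hidden_dim, kernel_size=1)
        self.conv_layers = nn.ModuleList(
            [DilatedConvBlock(hidden_dim, dilation=2 ** i) for i in range(num_layers)]
        )
        self.attn = nn.MultiheadAttention(hidden_dim, num_heads=4, batch_first=True)
        self.classifier = nn.Conv1d(hidden_dim, num_phases, kernel_size=1)

    def forward(self, feats):                  # feats: (batch, time, feat_dim)
        x = self.proj(feats.transpose(1, 2))   # (batch, hidden, time)
        for layer in self.conv_layers:         # local temporal context
            x = layer(x)
        x = x.transpose(1, 2)                  # (batch, time, hidden)
        x, _ = self.attn(x, x, x)              # global temporal context
        return self.classifier(x.transpose(1, 2))   # (batch, num_phases, time)


# Example: one video represented by 300 snippet features of dimension 2048.
logits = HybridSegmenter()(torch.randn(1, 300, 2048))
print(logits.shape)  # torch.Size([1, 8, 300])
```

In this sketch the snippet features would come from a separately trained feature-extraction network (the first network described above); only the temporal segmentation stage is shown.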
