Prediction of remaining surgery duration in laparoscopic videos based on visual saliency and the transformer network.

Constantinos Loukas, Konstantina Prevezanou, Dimitrios Schizas, Ioannis Seimenis

doi:10.1002/rcs.2632

Abstract

Real-time prediction of the remaining surgery duration (RSD) is important for optimal scheduling of resources in the operating room. We focus on the intraoperative prediction of RSD from laparoscopic video. We present an extensive evaluation of seven common deep learning models, a proposed model based on the Transformer architecture (TransLocal), and four baseline approaches. The proposed pipeline includes a CNN-LSTM for feature extraction from salient regions within short video segments and a Transformer with local attention mechanisms. On the Cholec80 dataset, TransLocal yielded the best performance (mean absolute error (MAE) = 7.1 min). For long and short surgeries, the MAE was 10.6 and 4.4 min, respectively. Thirty minutes before the end of surgery, the MAE was 6.2, 7.2 and 5.5 min for all, long and short surgeries, respectively. The proposed technique achieves state-of-the-art results. In the future, we aim to incorporate intraoperative indicators and pre-operative data.
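The MAE figures quoted in the abstract can be illustrated with a minimal sketch. It is not the authors' evaluation code: the function names, the sampling step, and the reading of "thirty minutes before the end of surgery" as "time points whose true RSD is at most 30 min" are all assumptions made for illustration, and the numbers in the usage note are invented.

```python
def rsd_targets(duration_min, step_min=1.0):
    """Ground-truth RSD (in minutes) at each sampled time point of one surgery.

    Assumes the RSD decreases linearly from the total duration to zero.
    """
    n = int(duration_min / step_min) + 1
    return [duration_min - i * step_min for i in range(n)]


def mae(pred, target):
    """Mean absolute error between predicted and true RSD values."""
    if len(pred) != len(target) or not pred:
        raise ValueError("pred and target must be non-empty and equally long")
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def mae_last_minutes(pred, target, window_min=30.0):
    """MAE restricted to time points whose true RSD is at most window_min.

    This is one plausible reading of an 'MAE thirty minutes before the end';
    the paper may define the interval differently.
    """
    pairs = [(p, t) for p, t in zip(pred, target) if t <= window_min]
    if not pairs:
        raise ValueError("no time points fall inside the window")
    return mae([p for p, _ in pairs], [t for _, t in pairs])
```

For a hypothetical 40-minute surgery sampled every 10 minutes, `rsd_targets(40, 10)` gives `[40, 30, 20, 10, 0]`; these targets can then be compared against a model's per-time-point predictions with `mae` and `mae_last_minutes`.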

Journal: The International Journal of Medical Robotics and Computer Assisted Surgery
Publication Date: Apr 1, 2024
License type: CC BY-NC 4.0