TRandAugment: temporal random augmentation strategy for surgical activity recognition from videos

Sanat Ramesh, Diego Dall’Alba, Cristians Gonzalez, Tong Yu, Pietro Mascagni, Didier Mutter, Jacques Marescaux, Paolo Fiorini, Nicolas Padoy

doi:10.1007/s11548-023-02864-8

Abstract

Purpose: Automatic recognition of surgical activities from intraoperative surgical videos is crucial for developing intelligent support systems for computer-assisted interventions. Current state-of-the-art recognition methods are based on deep learning, where data augmentation has shown the potential to improve generalization. This has spurred work on automated and simplified augmentation strategies for image classification and object detection on datasets of still images. Extending such augmentation methods to videos is not straightforward, as the temporal dimension needs to be considered. Furthermore, surgical videos pose additional challenges, as they are composed of multiple, interconnected, and long-duration activities.

Methods: This work proposes a new simplified augmentation method, called TRandAugment, specifically designed for long surgical videos. It treats each video as an assembly of temporal segments and applies consistent but random transformations to each segment. The proposed augmentation method is used to train an end-to-end spatiotemporal model consisting of a CNN (ResNet50) followed by a TCN.

Results: The effectiveness of the proposed method is demonstrated on two surgical video datasets, namely Bypass40 and CATARACTS, and two tasks, surgical phase and step recognition. TRandAugment adds a performance boost of 1–6% over previous state-of-the-art methods that use manually designed augmentations.

Conclusion: This work presents a simplified and automated augmentation method for long surgical videos. The proposed method has been validated on different datasets and tasks, indicating the importance of devising temporal augmentation methods for long surgical videos.
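The segment-wise idea described in the Methods can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not list the transformations or hyperparameters, so the operations (`brighten`, `darken`, `flip_horizontal`) and the parameters `num_segments`, `num_ops`, and `magnitude` are assumptions chosen for illustration, loosely following the RandAugment convention. Frames are plain grayscale grids (lists of rows of floats in [0, 1]) to keep the example dependency-free.

```python
import random

# Hypothetical per-frame operations (illustrative stand-ins, not the paper's list).
# Each takes a grayscale frame (list of rows of floats in [0, 1]) and a magnitude.
def brighten(frame, magnitude):
    return [[min(1.0, p * (1.0 + magnitude)) for p in row] for row in frame]

def darken(frame, magnitude):
    return [[p * (1.0 - magnitude) for p in row] for row in frame]

def flip_horizontal(frame, magnitude):
    # Magnitude is unused; the argument is kept so all ops share one signature.
    return [row[::-1] for row in frame]

OPS = [brighten, darken, flip_horizontal]

def segment_bounds(num_frames, num_segments):
    """Split range(num_frames) into contiguous, near-equal (start, end) segments."""
    edges = [i * num_frames // num_segments for i in range(num_segments + 1)]
    return list(zip(edges[:-1], edges[1:]))

def augment_video(video, num_segments=4, num_ops=2, magnitude=0.3, seed=None):
    """Apply randomly chosen ops per temporal segment, consistently within it.

    num_segments, num_ops, and magnitude are assumed parameter names.
    """
    rng = random.Random(seed)
    out = []
    for start, end in segment_bounds(len(video), num_segments):
        # The ops are drawn once per segment, so every frame in the segment
        # receives the same transformations (consistent within a segment),
        # while different segments may receive different ones (random across).
        ops = [rng.choice(OPS) for _ in range(num_ops)]
        for frame in video[start:end]:
            for op in ops:
                frame = op(frame, magnitude)
            out.append(frame)
    return out

# Usage: a toy 10-frame video of identical 2x2 frames.
video = [[[0.1, 0.2], [0.3, 0.4]] for _ in range(10)]
augmented = augment_video(video, num_segments=4, seed=0)
```

Drawing the operations once per segment rather than once per frame is what keeps the augmentation temporally coherent: frames inside a segment stay visually consistent with each other, while the video as a whole still sees varied transformations.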


Journal: International Journal of Computer Assisted Radiology and Surgery	Publication Date: Mar 22, 2023
Citations: 2	License type: CC BY 4.0


