Abstract

Recognizing surgical activities in endoscopic videos is of vital importance for developing context-aware decision support in the operating room. In this work, we model each surgical activity as an action triplet consisting of the surgical instrument, the action, and the target organ that the instrument interacts with. The goal is to recognize these action triplets from endoscopic videos. Correctly recognizing fine-grained activity triplets is challenging, however, because of the long-tail distribution of the triplet classes and the complex associations both between triplets and within each triplet. In addition, multiple triplets may appear in a given video frame. To address these challenges, we propose a new model for surgical action triplet recognition based on a classification forest and a Graph Convolutional Network (GCN), which we call Forest GCN. The classification forest calibrates the fine-grained triplet classifiers with their upstream parent classifiers, suppressing noisy logits of the triplet classes in the long tail, while stacked GCNs model the dependencies between triplet classes and leverage language embeddings. Experiments on the endoscopic video dataset CholecT50 demonstrate that our proposed method outperforms current state-of-the-art methods on surgical action triplet recognition.
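
To make the two components concrete, below is a minimal PyTorch sketch of (a) stacked GCN layers that propagate language embeddings of the triplet classes over a class-dependency graph to produce per-class weight vectors, and (b) a parent-to-triplet calibration step that down-weights long-tail triplet logits using parent-classifier probabilities. The class names, the graph construction, and the log-probability calibration rule here are illustrative assumptions, not the authors' exact Forest GCN implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GCNLayer(nn.Module):
    """One graph-convolution layer: H' = ReLU(A_hat @ H @ W)."""
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.linear = nn.Linear(in_dim, out_dim)

    def forward(self, h, a_hat):
        # a_hat: normalized (C x C) adjacency encoding triplet-class dependencies
        return F.relu(self.linear(a_hat @ h))

class TripletHead(nn.Module):
    """Illustrative head: stacked GCNs over language embeddings yield
    per-class classifier weights; parent (instrument/verb/target)
    probabilities then gate the fine-grained triplet logits."""
    def __init__(self, feat_dim, embed_dim):
        super().__init__()
        self.gcn1 = GCNLayer(embed_dim, embed_dim)
        self.gcn2 = GCNLayer(embed_dim, feat_dim)

    def forward(self, frame_feat, class_embed, a_hat, parent_prob):
        # frame_feat:  (B, D) visual features of a video frame
        # class_embed: (C, E) language embeddings of the triplet classes
        # parent_prob: (B, C) parent-classifier probability assigned to each
        #              triplet (an assumed calibration signal, e.g. the product
        #              of its instrument, verb, and target probabilities)
        w = self.gcn2(self.gcn1(class_embed, a_hat), a_hat)   # (C, D)
        logits = frame_feat @ w.t()                           # (B, C)
        # Calibration: add parent log-probabilities so triplets whose parents
        # are unlikely (common in the long tail) get suppressed logits.
        return logits + torch.log(parent_prob + 1e-6)

# Example usage with random tensors (shapes only; not real data):
head = TripletHead(feat_dim=512, embed_dim=300)
C = 100  # CholecT50 defines 100 triplet classes
a_hat = torch.eye(C)  # placeholder normalized adjacency
logits = head(torch.randn(4, 512), torch.randn(C, 300), a_hat, torch.rand(4, C))
```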
