MT4MTL-KD: A Multi-Teacher Knowledge Distillation Framework for Triplet Recognition.

Shuangchun Gui,Xun Zhou,Yi Cao,Jixiang Chen,Chen Zhang,Zhenkun Wang

doi:10.1109/tmi.2023.3345736

Abstract

The recognition of surgical triplets plays a critical role in the practical application of surgical videos. It involves the sub-tasks of recognizing instruments, verbs, and targets, while establishing precise associations between them. Existing methods face two significant challenges in triplet recognition: 1) the imbalanced class distribution of surgical triplets may lead to spurious task association learning, and 2) the feature extractors cannot reconcile local and global context modeling. To overcome these challenges, this paper presents a novel multi-teacher knowledge distillation framework for multi-task triplet learning, known as MT4MTL-KD. MT4MTL-KD leverages teacher models trained on less imbalanced sub-tasks to assist multi-task student learning for triplet recognition. Moreover, we adopt different categories of backbones for the teacher and student models, facilitating the integration of local and global context modeling. To further align the semantic knowledge between the triplet task and its sub-tasks, we propose a novel feature attention module (FAM). This module utilizes attention mechanisms to assign multi-task features to specific sub-tasks. We evaluate the performance of MT4MTL-KD on both the 5-fold cross-validation and the CholecTriplet challenge splits of the CholecT45 dataset. The experimental results consistently demonstrate the superiority of our framework over state-of-the-art methods, achieving significant improvements of up to 6.4% on the cross-validation split.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MT4MTL-KD: A Multi-Teacher Knowledge Distillation Framework for Triplet Recognition.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Medical Imaging

Lead the way for us

Journal: IEEE Transactions on Medical Imaging	Publication Date: Apr 1, 2024
Citations: 3

Similar Papers

Development of local and global wastewater biochemical oxygen demand real-time prediction models using supervised machine learning algorithms
Abdulaziz Sami Qambar ... Mohammed Majid M Al Khalidy
Engineering Applications of Artificial Intelligence | VOL. 118
Abdulaziz Sami Qambar, et. al.Abdulaziz Sami Qambar ... Mohammed Majid M Al Khalidy
15 Dec 2022
Engineering Applications of Artificial Intelligence | VOL. 118

Comparative analysis of local and consensus quantitative structure-activity relationship approaches for the prediction of bioconcentration factor
G Piir ... U Maran
SAR and QSAR in Environmental Research | VOL. 24
G Piir, et. al.G Piir ... U Maran
14 Feb 2013
SAR and QSAR in Environmental Research | VOL. 24

A Comparison Between Local and Global Models Among Different Near Infrared Spectroscopy Instruments for Corn Oils Prediction
Xien Yin Yap ... Kim Seng Chia
-
Xien Yin Yap, et. al.Xien Yin Yap ... Kim Seng Chia
05 Mar 2021
05 Mar 2021

A probabilistic framework for learning task relationships in multi-task learning
Yu Zhang
-
Yu ZhangYu Zhang
23 Dec 2014
23 Dec 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MT4MTL-KD: A Multi-Teacher Knowledge Distillation Framework for Triplet Recognition.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Medical Imaging