Enhancing surgical instrument segmentation: integrating vision transformer insights with adapter

Meng Wei,Miaojing Shi,Tom Vercauteren

doi:10.1007/s11548-024-03140-z

Abstract

PurposeIn surgical image segmentation, a major challenge is the extensive time and resources required to gather large-scale annotated datasets. Given the scarcity of annotated data in this field, our work aims to develop a model that achieves competitive performance with training on limited datasets, while also enhancing model robustness in various surgical scenarios.MethodsWe propose a method that harnesses the strengths of pre-trained Vision Transformers (ViTs) and data efficiency of convolutional neural networks (CNNs). Specifically, we demonstrate how a CNN segmentation model can be used as a lightweight adapter for a frozen ViT feature encoder. Our novel feature adapter uses cross-attention modules that merge the multiscale features derived from the CNN encoder with feature embeddings from ViT, ensuring integration of the global insights from ViT along with local information from CNN.ResultsExtensive experiments demonstrate our method outperforms current models in surgical instrument segmentation. Specifically, it achieves superior performance in binary segmentation on the Robust-MIS 2019 dataset, as well as in multiclass segmentation tasks on the EndoVis 2017 and EndoVis 2018 datasets. It also showcases remarkable robustness through cross-dataset validation across these 3 datasets, along with the CholecSeg8k and AutoLaparo datasets. Ablation studies based on the datasets prove the efficacy of our novel adapter module.ConclusionIn this study, we presented a novel approach integrating ViT and CNN. Our unique feature adapter successfully combines the global insights of ViT with the local, multi-scale spatial capabilities of CNN. This integration effectively overcomes data limitations in surgical instrument segmentation. The source code is available at: https://github.com/weimengmeng1999/AdapterSIS.git.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enhancing surgical instrument segmentation: integrating vision transformer insights with adapter

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Assisted Radiology and Surgery

Lead the way for us

Journal: International Journal of Computer Assisted Radiology and Surgery	Publication Date: May 8, 2024
License type: CC BY 4.0

Similar Papers

Dual-stage semantic segmentation of endoscopic surgical instruments.
Wenxin Chen ... Xingguang Duan
Medical physics | VOL. -
Wenxin Chen, et. al.Wenxin Chen ... Xingguang Duan
10 Sep 2024
Medical physics | VOL. -

SSIS-Seg: Simulation-Supervised Image Synthesis for Surgical Instrument Segmentation.
Emanuele Colleoni ... Dimitris Psychogyios
IEEE Transactions on Medical Imaging | VOL. 41
Emanuele Colleoni, et. al.Emanuele Colleoni ... Dimitris Psychogyios
01 Nov 2022
IEEE Transactions on Medical Imaging | VOL. 41

An attention-guided network for surgical instrument segmentation from endoscopic images
Lei Yang ... Yanhong Liu
Computers in Biology and Medicine | VOL. 151
Lei Yang, et. al.Lei Yang ... Yanhong Liu
24 Oct 2022
Computers in Biology and Medicine | VOL. 151

A Holistically-Nested U-Net: Surgical Instrument Segmentation Based on Convolutional Neural Network.
Lingtao Yu ... Xiaoyan Yu
Journal of Digital Imaging | VOL. 33
Lingtao Yu, et. al.Lingtao Yu ... Xiaoyan Yu
08 Oct 2019
Journal of Digital Imaging | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhancing surgical instrument segmentation: integrating vision transformer insights with adapter

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Assisted Radiology and Surgery