Slimmable transformer with hybrid axial-attention for medical image segmentation

Yiyue Hu,Nan Mu,Lei Liu,Lei Zhang,Jingfeng Jiang,Xiaoning Li

doi:10.1016/j.compbiomed.2024.108370

Abstract

The transformer architecture has achieved remarkable success in medical image analysis owing to its powerful capability for capturing long-range dependencies. However, due to the lack of intrinsic inductive bias in modeling visual structural information, the transformer generally requires a large-scale pre-training schedule, limiting the clinical applications over expensive small-scale medical data. To this end, we propose a slimmable transformer to explore intrinsic inductive bias via position information for medical image segmentation. Specifically, we empirically investigate how different position encoding strategies affect the prediction quality of the region of interest (ROI) and observe that ROIs are sensitive to different position encoding strategies. Motivated by this, we present a novel Hybrid Axial-Attention (HAA) that can be equipped with pixel-level spatial structure and relative position information as inductive bias. Moreover, we introduce a gating mechanism to achieve efficient feature selection and further improve the representation quality over small-scale datasets. Experiments on LGG and COVID-19 datasets prove the superiority of our method over the baseline and previous works. Internal workflow visualization with interpretability is conducted to validate our success better; the proposed slimmable transformer has the potential to be further developed into a visual software tool for improving computer-aided lesion diagnosis and treatment planning.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Slimmable transformer with hybrid axial-attention for medical image segmentation

Abstract

Talk to us

Similar Papers

More From: Computers in Biology and Medicine

Lead the way for us

Journal: Computers in Biology and Medicine	Publication Date: Mar 29, 2024
Citations: 1

Similar Papers

Comparison of the retrieval of item versus spatial position information.
Scott D Gronlund ... Mark B Edwards
Journal of experimental psychology. Learning, memory, and cognition | VOL. 23
Scott D Gronlund, et. al.Scott D Gronlund ... Mark B Edwards
01 Jan 1997
Journal of experimental psychology. Learning, memory, and cognition | VOL. 23

Working memory for patterned sequences of auditory objects in a songbird
Jordan A Comins ... Timothy Q Gentner
Cognition | VOL. 117
Jordan A Comins, et. al.Jordan A Comins ... Timothy Q Gentner
16 Jul 2010
Cognition | VOL. 117

FMRI correlates of working memory: Specific posterior representation sites for motion and position information
Katja Umla-Runge ... Wolfgang Reith
Brain Research | VOL. 1382
Katja Umla-Runge, et. al.Katja Umla-Runge ... Wolfgang Reith
26 Jan 2011
Brain Research | VOL. 1382

Robust consensus tracking for a class of heterogeneous second‐order nonlinear multi‐agent systems
Chuanrui Wang ... Haibo Ji
International Journal of Robust and Nonlinear Control | VOL. 25
Chuanrui Wang, et. al.Chuanrui Wang ... Haibo Ji
31 Oct 2014
International Journal of Robust and Nonlinear Control | VOL. 25

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Slimmable transformer with hybrid axial-attention for medical image segmentation

Abstract

Talk to us

Similar Papers

More From: Computers in Biology and Medicine