Abstract

Medical image segmentation aims to delineate objects of interest from surrounding tissues and structures, which is essential for reliable diagnosis and morphological analysis of specific lesions. Automatic medical image segmentation has been significantly advanced by deep Convolutional Neural Networks (CNNs). However, CNNs usually fail to model long-range interactions due to the intrinsic locality of convolutional operations, which limits segmentation performance. Recently, the Transformer has been successfully applied to various computer vision tasks, leveraging the self-attention mechanism to model long-range interactions and capture global information. Nevertheless, self-attention lacks spatial locality and computational efficiency. To address these issues, we develop a new sparse medical Transformer (SMTF) with multiscale contextual fusion for medical image segmentation. The proposed model combines convolutional operations and attention mechanisms in a U-shaped framework to capture both local and global information. Specifically, to reduce the computational cost of the traditional Transformer, we design a novel sparse attention module that constructs Transformer layers via spherical Locality Sensitive Hashing (LSH). The sparse attention partitions the feature space into attention buckets, and attention is computed only within each individual bucket. The designed sparse Transformer layer is further combined with a bottleneck block to construct the encoder of SMTF. Notably, the proposed sparse Transformer can also aggregate global feature information in early stages, which enables the model to learn more local and global information by incorporating CNNs at the lower layers. Furthermore, we introduce a deep supervision strategy to guide the model in fusing multiscale feature information. This enables SMTF to effectively propagate features across layers, preserving more of the input's spatial information and mitigating information attenuation. Benefiting from these designs, SMTF achieves better segmentation performance while being more robust and efficient. The proposed SMTF is evaluated on multiple medical image segmentation datasets and a clinical nasopharyngeal carcinoma dataset. Extensive experiments demonstrate its superiority in both qualitative and quantitative evaluations. Code and models are available at https://github.com/qmx717/sparse-attention.git.
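
To make the bucketing idea concrete, the following is a minimal PyTorch sketch of spherical-LSH-based sparse attention. It is not the authors' released implementation (see the linked repository for that): the hash follows the cross-polytope/spherical LSH scheme the abstract describes, but the function names (`spherical_lsh_buckets`, `bucketed_attention`), the shared query+key hash, and the per-bucket Python loop are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def spherical_lsh_buckets(x, n_buckets, seed=0):
    """Assign each token to a bucket via random spherical projections.

    x: (batch, seq_len, dim) token features, assumed L2-normalized.
    Returns integer bucket ids of shape (batch, seq_len).
    n_buckets must be even (each hyperplane yields a +/- bucket pair).
    """
    g = torch.Generator().manual_seed(seed)
    # Random projection directions; taking argmax over signed projections
    # implements a cross-polytope (spherical) LSH: nearby points on the
    # sphere are likely to land in the same bucket.
    proj = torch.randn(x.size(-1), n_buckets // 2, generator=g)
    rotated = torch.einsum("bld,dk->blk", x, proj)
    rotated = torch.cat([rotated, -rotated], dim=-1)  # (batch, seq, n_buckets)
    return rotated.argmax(dim=-1)

def bucketed_attention(q, k, v, n_buckets=8):
    """Compute attention only within LSH buckets.

    Cost scales with the bucket sizes rather than the full O(n^2)
    of dense self-attention.
    """
    b, n, d = q.shape
    # Hash queries and keys jointly so that matching tokens share a bucket.
    buckets = spherical_lsh_buckets(F.normalize(q + k, dim=-1), n_buckets)
    out = torch.zeros_like(v)
    for bucket_id in range(n_buckets):
        mask = buckets == bucket_id  # (batch, seq) boolean membership
        for i in range(b):
            idx = mask[i].nonzero(as_tuple=True)[0]
            if idx.numel() == 0:
                continue
            qi, ki, vi = q[i, idx], k[i, idx], v[i, idx]
            # Standard scaled dot-product attention, restricted to the bucket.
            attn = torch.softmax(qi @ ki.T / d ** 0.5, dim=-1)
            out[i, idx] = attn @ vi
    return out

if __name__ == "__main__":
    q = k = v = torch.randn(2, 128, 64)
    y = bucketed_attention(q, k, v, n_buckets=8)
    print(y.shape)  # torch.Size([2, 128, 64])
```

A practical implementation would sort tokens by bucket id and process equal-sized chunks in parallel instead of looping in Python, but the restriction of the softmax to each bucket is the mechanism that yields the sparsity described above.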
