Abstract

Integrating transformers and convolutional neural networks is a crucial and cutting-edge approach to medical image segmentation. Nonetheless, existing hybrid methods fail to fully leverage the strengths of both operators. During patch embedding, the patch-projection method ignores the two-dimensional structure and local spatial information within each patch, while a fixed patch size cannot effectively capture features with rich representations. Moreover, the computation of self-attention causes attention diffusion, which hinders the provision of precise details to the decoder while maintaining feature consistency. Finally, none of the existing methods establishes an efficient multi-scale modeling concept. To address these issues, we design the Collaborative Networks of Transformers and Convolutional neural networks (TC-CoNet), a general framework for accurate 3D medical image segmentation. First, we carefully design a precise patch embedding that generates 3D features with accurate spatial position information, laying a solid foundation for subsequent learning. TC-CoNet then builds the encoder-decoder backbone as an interlaced combination of the two operators, properly incorporating long-range dependencies and hierarchical object concepts at various scales. Furthermore, we employ a constricted attention bridge that constrains attention to local features, accurately guiding the recovery of detailed information while maintaining feature consistency. Finally, atrous spatial pyramid pooling is applied to the high-level feature maps to establish the concept of multi-scale objects. Extensive experiments on five challenging datasets (Synapse, ACDC, brain tumor segmentation, cardiac left atrium segmentation, and lung tumor segmentation) demonstrate that TC-CoNet outperforms state-of-the-art approaches in accuracy, transferability, and generalization.
These results fully demonstrate the efficacy of combining transformers and convolutional neural networks for medical image segmentation. Our code is freely available at: https://github.com/YongChen-Exact/TC-CoNet.
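The patch-embedding problem described above can be illustrated with a minimal NumPy sketch (our own illustration, not the paper's code, and `naive_patch_embed` is a hypothetical name): a naive patch projection cuts the volume into fixed-size cubes and flattens each one into a vector, so the 3D spatial layout inside every patch is discarded before the transformer ever sees it.

```python
import numpy as np

def naive_patch_embed(volume, patch=4):
    # volume: (D, H, W) 3D image; split into non-overlapping cubes
    D, H, W = volume.shape
    d, h, w = D // patch, H // patch, W // patch
    tokens = (volume[:d * patch, :h * patch, :w * patch]
              .reshape(d, patch, h, patch, w, patch)
              .transpose(0, 2, 4, 1, 3, 5)    # group the three patch axes together
              .reshape(d * h * w, patch**3))  # flatten: intra-patch layout is lost
    return tokens

vol = np.arange(8**3, dtype=np.float32).reshape(8, 8, 8)
tokens = naive_patch_embed(vol, patch=4)
print(tokens.shape)  # (8, 64): 8 tokens, each a flat 64-vector
```

Each row of `tokens` is just the raveled contents of one cube, which is why the abstract argues for a patch embedding that preserves spatial position information instead.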
