H2Former: An Efficient Hierarchical Hybrid Transformer for Medical Image Segmentation.

Along He, Chengkun Du, Tao Li, Shuang Xia, Kai Wang, Huazhu Fu

doi:10.1109/tmi.2023.3264513

Abstract

Accurate medical image segmentation is of great significance for computer-aided diagnosis. Although methods based on convolutional neural networks (CNNs) have achieved good results, they are weak at modeling long-range dependencies, which segmentation tasks need in order to build global context. Transformers can establish long-range dependencies among pixels through self-attention, complementing local convolution. In addition, multi-scale feature fusion and feature selection are crucial for medical image segmentation, yet Transformers ignore them. However, directly applying self-attention to CNNs is challenging because of its quadratic computational complexity on high-resolution feature maps. Therefore, to integrate the merits of CNNs, multi-scale channel attention, and Transformers, we propose an efficient hierarchical hybrid vision Transformer (H2Former) for medical image segmentation. With these merits, the model can be data-efficient in the limited-data regime typical of medical imaging. The experimental results show that our approach outperforms previous Transformer, CNN, and hybrid methods on three 2D and two 3D medical image segmentation tasks. Moreover, it remains computationally efficient in terms of model parameters, FLOPs, and inference time. For example, H2Former outperforms TransUNet by 2.29% in IoU score on the KVASIR-SEG dataset with 30.77% of the parameters and 59.23% of the FLOPs.
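The quadratic cost mentioned in the abstract can be made concrete with a rough count. The sketch below is not taken from the paper: the function name `attention_cost`, the feature-map sizes, and the channel width of 64 are illustrative assumptions. It counts only the two large matrix products of one global self-attention layer and ignores projections, multiple heads, and softmax.

```python
def attention_cost(height, width, channels):
    """Rough cost of one global self-attention layer over an H x W feature map.

    Hypothetical helper for illustration; not part of H2Former.
    """
    tokens = height * width            # every spatial position becomes one token
    score_entries = tokens * tokens    # size of the N x N attention score matrix
    # Q @ K^T and scores @ V each take about N^2 * C multiply-accumulates.
    macs = 2 * tokens * tokens * channels
    return tokens, score_entries, macs


# Doubling the side length quadruples the tokens and multiplies the
# score-matrix size by 16. The resolutions here are assumed, not from the paper.
for side in (14, 28, 56):
    tokens, entries, macs = attention_cost(side, side, channels=64)
    print(f"{side}x{side}: tokens={tokens}, score entries={entries}, MACs={macs}")
```

Under these assumptions, the score matrix grows with the square of the number of positions, which is why global self-attention becomes expensive on the high-resolution feature maps used for segmentation.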

Journal: IEEE Transactions on Medical Imaging
Publication Date: Sep 1, 2023
Citations: 50

Similar Papers

IBA-U-Net: Attentive BConvLSTM U-Net with Redesigned Inception for medical image segmentation
Siyuan Chen ... Peter X Liu
Computers in Biology and Medicine | VOL. 135
12 Jun 2021

CA-Net: Comprehensive Attention Convolutional Neural Networks for Explainable Medical Image Segmentation.
Ran Gu ... Tom Vercauteren
IEEE Transactions on Medical Imaging | VOL. 40
01 Feb 2021

DECTNet: Dual Encoder Network combined convolution and Transformer architecture for medical image segmentation.
Boliang Li ... Yan Wang
PLOS ONE | VOL. 19
04 Apr 2024

Cross-domain attention-guided generative data augmentation for medical image analysis with limited data
Zhenghua Xu ... Thomas Lukasiewicz
Computers in Biology and Medicine | VOL. 168
23 Nov 2023
