Abstract
Deep learning-based networks have become increasingly popular in medical image segmentation. The purpose of this research was to develop and optimize a new architecture for automatic segmentation of the prostate gland and normal organs in the pelvic, thoracic, and upper gastrointestinal (GI) regions. The architecture combines a shifted-window (Swin) Transformer with a convolutional U-Net. The network includes a parallel encoder, a cross-fusion block, and a CNN-based decoder to extract local and global information and merge related features at the same scale. A skip connection is applied between the cross-fusion block and the decoder to integrate low-level semantic features, and attention gates (AGs) are integrated within the CNN to suppress features in image background regions. We term our network "SwinAttUNet."

Training data consisted of planning-CT datasets from 300 prostate cancer patients in an institutional database and 100 CT datasets from a publicly available dataset (CT-ORG). Images were linearly interpolated and resampled to a spatial resolution of 1.0 × 1.0 × 1.5 mm³. A volume patch of 192 × 192 × 96 voxels was used for training and inference, and the data were split into training (75%), validation (10%), and test (15%) cohorts. Data augmentation consisted of random flips, rotations, and intensity scaling. The loss function comprised Dice and cross-entropy terms, equally weighted and summed. We evaluated Dice similarity coefficients (DSC), 95th-percentile Hausdorff distances (HD95), and average surface distances (ASD) between our network's results and ground-truth data.

For SwinAttUNet, DSC values were 86.54 ± 1.21, 94.15 ± 1.17, and 87.15 ± 1.68%, and HD95 values were 5.06 ± 1.42, 3.16 ± 0.93, and 5.54 ± 1.63 mm for the prostate, bladder, and rectum, respectively. Respective ASD values were 1.45 ± 0.57, 0.82 ± 0.12, and 1.42 ± 0.38 mm. For the lung, liver, kidneys, and pelvic bones, respective DSC values were 97.90 ± 0.80, 96.16 ± 0.76, 93.74 ± 2.25, and 89.31 ± 3.87%; respective HD95 values were 5.13 ± 4.11, 2.73 ± 1.19, 2.29 ± 1.47, and 5.31 ± 1.25 mm; and respective ASD values were 1.88 ± 1.45, 1.78 ± 1.21, 0.71 ± 0.43, and 1.21 ± 1.11 mm.

Our network outperformed several existing deep learning approaches that rely on attention-based convolutional or Transformer-based feature strategies alone, as detailed in the results section. We have demonstrated that our new architecture, which combines Transformer- and convolution-based features, better learns local and global context for automatic segmentation of multi-organ, CT-based anatomy.
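The resampling, patching, and augmentation steps described above can be sketched in code. The following is a minimal sketch using MONAI dictionary transforms; the abstract names no framework, and the dictionary keys, rotation range, and augmentation probabilities are illustrative assumptions. Only the 1.0 × 1.0 × 1.5 mm³ spacing, the 192 × 192 × 96 patch, and the flip/rotation/intensity-scaling augmentations come from the text.

```python
# Hypothetical preprocessing/augmentation pipeline (MONAI is an assumed
# framework choice, not stated in the paper).
from monai.transforms import (
    Compose, LoadImaged, EnsureChannelFirstd, Spacingd,
    RandSpatialCropd, RandFlipd, RandRotated, RandScaleIntensityd,
)

train_transforms = Compose([
    LoadImaged(keys=["image", "label"]),
    EnsureChannelFirstd(keys=["image", "label"]),
    # Linear interpolation for the image, nearest-neighbour for labels,
    # resampled to the 1.0 x 1.0 x 1.5 mm spacing from the abstract.
    Spacingd(keys=["image", "label"], pixdim=(1.0, 1.0, 1.5),
             mode=("bilinear", "nearest")),
    # Fixed-size volume patch used for training and inference.
    RandSpatialCropd(keys=["image", "label"],
                     roi_size=(192, 192, 96), random_size=False),
    # Augmentations named in the abstract; probabilities and the
    # ~15-degree rotation range are assumptions.
    RandFlipd(keys=["image", "label"], prob=0.5, spatial_axis=0),
    RandRotated(keys=["image", "label"], prob=0.5, range_x=0.26,
                mode=("bilinear", "nearest")),
    RandScaleIntensityd(keys="image", factors=0.1, prob=0.5),
])
```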
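The loss described above (Dice and cross-entropy, equally weighted and summed) maps directly onto MONAI's DiceCELoss with lambda_dice = lambda_ce = 1.0. A minimal sketch follows; the framework, the four-class label map (background plus prostate, bladder, rectum), and the tensor shapes are assumptions, though the patch size matches the abstract.

```python
# Hypothetical implementation of the equally weighted Dice + CE loss.
import torch
from monai.losses import DiceCELoss

# lambda_dice = lambda_ce = 1.0 gives the equal weighting described.
loss_fn = DiceCELoss(to_onehot_y=True, softmax=True,
                     lambda_dice=1.0, lambda_ce=1.0)

logits = torch.randn(1, 4, 192, 192, 96)            # (B, classes, D, H, W)
labels = torch.randint(0, 4, (1, 1, 192, 192, 96))  # integer class labels
loss = loss_fn(logits, labels)                      # scalar Dice + CE sum
```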
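The three reported metrics (DSC, HD95, ASD) can likewise be computed with MONAI's metric classes, as sketched below on one-hot masks; the class count and tensor shapes are illustrative assumptions, and distances are in voxel units here unless a spacing is supplied.

```python
# Hypothetical evaluation of DSC, HD95, and ASD on one-hot masks.
import torch
from monai.metrics import (
    DiceMetric, HausdorffDistanceMetric, SurfaceDistanceMetric,
)

dice = DiceMetric(include_background=False)
hd95 = HausdorffDistanceMetric(include_background=False, percentile=95)
asd = SurfaceDistanceMetric(include_background=False, symmetric=True)

# y_pred / y: one-hot tensors of shape (batch, classes, D, H, W).
y_pred = torch.nn.functional.one_hot(
    torch.randint(0, 4, (1, 96, 96, 48)), num_classes=4
).permute(0, 4, 1, 2, 3).float()
y = y_pred.clone()  # identical masks -> DSC = 1, HD95 = ASD = 0

for metric in (dice, hd95, asd):
    metric(y_pred=y_pred, y=y)
    print(metric.aggregate().tolist())
```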