Dual-path Network for Liver and Tumor Segmentation in CT Images Using Swin Transformer Encoding Approach.

Shuzhou Li,Zhen Yang

doi:10.2174/1573405619666221014114953

Abstract

Liver and tumor segmentation from CT images is a complex and crucial step in achieving full-course adaptive radiotherapy and also plays an essential role in computer-aided clinical diagnosis systems. Deep learning-based methods play an important role in achieving automatic segmentation. This research aims to improve liver tumor detection performance by proposing a dual path feature extracting strategy and employing Swin-Transformer. The hierarchical Swin-Transformer is embedded into the encoder and decoder and combined with CNN to form a dual coding path structure incorporating long-range dependencies and multi-scale contextual connections to capture coarse-tuned features at different semantic scales. The features of the two encoding paths and the upsampling path are fused, tested and validated with LITS and in-house datasets. The proposed method has a DG of 97.95% and a DC of 96.2% for liver segmentation; a DG of 80.6% and a DC of 68.1% for tumor segmentation; and a classification study of the tumor dataset shows a DG of 91.1% and a DC of 87.2% for large and continuous tumors and a DG of 71.7% and a DC of 66.4% for small and scattered tumors. Swin-Transformer can be used as a robust encoder for medical image segmentation networks and, combined with CNN networks, can better recover local spatial information and enhance feature representation. Accurate localization before segmentation can achieve better results for small and scattered tumors.

Full Text