Abstract

Liver and tumor segmentation from CT images is a complex and crucial step in achieving full-course adaptive radiotherapy and also plays an essential role in computer-aided clinical diagnosis systems. Deep learning-based methods play an important role in achieving automatic segmentation. This research aims to improve liver tumor detection performance by proposing a dual path feature extracting strategy and employing Swin-Transformer. The hierarchical Swin-Transformer is embedded into the encoder and decoder and combined with CNN to form a dual coding path structure incorporating long-range dependencies and multi-scale contextual connections to capture coarse-tuned features at different semantic scales. The features of the two encoding paths and the upsampling path are fused, tested and validated with LITS and in-house datasets. The proposed method has a DG of 97.95% and a DC of 96.2% for liver segmentation; a DG of 80.6% and a DC of 68.1% for tumor segmentation; and a classification study of the tumor dataset shows a DG of 91.1% and a DC of 87.2% for large and continuous tumors and a DG of 71.7% and a DC of 66.4% for small and scattered tumors. Swin-Transformer can be used as a robust encoder for medical image segmentation networks and, combined with CNN networks, can better recover local spatial information and enhance feature representation. Accurate localization before segmentation can achieve better results for small and scattered tumors.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call