Accurate musculoseletal ultrasound (MSKUS) image segmentation is crucial for diagnosis and treatment planning. Compared with traditional segmentation methods, deploying deep learning segmentation methods that balance segmentation efficiency, accuracy, and model size on edge devices has greateradvantages. This paper aims to design a MSKUS image segmentation method that has fewer parameters, lower computation complexity and higher segmentationaccuracy. In this study, an attention mechanism-based lightweight UNet (AML-UNet) is designed to segment target muscle regions in MSKUS images. To suppress the transmission of redundant feature, Channel Reconstruction and Spatial Attention Module is designed in the encoding path. In addition, considering the inherent characteristic of MSKUS image, Multiscale Aggregation Module is developed to replace the skip connection architecture of U-Net. Deep supervision is also introduced to the decoding path to refine predicted masks gradually. Our method is evaluated on two MSKUS 2D-image segmentation datasets, including 3917 MSKUS and 1534 images respectively. In the experiments, a five-fold cross-validation method is adopted in ablation experiments and comparison experiments. In addition, Wilcoxon Signed-Rank Test and Bonferroni correction are employed to validate the significance level. 0.01 was used as the statistical significance level in ourpaper. AML-UNet yielded a mIoU of 84.17% and 90.14% on two datasets, representing a 3.38% ( ) and 3.48% ( ) over the Unext model. The number of parameters and FLOPs are only 0.21M and 0.96G, which are 1/34 and 1/29 of those in comparison withUNet. Our proposed model achieved superior results with fewer parameters while maintaining segmentation efficiency and accuracy compared to othermethods.
Read full abstract