Multilayer perceptron (MLP) networks have become a popular alternative to convolutional neural networks and transformers because of fewer parameters. However, existing MLP-based models improve performance by increasing model depth, which adds computational complexity when processing local features of images. To meet this challenge, we propose MSS-UNet, a lightweight convolutional neural network (CNN) and MLP model for the automated segmentation of skin lesions from dermoscopic images. Specifically, MSS-UNet first uses the convolutional module to extract local information, which is essential for precisely segmenting the skin lesion. We propose an efficient double-spatial-shift MLP module, named DSS-MLP, which enhances the vanilla MLP by enabling communication between different spatial locations through double spatial shifts. We also propose a module named MSSEA with multiple spatial shifts of different strides and lighter external attention to enlarge the local receptive field and capture the boundary continuity of skin lesions. We extensively evaluated the MSS-UNet on ISIC 2017, 2018, and PH2 skin lesion datasets. On three datasets, the method achieves IoU metrics of 85.01%±0.65, 83.65%±1.05, and 92.71%±1.03, with a parameter size and computational complexity of 0.33M and 15.98G, respectively, outperforming most state-of-the-art methods.The code is publicly available at https://github.com/AirZWH/MSS-UNet.
Read full abstract