Fast and accurate automatic segmentation of polyps in colonoscopy plays a crucial role in the early diagnosis and treatment of colon cancer. However, the current polyp segmentation algorithms based on deep neural networks suffer from the problems of larger models and lower segmentation accuracy. Meanwhile, achieving accurate segmentation of polyps is to improve the diagnostic efficiency of doctors, and this need motivates us to develop a set of lightweight models so that it can be easily embedded in clinical devices to meet the requirements of practical applications. This study aims to provide effective technical support for the rapid and precise segmentation of polyps in clinical applications. DeepNeXt, an innovative polyp segmentation model grounded in multi-scale attention mechanisms. DeepNeXt incorporates a multi-stage, lightweight convolutional encoder module, leveraging several lightweight convolutional layers for efficient and accurate feature extraction. Furthermore, it features a novel multi-stage feature fusion structure designed to circumvent the potential loss of feature information during the encoding phase. Additionally, the model employs a multi-scale attentional feature encoding module that harnesses multi-branch deep strip convolution techniques to extract multi-dimensional information from the feature maps post-encoding, thereby enhancing the neural network's capability to extract diverse feature information. Experimental validation on the Kvasir Segmentation Dataset (Kvasir-SEG dataset) and Colorectal Cancer-Clinic Datasetbase (CVC-ClinicDB datasets) demonstrates that DeepNeXt outperforms mainstream networks such as U-net, U-net++, TransUnet, SwinUnet, and TGANet in terms of parameters and floating-point operations (FLOPs). DeepNeXt achieved a FLOPs metric of only 3.04 G and a parameters (Params) metric of just 1.51 M, while delivering exceptional performance in segmentation. On the Kvasir-SEG dataset, it reached a mean intersection over union (mIOU) of 83.91, and on the CVC-ClinicDB dataset, 87.37. Additionally, the Dice and Recall metrics also showed superior results, highlighting that DeepNeXt strikes an optimal balance between computational efficiency, model compactness, and segmentation accuracy. In conclusion, we have proposed the DeepNeXt network, a novel lightweight multi-scale attention segmentation network tailored for computationally limited medical devices, which provides strong support for accurate and efficient polyp segmentation in clinical applications.
Read full abstract