Colorectal cancer (CRC) remains a significant health concern, with colonoscopy serving as the gold standard for diagnosis. Accurately segmenting polyps from colonoscopy images is crucial for detecting polyps and preventing CRC. However, challenges such as varying polyp sizes, blurred edges, and uneven brightness hinder segmentation accuracy. Leveraging artificial intelligence (AI) and robot-assisted surgery mechanisms can aid surgeons and physicians in detecting and treating polyps. To address these challenges, we propose a Colorectal Network (CR-Net), an AI-based encoder-decoder network for precise polyp and surgical instrument segmentation. CR-Net incorporates a pre-trained Visual Geometry Group model with 16 convolution layers (VGG16), attention mechanisms, redesigned skip connections, and horizontal dense connections within a U-Net architecture. The VGG16 encoder captures robust visual features, while redesigned skip connections accommodate complex data dimensions, leading to enhanced segmentation outcomes. Horizontal dense connections transfer overlooked features from the encoder to subsequent layers, further improving segmentation accuracy. Additionally, a spatial attention block enhances spatial features and ensures compatibility during upsampling. Evaluation of datasets including the Kvasir segmentation (Kvasir-SEG) dataset, Computer Vision Center Clinic Database (CVC-ClinicDB), Kvasir-Instrument dataset, and University of Washington Sinus Surgery Live (UW-Sinus-Surgery-Live) dataset demonstrates CR-Net's superior performance, achieving Dice Similarity Coefficients of 96.21%, 96.54%, 96.32%, and 92.84%, respectively, surpassing previous methods. These results highlight CR-Net's potential in empowering healthcare professionals through advanced AI-driven engineering applications. By bridging AI techniques with engineering innovations, CR-Net represents a significant advancement in CRC diagnosis and treatment.
Read full abstract