Abstract

Colonoscopy is considered the gold standard for colorectal cancer diagnosis and prognosis. However, existing methods are less accurate and prone to overlooking lesions during gastrointestinal endoscopic examinations. Computer-assisted diagnosis combined with robot-assisted minimally invasive surgery (RMIS) can significantly help medical practitioners detect and treat lesions. Therefore, two novel architectures are developed for polyp and surgical instrument segmentation to aid colorectal cancer diagnosis, assessment, and treatment. Colorectal cancer segmentation network (CCS-Net) is the base network used in this study. It uses the maximum convolutional layers near the input image for effective feature extraction from low-level information. In addition, CCS-Net uses an efficient feature upsampling unit to efficiently increase the input spatial features’ map size. Hence, CCS-Net is capable of providing a fair performance with satisfactory computational efficiency The multi-scale feature retention and aggregation network (MFRA-Net) is the final network in this study. MFRA-Net is developed to improve the segmentation accuracy of the CCS-Net further as it uses multi-scale feature retention to retain low-level spatial features and transfers them to deep stages of the network. MFRA-Net also combines multi-scale high-strided low-level information with high-level information to boost network segmentation performance. Finally, all the transferred multi-scale features from the early stages of the network are aggregated with high-level features in the deep levels of the network. This multi-scale feature retention and aggregation mechanism enables the network to maintain a better segmentation performance compared with other methods even with challenging blur, specular reflection, low contrast, and high variation cases. We evaluated both architectures on four challenging datasets: Kvasir-SEG, CVC-ClinicDB, Kvasir-Instrument, and the UW-Sinus-Surgery-Live dataset. The proposed method achieves dice similarity coefficients of 95.98%, 94.19%, 92.81%, and 88.57% for the CVC-ClinicDB, Kvasir-SEG, Kvasir-Instrument, and UW-Sinus-Surgery-Live datasets. The proposed method achieves superior segmentation performance compared with state-of-the-art methods and requires only 4.9 million trainable parameters for complete training. Therefore, the proposed networks can effectively assist health professionals in surgical procedures and colorectal cancer diagnosis through surgical instruments and polyp segmentation, respectively.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call