This work addresses the critical task of polyp detection and classification using the SUN colonoscopy video database, which consists of still images annotated with bounding boxes. These images categorize frames into polyp and non-polyp and encompass six distinct classes of polyps: Hyperplastic polyp, Sessile serrated lesion, Low-grade adenoma, Traditional serrated adenoma, High-grade adenoma, and Invasive carcinoma. The approach involves a two-stage classification process. Initially, MobileNetV2 is employed to distinguish between polyp and non-polyp frames. Subsequently, ResNet50 and GoogLeNet are utilized to classify the identified polyps into the six predefined categories. Data augmentation techniques are implemented to address the inherent imbalance in class distribution within the dataset, enhancing model performance and generalizability. The results highlight the effectiveness of GoogLeNet, which achieved an impressive accuracy of 98%, significantly outperforming ResNet50's accuracy of 76.16%. This substantial improvement underscores the potential of GoogLeNet in enhancing the accuracy of polyp classification. The significance of this work lies in its contribution to advancing automated polyp detection and cancer stage classification, crucial for early diagnosis and treatment. These findings provide a foundation for further research and development in this domain, with the potential to improve clinical outcomes through more accurate and timely identification of colorectal polyps.
Read full abstract