Wireless capsule endoscopy (WCE) has emerged as a valuable non-invasive technique for visualizing the entire gastrointestinal (GI) tract. However, manual evaluation of WCE videos is a time-consuming and costly process. In this study, we present a novel diagnostic assistant system that employs deep convolutional neural networks (DCNNs) to accelerate the evaluation process. Our primary objective is to achieve a high negative predictive value (NPV), which is essential for the efficient identification of normal frames. Six distinct DCNN models were developed and implemented with this objective in mind. The models were trained on a limited dataset encompassing common GI pathologies that reflect real clinical scenarios. Each DCNN architecture comprises a convolutional part derived from renowned pre-trained networks and a custom-designed classifier block optimized for high NPV and classification accuracy. Following a comprehensive assessment utilizing the 5-fold cross-validation approach, the VG_BFCG model was identified as the most effective, exhibiting an average test accuracy of 0.946 and an NPV of 0.983. Moreover, in the event of encountering novel pathologies not present in the training data, our models exhibited robustness in NPV, which is of great importance for practical applications. For example, the DN_BFCG model demonstrated consistent performance, with an NPV exceeding 0.99 across a range of new pathologies. This validates the reliability of our models in clinical settings. Our findings suggest that our developed DCNN architectures have the potential to enhance the efficiency and accuracy of WCE video analysis, which could transform the landscape of gastroenterological diagnostics.
Read full abstract