Abstract

Abstract Background Crohn’s disease has the potential to involve all segments of the gastrointestinal tract, requiring frequent and judicious clinical and endoscopic surveillance. Capsule endoscopy (CE) has revolutionized the management of these patients, playing a role in diagnosis, classification, and assessment of response to treatment. In 2017, PillCam™ Crohn’s Capsule (PCC) was introduced to provide a panenteric assessment to this panenteric disease. However, reviewing each PCC exam is time-consuming and prone to errors as it implies examining thousands of images while significant lesions may only be represented in a small number of frames. Recently, the development of artificial intelligence (AI) algorithms for automatic interpretation of endoscopic images has generated a large interest. The performance of these tools, particularly convolutional neural networks have shown potential to overcome some of the main drawbacks of CE. This multicentric study aims to assess the performance of a deep learning algorithm for the automatic detection of small intestinal and colonic ulcers and erosions in PCC images. Methods We included a total of 59 PCC exams from two centres (ManopH Gastroenterology Clinic and Centro Hospitalar Universitário de São João) performed between 2017 and 2021. A total of 78415 frames were extracted. This pool of images was constituted by 14124 images depicting ulcers and erosions and 64291 with normal enteric and colonic mucosa. For the automatic identification of these findings, this image pool was split into training and validation datasets in a patient-based manner. Therefore, images from the same patient were restricted to a single dataset. A CNN model with transfer learning was developed using tensorflow and keras tools. Results The neural architecture of our CNN was optimized for the detection of enteric and colonic ulcers and erosions. An example of the output provided by the CNN is shown in Figure 1. The deep learning model had a sensitivity of 86.7%, a specificity of 98.6%, a positive predictive value of 92.3% and a negative predictive value of 97.4%. The overall accuracy of our network was 96.6% (Figure 2). At a threshold value of 50%, our CNN had an area under the curve of 0.98 for the detection of ulcers and erosions (Figure 3). Conclusion In this multicentric study, our group has developed a CNN and assessed its performance in a patient-based design, which provides significant robustness to our results. Patient-split analyses are the first step towards real-life application of AI solutions which, allied to the high-performance results of our model, may significantly improve the diagnostic yield of this panendoscopic tool.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.