Histological remission is a desirable emerging treatment target in ulcerative colitis (UC), yet its assessment is challenging because of high inter- and intraobserver variability, reliance on experts, and a lack of standardization. Artificial intelligence (AI) holds promise in addressing these issues. This systematic review, meta-analysis, and meta-regression evaluated the performance of AI in assessing histological remission and compared it with that of pathologists. We searched the Medline/PubMed and Scopus databases from inception to September 2024. We included studies of AI models assessing histological activity in UC, with or without comparison to pathologists. We calculated pooled performance metrics: sensitivity, specificity, positive and negative predictive values (PPV and NPV), observed agreement, and F1 score. A pairwise meta-analysis compared AI with pathologists, while sub-meta-analysis and meta-regression evaluated heterogeneity and the factors influencing AI performance. Twelve studies met the inclusion criteria. AI models exhibited strong performance, with a pooled sensitivity of 0.84 (95% CI, 0.80-0.88), specificity of 0.87 (0.84-0.91), PPV of 0.90 (0.87-0.92), NPV of 0.80 (0.71-0.88), observed agreement of 0.85 (0.82-0.89), and F1 score of 0.85 (0.82-0.89). AI models did not differ significantly from pathologists in specificity, observed agreement, or F1 score, but were outperformed by pathologists in sensitivity and NPV. In meta-regression, AI models for the adult population were associated with reduced heterogeneity and better performance. AI shows significant potential for assessing histological remission in UC and performs comparably to pathologists on several metrics. Future research should focus on standardized, large-scale studies to minimize heterogeneity and support widespread AI implementation in clinical practice.
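For readers less familiar with the metrics named above, the following minimal Python sketch shows how each one is conventionally derived from a single 2x2 confusion matrix. It is illustrative only: the class `ConfusionCounts`, the function `diagnostic_metrics`, and the example counts are hypothetical and are not taken from any included study. Which class counts as "positive" (histological activity or remission) is also an assumption here, and the sketch does not reproduce the pooling step of a meta-analysis, which combines study-level estimates with a statistical model rather than summing counts.

```python
from dataclasses import dataclass


@dataclass
class ConfusionCounts:
    """Counts from one 2x2 table of AI calls versus the reference standard."""
    tp: int  # AI positive, reference positive
    fp: int  # AI positive, reference negative
    fn: int  # AI negative, reference positive
    tn: int  # AI negative, reference negative


def _ratio(numerator: int, denominator: int) -> float:
    """Return numerator/denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else float("nan")


def diagnostic_metrics(c: ConfusionCounts) -> dict[str, float]:
    """Compute the six per-study metrics listed in the abstract."""
    total = c.tp + c.fp + c.fn + c.tn
    return {
        "sensitivity": _ratio(c.tp, c.tp + c.fn),
        "specificity": _ratio(c.tn, c.tn + c.fp),
        "ppv": _ratio(c.tp, c.tp + c.fp),
        "npv": _ratio(c.tn, c.tn + c.fn),
        "observed_agreement": _ratio(c.tp + c.tn, total),
        # F1 is the harmonic mean of PPV (precision) and sensitivity (recall).
        "f1": _ratio(2 * c.tp, 2 * c.tp + c.fp + c.fn),
    }


# Hypothetical counts chosen for illustration; not data from the review.
example = ConfusionCounts(tp=84, fp=13, fn=16, tn=87)
metrics = diagnostic_metrics(example)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Note that PPV and NPV depend on how common the positive class is in the sample, so values from studies with different case mixes are not directly comparable without accounting for prevalence.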