Abstract

This study aims to develop a machine learning (ML) model to predict the risk of residual or recurrent high-grade cervical intraepithelial neoplasia (CIN) after loop electrosurgical excision procedure (LEEP), addressing a critical gap in personalized follow-up care. A retrospective analysis of 532 patients who underwent LEEP for high-grade CIN at Cangzhou Central Hospital (2016-2020) was conducted. In the final analysis, 99 women (18.6%) were found to have residual or recurrent high-grade CIN (CIN2 or worse) within five years of follow-up. Four feature selection methods identified significant predictors of residual or recurrent CIN. Eight ML algorithms were evaluated using performance metrics such as AUROC, accuracy, sensitivity, specificity, PPV, NPV, F1 score, calibration curve, and decision curve analysis. Fivefold cross-validation optimized and validated the model, and SHAP analysis assessed feature importance. The XGBoost algorithm demonstrated the highest predictive performance with the best AUROC. The optimized model included six key predictors: age, ThinPrep cytologic test (TCT) results, HPV classification, CIN severity, glandular involvement, and margin status. SHAP analysis identified CIN severity and margin status as the most influential predictors. An online prediction tool was developed for real-time risk assessment. This ML-based predictive model for post-LEEP high-grade CIN provides a significant advancement in gynecologic oncology, enhancing personalized patient care and facilitating early intervention and informed clinical decision-making.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.