Early detection of nasopharyngeal carcinoma through machine-learning-driven prediction model in a population-based healthcare record database.

Jeng-Wen Chen,Bo-Sian Wang,Shih-Tsang Lin,Yu-Ning Chien,Hung-Yi Chiou,Yi-Chun Lin

doi:10.1002/cam4.7144

Abstract

Early diagnosis and treatment of nasopharyngeal carcinoma (NPC) are vital for a better prognosis. Still, because of obscure anatomical sites and insidious symptoms, nearly 80% of patients with NPC are diagnosed at a late stage. This study aimed to validate a machine learning (ML) model utilizing symptom-related diagnoses and procedures in medical records to predict nasopharyngeal carcinoma (NPC) occurrence and reduce the prediagnostic period. Data from a population-based health insurance database (2001-2008) were analyzed, comparing adults with and without newly diagnosed NPC. Medical records from 90 to 360 days before diagnosis were examined. Five ML algorithms (Light Gradient Boosting Machine [LGB], eXtreme Gradient Boosting [XGB], Multivariate Adaptive Regression Splines [MARS], Random Forest [RF], and Logistics Regression [LG]) were evaluated for optimal early NPC detection. We further use a real-world data of 1 million individuals randomly selected for testing the final model. Model performance was assessed using AUROC. Shapley values identified significant contributing variables. LGB showed maximum predictive power using 14 features and 90 days before diagnosis. The LGB models achieved AUROC, specificity, and sensitivity were 0.83, 0.81, and 0.64 for the test dataset, respectively. The LGB-driven NPC predictive tool effectively differentiated patients into high-risk and low-risk groups (hazard ratio: 5.85; 95% CI: 4.75-7.21). The model-layering effect is valid. ML approaches using electronic medical records accurately predicted NPC occurrence. The risk prediction model serves as a low-cost digital screening tool, offering rapid medical decision support to shorten prediagnostic periods. Timely referral is crucial for high-risk patients identified by the model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Early detection of nasopharyngeal carcinoma through machine-learning-driven prediction model in a population-based healthcare record database.

Abstract

Talk to us

Similar Papers

More From: Cancer Medicine

Lead the way for us

Journal: Cancer Medicine	Publication Date: Mar 28, 2024
License type: CC BY 4.0

Similar Papers

Evaluation of multiple antibodies to Epstein-Barr virus as markers for detecting patients with nasopharyngeal carcinoma
Mei-Ying Liu ... Jeng Ma
Journal of Medical Virology | VOL. 52
Mei-Ying Liu, et. al.Mei-Ying Liu ... Jeng Ma
01 Jul 1997
Journal of Medical Virology | VOL. 52

A prognostic scoring system for locoregional control in nasopharyngeal carcinoma following conformal radiotherapy
Skye Hongiun Cheng ... K Lawrence Yen
International Journal of Radiation Oncology*Biology*Physics | VOL. 66
Skye Hongiun Cheng, et. al.Skye Hongiun Cheng ... K Lawrence Yen
18 Sep 2006
International Journal of Radiation Oncology*Biology*Physics | VOL. 66

Uncovering nasopharyngeal carcinoma from chronic rhinosinusitis and healthy subjects using routine medical tests via machine learning.
Qi Liu ... Ruxu Du
PloS one | VOL. 17
Qi Liu, et. al.Qi Liu ... Ruxu Du
09 Sep 2022
PloS one | VOL. 17

Reserch progress of nasopharyngeal carcinoma related gene
...
Journal of International Oncology | VOL. 40
, et. al. ...
08 Sep 2013
Journal of International Oncology | VOL. 40

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Early detection of nasopharyngeal carcinoma through machine-learning-driven prediction model in a population-based healthcare record database.

Abstract

Talk to us

Similar Papers

More From: Cancer Medicine