BACKGROUND & AIMSHepatocellular carcinoma (HCC) incidence is increasing and correlated with metabolic dysfunction-associated steatotic liver disease (MASLD; formerly NAFLD), even in patients without advanced liver fibrosis who are more likely to be diagnosed with advanced disease stages and shorter survival time, and less likely to receive a liver transplant. Machine learning (ML) tools can characterize large datasets and help develop predictive models that can calculate individual HCC risk and guide selective screening and risk mitigation strategies. METHODSTableau and KNIME Analytics were used for descriptive analytics and machine learning tasks. ML models were developed using standard laboratory and clinical parameters. Sci-kit learn algorithms were used for model development. Data from UC Davis was used to develop and train a pilot predictive model, which was subsequently validated in an independent dataset from UC San Francisco (UCSF). MASLD and HCC patients were identified by ICD-9/10 codes. RESULTSOf the patients diagnosed with MASLD (n=1,561 training; n=686 validation), HCC developed in 14% (n=227) of the UC Davis training cohort and 25% (n=176) of the UCSF validation cohort. Liver fibrosis determined by the non-invasive FIB-4 score was the strongest single predictor for HCC in the model. Using the validation cohort, the model predicted HCC development at 92.06% accuracy with an AUC of 0.97, F1-score of 0.84, 98.34% specificity, and 74.41% sensitivity. CONCLUSIONSML models can aid physicians in providing early HCC risk assessment in patients with MASLD. Further validation will translate to cost-effective, personalized care of at-risk patients.
Read full abstract