Abstract Background Current heart failure guidelines emphasize the importance of timely detection of subclinical left ventricular (LV) remodelling and dysfunction for more precise risk stratification of asymptomatic subjects. Both LV diastolic dysfunction (LVDD) and LV hypertrophy (LVH) as assessed by echocardiography are known independent prognostic markers of future cardiovascular events in the community. However, selective screening strategies of individuals at risk who would benefit most from in-depth cardiac phenotyping are lacking. Purpose We assess the utility of several Machine Learning (ML) classifiers built on clinical and biochemical features for detecting subclinical LV abnormalities. Methods We included 1407 participants (mean age, 51 years, 51% women) randomly recruited from the general population. We used echocardiographic parameters reflecting LV diastolic function and structure to define LV abnormalities (LVDD, n=239; LVH, n=135). After that four supervised ML algorithms (Random Forest (RF), Gradient Boosting (GD), Stochastic Gradient Descent (SGD) and Support Vector Machines (SV)) were built based on routine clinical, hemodynamic and laboratory data (features; n=61) to categorize LVDD and LVH (two prediction tasks). We applied a 10-fold stratified cross-validation set-up. Results ML classifiers exhibited a high area under the ROC (AUC) for predicting LVDD with values between 88.5% and 93.1% (Figure, left panel). Age, BMI, different components of blood pressure, antihypertensive treatment, routine biomarkers such as serum electrolytes, creatinine, blood sugar, leptin, uric acid, lipid profile, as well as blood cell counts were the top selected features for predicting LVDD. Prediction AUC of ML algorithms for detection of LVH was somewhat lower than for LVDD and ranged from 72.5% to 78.7% (Figure, right panel). The top selected features for LVH classifier were similar to those of LVDD, but also included social class, serum gamma-glutamyl transferase, fasting insulin, plasma renin activity and cortisol. ROC curves (sensitivity-1-specificity) Conclusions ML algorithms combining routinely measured clinical and laboratory data have shown high accuracy of LVDD and LVH prediction. These ML classifiers might be useful to preselect individuals at risk for further in depth echocardiographic examination, monitoring and implementation of preventive strategies in order to delay transition to disease symptoms.
Read full abstract