Abstract

Coronary angiography (CAG) is invasive and expensive, while numbers of patients suspected of coronary artery disease (CAD) undergoing CAG results have no coronary lesions. To develop machine learning algorithms using symptoms and clinical variables to predict CAD. This study was conducted as a cross-sectional study of patients undergoing CAG. We randomly chose 2082 patients from 2602 patients suspected of CAD as the training set, and 520 patients as the test set. We utilized LASSO regression to do feature selection. The area under the receiver operating characteristic curve (AUC), confusion matrix of different thresholds, positive predictive value (PPV) and negative predictive value (NPV) were shown. Support vector machine algorithm performances in 10 folds were conducted in the training set for detecting severe CAD, while XGBoost algorithm performances were conducted in the test set for detecting severe CAD. The algorithm of logistic regression achieved an average AUC of 0.77 in the training set during 10-fold validation and an AUC of 0.75 in the test set. When probability predicted by the model was less than 0.1, 11 patients in the test set (520 patients) were screened out, and NPV reached 90.9%. When probability predicted by the model was less than 0.2, 110 patients in the test set were screened out, and reached 83.6%. Meanwhile, when threshold was set to 0.9, PPV reached 97.4%. When the threshold was set to 0.8, PPV reached 91.5%. Machine learning algorithm using data from hospital information systems could assist in severe CAD exclusion and confirmation, and thus help patients avoid unnecessary CAG.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call