Background and ObjectiveAs the most common cardiovascular defect, coronary artery disease (CAD), also called ischemic heart disease, is one of the substantial causes of death globally. Several diagnosis approaches such as baseline electrocardiography, echocardiography, magnetic resonance imaging, and coronary angiography are suggested for screening the suspected patients that may suffer from CAD. However, applying such methods may have health side effects and/or expensive costs. MethodsAs an alternative to the available diagnosis tools/methods, this research involves a decision tree learning algorithm called classification and regression tree (CART) for a simple and reliable diagnosis of CAD. Several CART models are developed based on the recently CAD dataset published in the literature. ResultsUtilizing all the features of the dataset (55 independent parameters), it was found that only 40 independent parameters influence the CAD diagnosis and consequently development of the predictive model. Based on the feature importance obtained from the first CART model, three new CART models are then developed using 18, 10, and 5 selected features. Except for the five-feature CART model, the outcomes of developed CART models demonstrate the maximum achievable accuracy, sensitivity, and specificity for CAD diagnosis (100%), while comparing the predictions with the reported targets. The error analysis reveals that the literature models including sequential minimal optimization (SMO), bagging SMO, Naïve Bayes (NB), artificial neural network (ANN), C4.5, J48, Bagging, and ANN in conjunction with the genetic algorithm (GA) do not outperform the CART methodology in classifying patients as normal or CAD. ConclusionsHence, the robustness of the tree-based algorithm in accurate and fast predictions is confirmed, implying the proposed classification technique can be successfully utilized to develop a coherent decision-making system for the CAD diagnosis.
Read full abstract