Abstract

 
 
 Coronary heart disease (CHD) has been the leading cause of death worldwide for decades. The healthcare industry generates vast amounts of clinical data, driven by patient medical records, regulatory requirements, and the results of medical examinations. To identify the most relevant features for CHD, this study conducts an experimental evaluation of data-driven CHD diagnosis using classification algorithms. A statistical test (Chi-square) is used to find the most valuable features and risk factors associated with CHD. The purpose of this univariate feature selection method is to measure the difference between the observed results and the expected results. CHD is then predicted using several machine learning classification algorithms, including Logistic Regression, Complement Naïve Bayes, and Support Vector Machine (SVM). This study also evaluates ensemble machine learning algorithms, namely Random Forest, Gradient Boosting, and Extreme Gradient Boosting (XGBoost), to find the best-performing classification algorithm and to select essential features from the dataset. Holdout and cross-validation methods are used to split the dataset into a training set and a testing set. The performance of the algorithms is assessed in terms of metrics such as specificity and sensitivity. The results show that the Gradient Boosting model exhibits the best performance, with a sensitivity of 0.839.
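As a rough illustration of two quantities named above, the Chi-square statistic (the gap between observed and expected counts) and the sensitivity/specificity pair, the following minimal sketch computes both on toy data. The function names, the `smoker`, `unrelated`, and `chd` lists, and the predictions are hypothetical and invented for illustration only; they are not taken from the study's dataset or code.

```python
from collections import Counter


def chi_square_statistic(feature, label):
    """Pearson chi-square statistic for two categorical sequences.

    Sums (observed - expected)^2 / expected over every cell of the
    feature-by-label contingency table.
    """
    n = len(feature)
    observed = Counter(zip(feature, label))
    row_totals = Counter(feature)
    col_totals = Counter(label)
    stat = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            # Count expected if feature and label were independent.
            expected = r_total * c_total / n
            stat += (observed[(r, c)] - expected) ** 2 / expected
    return stat


def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels coded 0/1.

    Assumes both classes occur in y_true; otherwise a division by
    zero is raised.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical toy data (not from the study).
chd = [1, 1, 1, 0, 0, 0, 0, 1]
smoker = [1, 1, 1, 1, 0, 0, 0, 0]
unrelated = [1, 0, 1, 0, 1, 0, 1, 0]

score_smoker = chi_square_statistic(smoker, chd)
score_unrelated = chi_square_statistic(unrelated, chd)

y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 1, 0]
sens, spec = sensitivity_specificity(y_true, y_pred)
```

A feature with a larger statistic departs further from what independence with the label would predict, which is why a univariate filter ranks it higher. Sensitivity is the fraction of true CHD cases the classifier catches, and specificity is the fraction of non-cases it correctly clears.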
 
 
More From: International Journal of Application on Sciences, Technology and Engineering