Abstract

Heart disease is a prevalent condition nowadays that, if undiagnosed, can be deadly. To predict heart disease, researchers designed many machine learning models. In this study, we propose a model that chooses fewer attribute columns for training, and we use these chosen features to determine the heart problem severity. Correlation Repeated Heat map and Information Gain were used for selecting the features. To train our model we used the UCI Cleveland heart disease dataset. We removed duplicate data to improve the accuracy score, and we also encoded the categorical data collection using the OneHot(OH) encoding method, which can improve prediction accuracy. Support Vector, Logistic Regression, K-Nearest Neighbour, Naive Bayes, Decision Tree, Random Forest, Adaboost, and XGBoost are the eight classifier algorithms that are used in this process overall. Based on repeated heat map correlation, we compare the accuracy score each time. In this proposed method, the Adaboost classification algorithm used by the fbs row heat map achieves the highest accuracy for heart disease detection and it is 92%. By choosing features according to the information gain value, we compare the accuracy score each time in information gain. For both XGBoost and Logistic Regression, we got an accuracy score of 93.44%. However, compared to the XGBoost classification technique, Logistic Regression requires less time. Accuracy, precision, recall, f1-score, sensitivity, specificity, and the AUC of ROC charts were used to evaluate the performance of the model. Overall, the results of our model demonstrate that it is reliable and accurate in identifying cardiac disease and its level of severeness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.