Cardiovascular disease remains a global health concern, being a leading cause of mortality. In response to the critical nature of heart-related conditions, this research focuses on the development of smart systems leveraging machine learning algorithms for accurate and timely diagnosis. The study explores various machine learning approaches to predict recurrence of heart diseases based on patient data encompassing key health factors. With heart disease being a prevalent cause of death worldwide, real-time forecasting methods from medical data sources have become crucial. The implementation of machine learning in healthcare demonstrates its potential for early and precise recurrence of disease detection. Despite the abundance of health information generated by medical institutions, there is an underutilization of this data, leading to a healthcare system that is "data rich" but "knowledge poor." This work aims to address this gap by presenting a reliable recurrence heart disease prediction system. The research highlights the necessity for effective analysis methods to uncover connections and patterns within healthcare data. The proposed prediction system utilizes a diverse set of health factors, contributing to a comprehensive understanding of heart disease. By leveraging machine learning algorithms, the system aims to enhance the accuracy and efficiency of diagnosis. The findings of this research not only contribute to the advancement of predictive healthcare but also underscore the significance of unlocking valuable insights from the vast pool of medical data. Ultimately, the implementation of such intelligent systems holds promise for improving patient outcomes and reducing the burden of cardiovascular diseases on global healthcare systems. The paper demonstrated Hybrid methods: Support Vector Machine (SVM), Random Forest (RF), and NaĂŻve Bayes (NB), to build the prediction models. Data preprocessing and feature selection steps were done before building the models. The models were evaluated based on the accuracy, precision, recall, and F1-score. The Random Forest model performed best with 94.27% accuracy.
Read full abstract