Abstract

Cardiovascular disease is defined as a set of conditions related to the disorder of the heart and blood vessels. Predicting and diagnosing cardiovascular disease is significant to ensure the appropriate treatment of this disease. Machine learning approaches are generally utilized to automatically detect the hidden patterns in vast amounts of data without human intervention. In the early stage of cardiovascular disease, a machine learning model can aid physicians in making the right decision about the medication. This research aims to develop an intelligent agent to predict cardiovascular disease to investigate what steps should be taken before any untoward incident occurs. This paper proposes an ensemble method-based multilayer dynamic system (MLDS) that can improve its current knowledge in every layer. The proposed model applies Correlation Attribute Evaluator (CAE), Gain Ratio Attribute Evaluator (GRAE), Information Gain Attribute Evaluator (IGAE), Lasso, and Extra Trees classifier (ETC) for feature selection. Finally, Random Forest (RF), Naïve Bayes (NB), and Gradient Boosting (GB) classifiers combinedly construct the ensemble method for classification in the model. The K Nearest Neighbor (KNN) algorithm is applied to find the test data's neighborhood data points while the base classifiers mentioned are failed to classify correctly in any layer. To test the proposed model's efficiency, we have used a realistic dataset (70,000 instances) collected from Kaggle. The proposed model has achieved 88.84%, 89.44%, 91.56%, 92.72%, and 94.16% accuracy based on the train and test data's different splitting ratios (50:50, 60:40, 70:30, 80:20, and 87.5:12.5). Our proposed model has achieved a 0.94 AUC value. AUC = 0.94 means it has a 94% probability of correctly classifying positive and negative classes, Whereas the splitting ratio is 87.5:12.5. The Cleveland, Hungarian, and Cleveland-Hungary-Switzerland-Long Beach datasets have also been applied to train the model, and the model achieved 98.88%, 99.53%, 99.98%, 98.36%, 96.66%, 97.77%, 99.56, and 94.37% accuracy depending on the different splitting ratios of these datasets. The proposed model has been compared to five other models, indicating that the proposed model can effectively predict cardiovascular disease.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call