Abstract

Educational data mining (EDM) is a specialized field within data mining that focuses on extracting valuable insights from academic data across high school and university levels. A common practice in EDM involves predicting students’ grades to identify at-risk individuals and improve the efficiency of academic tasks. This knowledge benefits students, parents, and institutions equally. Early detection enables interventions that improve student performance. The literature presents various prediction strategies, each with its own unique advantages and disadvantages. This study aims to comprehensively evaluate the methods, tools, and applications of machine learning (ML) and data mining (DM) in education. The main goal is to improve the accuracy of predicting academic achievements by employing eight widely recognized ML algorithms: naïve bayes (NB), k-nearest neighbors (KNN), support vector machine (SVM), random forest (RF), logistic regression (LR), extreme gradient boost (XGBOOST), and ensemble voting classifier (EVC). The focus is on improving data quality by eliminating instances of noise. Performance evaluation involves assessing parameters such as accuracy, precision, F-measure, and recall. Incorporating cross-validation and hyperparameter tuning improves classification accuracy. The ML models outperform other ensemble approaches, providing a valuable tool for predicting student performance and assisting educators in making proactive decisions through timely alerts.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call