Abstract

The ability to predict students' academic results is due to the application of various models developed using Data Mining techniques. It is one of the many applications available in Educational Data Mining (EDM). Although there are many classification algorithms available and there have been many researches done to compare certain algorithms, this study aims to compare Bayes-based, ANN-based, Regression-based, SVM-based, Instance-based, Tree-based and Rule-based classification algorithms using a dataset downloaded online from UCI Machine Learning Repository. The data were preprocessed and categorized to different datasets before applying Data Mining techniques. Dimension reduction and data re-balancing were applied and proved to increase prediction accuracy. The algorithms were compared in terms of accuracy rate, precision rate, AUC and model building time. Tree-based algorithms performed better than other categories of algorithms. Within Tree-based algorithms, RandomForest algorithm proved to be the best classification algorithm. Tree-based and Rule-based algorithms reveal the key attributes contributing to accurate prediction of students' results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.