Abstract

Educational Data Mining (EDM) is a rich research field in computer science. Tools and techniques in EDM are useful to predict student performance which gives practitioners useful insights to develop appropriate intervention strategies to improve pass rates and increase retention. The performance of the state-of-the-art machine learning classifiers is very much dependent on the task at hand. Investigating support vector machines has been used extensively in classification problems; however, the extant of literature shows a gap in the application of linear support vector machines as a predictor of student performance. The aim of this study was to compare the performance of linear support vector machines with the performance of the state-of-the-art classical machine learning algorithms in order to determine the algorithm that would improve prediction of student performance. In this quantitative study, an experimental research design was used. Experiments were set up using feature selection on a publicly available dataset of 1000 alpha-numeric student records. Linear support vector machines benchmarked with ten categorical machine learning algorithms showed superior performance in predicting student performance. The results of this research showed that features like race, gender, and lunch influence performance in mathematics whilst access to lunch was the primary factor which influences reading and writing performance.

Highlights

  • Predicting the factors impacting student performance early in the academic programme can assist in combating the high dropout rate experienced by higher education institutions

  • Ere are several data mining tools used in Educational Data Mining (EDM) to analyse and predict student performance to the benefit of educational institutions. ese interventions improved pass rates, curbed dropout rates, and increased retention rates [3]. ere are several tools that are used in EDM

  • Results and Discussion e Linear support vector machine (LSVM) was used to analyse the performance by extracting useful knowledge from the student dataset. e usefulness of the algorithm was to interpret relationships among variables and to determine factors that affect student performances in their mathematics, reading, and writing score

Read more

Summary

Introduction

Predicting the factors impacting student performance early in the academic programme can assist in combating the high dropout rate experienced by higher education institutions. Several studies have been conducted on using machine learning algorithms for early prediction of student performance.

Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call