Examination result repositories generated by most universities can serve as machine learning datasets for training various models to gain insights from the data. These datasets can train multiple linear regression models to determine a student's cumulative grade point average (CGPA), or the score that a student will get in specific courses. Additionally, classification-based supervised machine learning models can use these datasets to provide insights into the class result that a student will obtain. These insights can be invaluable for academic advising and early intervention. Moreover, these datasets can train clustering-based unsupervised machine learning models, such as the K-means clustering model, to understand how student results are grouped into various clusters. This information can be crucial for planning and evaluating the quality of the university. This paper uses the dataset of undergraduate students' examination results from the Department of Computer Science at the University of Nigeria, Nsukka, to train three supervised machine learning models and one unsupervised machine learning model, utilizing Jupyter Notebook as the Python IDE. The training results showed acceptable accuracies of 91.5% for the Naïve Bayes model and 95.1% for the Decision Tree model. The linear regression model demonstrated a negligible root mean square error of 8.23×10−18, while the K-means clustering model exhibited an acceptable Silhouette metric of 0.12.
Read full abstract