In this age of big data, education researchers are reconceptualizing and re-evaluating the value of education data. Therefore, we need to use educational data mining methods for data analysis to better guide teaching. The informatization level of colleges and universities is improving year by year, and the entire training data of students from enrollment to graduation is stored. These datasets are collected, stored, and kept by different departments, contain a large amount of regular and relevant information, and truly record the growth footprint of students. Traditional educational decision-making has not yet fully explored and used the valuable information hidden in data resources. Although some scholars have carried out research related to campus data mining at this stage, there are still many problems that have not yet been solved in the application of decision-making in colleges and universities. This paper is based on the idea of data-driven decision-making, combined with the data characteristics of campus big data, and establishes a model solution for student behavior analysis and behavior prediction by applying multiple machine learning algorithms. On the basis of the analysis of students’ academic behavior performance in the context of multi-category educational data, we proposed a cluster analysis framework for processing multi-type campus big data, and described the group characteristics of the clustering results. By introducing the K-prototype algorithm, we effectively solved the multi-category problem where traditional clustering algorithms (such as K-Means, etc.) cannot adapt to the attributes of educational data. The research results show that innovative educational decision-making models and methods are based on the idea of “data-prediction-decision”, which promotes the application research of big data science in the area of education.
Read full abstract