Abstract In the contemporary era of big data, the educational landscape is undergoing significant transformations. Literature education, as a vital component, faces both emerging opportunities and challenges. This study develops a framework for analyzing learning behaviors specific to literature education. It employs both an enhanced K-means clustering algorithm and a refined Apriori algorithm for mining data on student learning behaviors. Through cluster analysis and the investigation of association rules, this research explores the interconnections between students’ learning behaviors and their literary education. The findings categorize students into four distinct groups based on their learning behaviors. Students in Category 1 are identified as the most proficient learners, consistently achieving test scores above 85. Conversely, Category 2 students display the least motivation and effectiveness, with their examination scores not exceeding 70. Students in Categories 3 and 4 exhibit comparable levels of performance. Crucially, the analysis reveals that the most significant predictors of students’ literary achievement are their regular and examination scores, with correlation coefficients of 0.627 and 0.653, respectively. This segmentation and analysis of student behaviors facilitate the early detection of atypical learning patterns by educational practitioners, enabling timely intervention strategies to enhance academic outcomes.
Read full abstract