Abstract
In this era of High-Performance High computing systems, Large-scale Data Mining methodologies in the field of education have become a convenience to discover and extract knowledge from Databased of their respective educational archives. Typically, all educational institutions around the world maintain student data repositories. Attributes of students such as the name of the student, gender of student, age group (date of birth), religion, eligibility details, academic assessment details, etc. are kept in it. With this knowledge, in this paper, didactical data mining (DDM) is used to leverage the performance prediction of student and to analyse it proactively. As it is known, Classification and Clustering are the liveliest techniques in mining the required data. Hence, Bound Model of Clustering and Classification (BMCC) have been proposed in this research for most proficient educational data mining. Classification is one of the distinguished options in Data Mining to assign an object under some pre-defined classes according to their attributes, and hence it comes under a supervised learning problem. On the other side, clustering is considered as a non-supervised learning problem that involves in grouping up of objects with respect to some similarities. Moreover, this paper uses the dataset collected from Kerala Technological University-SNG College of Engineering (KTU_SNG) for performing the BMCC. An efficient J48 decision tree algorithm is used for classification and the k-means algorithm is incorporated for clustering here and is optimised with Bootstrap Aggregation (Bagging). The implementation has been done and analysed with a data mining tool called WEKA (Waikato Environment for Knowledge Analysis), and the results are compared with some most used classifications such as Bayes Classifier (NB), Neural Network (Multilayer Perceptron MLP) and J48. It is provable from the results that the model, proposed in this provides high Precision Rate (PR), accuracy and robustness with less computational time, though the sample data set includes some missing values.
Highlights
Data mining is the process of examining the large databases for extracting the new or required information
Of the experimental analysis, the results obtained for the proposed (BMCC) model corresponding to the parameters given in the are compared with the traditionalistic classification techniques such as Naive Bayes, Multilayer Perceptron (MLP) and J48
The proposed (BMCC) model is an effective technique for the Proficient Performance Prediction (PPP) of educational datasets and mining classification that helps in successfully identifying the huge data sets of educational domains
Summary
Data mining is the process of examining the large databases for extracting the new or required information. There are rising research scopes in utilizing data mining techniques for education, for Didactical Data Mining (DDM) a similar one to Educational Data Mining (EDM). This newly developing interdisciplinary field EDM involves knowledge extraction from the data obtained from the educational environments [11]. The collected data are from different sources, formats and at variant granularity levels and that may contain the personal or academic details of the students In another option, the huge data can be collected from the e-learning systems that are already provided with a huge amount of data from various institutions.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: International Journal of Advanced Computer Science and Applications
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.