Abstract

Computerization of society has substantially improved the ability to generate and collect data from a variety of sources. A large amount of data has flooded almost every aspect of people's lives. AMIK HASS Bandung has an Informatic Management Study Program consisting of three areas of concentration that can be selected by students in the fourth semester including Computerized Accounting, Computer Administration, and Multimedia. The determination of concentration selection should be precise based on past data, so the academic section must have a pattern or rule to predict concentration selection. In this work, the data mining techniques were using Naive Bayes and Decision Tree J48 using WEKA tools. The data set used in this study was 111 with a split test percentage mode of 75% used as training data as the model formation and 25% as test data to be tested against both models that had been established. The highest accuracy result obtained on Naive Bayes which is obtaining a 71.4% score consisting of 20 instances that were properly clarified from 28 training data. While Decision Tree J48 has a lower accuracy of 64.3% consisting of 18 instances that are properly clarified from 28 training data. In Decision Tree J48 there are 4 patterns or rules formed to determine concentration selection so that the academic section can assist students in determining concentration selection.

Highlights

  • The rapid development, of information technology, is undisputed

  • The Computerized Accounting Class has data proximity of 76.9% indicating that the correct percentage of students choosing the concentration of Computerized Accounting from the entire student predicted chose the concentration of Computerized Accounting

  • While the average return of grades produced on Naïve Bayes was 71.4% and Decision Tree J48 was lower which was 64.3% stating that the average percentage of students predicted in the selection of a concentration compared to the overall students who chose that concentration

Read more

Summary

Introduction

The rapid development, of information technology, is undisputed. Along with these developments, all transaction data has been evolved by applying information technology. Vast amounts of data have flooded almost every aspect of people's lives. The growth of explosive data has been stored, while data has generated an urgent need for new techniques and automated tools that can help intelligently turn large amounts of data into useful information and knowledge. This led to the development of computer science called data mining with its various applications. More popular data mining referred to as Knowledge Data Discovery or KDD is automatic or practical pattern extraction that represents knowledge implicitly stored or captured in large databases, data warehouses, web, other large information repositories, or data streams (Larose, 2015)

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call