Accuracies and Training Times of Data Mining Classification Algorithms: An Empirical Comparative Study

S Olalekan Akinola,O Jephthar Oyabugbe

doi:10.4236/jsea.2015.89045

S Olalekan Akinola, O Jephthar Oyabugbe

Open Access

https://doi.org/10.4236/jsea.2015.89045

Copy DOI

Abstract

Two important performance indicators for data mining algorithms are accuracy of classification/ prediction and time taken for training. These indicators are useful for selecting best algorithms for classification/prediction tasks in data mining. Empirical studies on these performance indicators in data mining are few. Therefore, this study was designed to determine how data mining classification algorithm perform with increase in input data sizes. Three data mining classification algorithms—Decision Tree, Multi-Layer Perceptron (MLP) Neural Network and Naive Bayes— were subjected to varying simulated data sizes. The time taken by the algorithms for trainings and accuracies of their classifications were analyzed for the different data sizes. Results show that Naive Bayes takes least time to train data but with least accuracy as compared to MLP and Decision Tree algorithms.

Highlights

A large volume of data is poured into our computer networks, the World Wide Web (WWW), and various data storage devices every day from business, society, science and engineering, medicine, and almost every other aspect of daily life
(a) and Figure 1(b), it could be inferred that as data sizes were increasing, Naïve Bayes classification algorithm’s time complexity was the least, followed by J48 (Decision Tree) and Artificial Neural Networks (ANNs) (Multi-Layer Perceptron Neural Network) in that order. This means that MLP takes highest times for each of the data instances than the J48 Decision Tree and Naïve Bayes Classifiers
Results from this study show that there is a trade-off between accuracy and time complexities of the three algorithms (Multi-layer Perceptron, Naïve Bayes and Decision Tree) used

Summary

Introduction

A large volume of data is poured into our computer networks, the World Wide Web (WWW), and various data storage devices every day from business, society, science and engineering, medicine, and almost every other aspect of daily life. This explosive growth of available data volume emanates as a result of the computerization of our society and the fast development of powerful data collection and storage tools [1]. J. Oyabugbe tions) from very large databases or data warehouses [2]. Data mining consists of more than collection and managing data; it includes analysis and prediction

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Software Engineering and Applications	Publication Date: Jan 1, 2015
Citations: 18	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Accuracies and Training Times of Data Mining Classification Algorithms: An Empirical Comparative Study

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Software Engineering and Applications

Lead the way for us

Similar Papers

Data Mining Driven Models for Diagnosis of Diabetes Mellitus: A Survey
B Z Yahaya ... Y Atomsa
Indian Journal of Science and Technology | VOL. 11
B Z Yahaya, et. al.B Z Yahaya ... Y Atomsa
01 Nov 2018
Indian Journal of Science and Technology | VOL. 11

Two credit scoring models based on dual strategy ensemble trees
Gang Wang ... Kaiquan Xu
Knowledge Based Systems | VOL. 26
Gang Wang, et. al.Gang Wang ... Kaiquan Xu
13 Jul 2011
Knowledge Based Systems | VOL. 26

Direct marketing decision support through predictive customer response modeling
David L Olson ... Bongsug(Kevin) Chae
Decision Support Systems | VOL. 54
David L Olson, et. al.David L Olson ... Bongsug(Kevin) Chae
03 Jul 2012
Decision Support Systems | VOL. 54

Exploring new privacy approaches in a scalable classification framework
M Saravanan ... V.L Jayasre Manchari
-
M Saravanan, et. al.M Saravanan ... V.L Jayasre Manchari
01 Oct 2014
01 Oct 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Accuracies and Training Times of Data Mining Classification Algorithms: An Empirical Comparative Study

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Software Engineering and Applications