Abstract

This paper analyses deep learning and traditional data mining and machine learning methods; compares the advantages and disadvantage of the traditional methods; introduces enterprise needs, systems and data, IT challenges, and Big Data in an extended service infrastructure. The feasibility and challenges of the applications of deep learning and traditional data mining and machine learning methods in Big Data analytics are also analyzed and presented.

Highlights

  • Data mining focuses on the knowledge discovery of data

  • Deep Learning is a new area in machine learning research, which has been introduced with the objective of moving machine learning closer to one of its original goals — artificial intelligence

  • Deep learning has the potential in dealing with big data there are challenges

Read more

Summary

Introduction

Data mining focuses on the knowledge discovery of data. Machine learning concentrates on prediction based on training and learning. The most widely used method in dimensionality reduction is principal component analysis (PCA). Redundancy means that some of the variables are correlated with one another Because of this redundancy, PCA can be used to reduce the observed variables into a smaller number of principal components [3]. PCA can be used to reduce the observed variables into a smaller number of principal components [3] Factor analysis is another method for dimensionality reduction. It is useful for understanding the underlying reasons for the correlations among a group of variables. The main applications of factor analysis are reducing the number of variables and detecting structure in the relationships among variables. The purposes of this paper are to 1) analyze deep learning and traditional data mining and machine learning methods (including k-means, k-nearest neighbor, support vector machines, decision trees, logistic regression, Naive Bayes, neural networks, bagging, boosting, and random forests); 2) compares the advantages and disadvantage of the traditional methods; 3) introduces enterprise needs, systems and data, IT challenges, and Big Data in an extended service infrastructure; and 4) discuss the feasibility and challenges of the applications of deep learning and traditional data mining and machine learning methods in Big Data analytics

Some Methods in Data Mining and Machine Learning
Support Vector Machine
Trees and Logistic Regression
Naïve Bayes
Neural Networks
Deep Learning
Comparison of Different Methods and Ensemble Methods
Big Data in Service Infrastructure and IT Challenges
Data Mining and Machine Learning in
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call