Machine Learning Library Research Articles

Artificial intelligence, specifically machine learning, has been applied in a variety of methods by the research group to transform several data sources into valuable facts and understanding, allowing for superior pattern identification skills. Machine learning algorithms on huge and complicated data sets, computationally expensive on the other hand, processing requires hardware and logical resources, such as space, CPU, and memory. As the amount of data created daily reaches quintillion bytes, A complex big data infrastructure becomes more and more relevant. Apache Spark Machine learning library (ML-lib) is a famous platform used for big data analysis, it includes several useful features for machine learning applications, involving regression, classification, and dimension reduction, as well as clustering and features extraction. In this contribution, we consider Apache Spark ML-lib as a computationally independent machine learning library, which is open-source, distributed, scalable, and platform. We have evaluated and compared several ML algorithms to analyze the platform’s qualities, compared Apache Spark ML-lib against Rapid Miner and Sklearn, which are two additional Big data and machine learning processing platforms. Logistic Classifier (LC), Decision Tree Classifier (DTc), Random Forest Classifier (RFC), and Gradient Boosted Tree Classifier (GBTC) are four machine learning algorithms that are compared across platforms. In addition, we have tested general regression methods such as Linear Regressor (LR), Decision Tree Regressor (DTR), Random Forest Regressor (RFR), and Gradient Boosted Tree Regressor (GBTR) on SUSY and Higgs datasets. Moreover, We have evaluated the unsupervised learning methods like K-means and Gaussian Mixer Models on the data set SUSY and Hepmass to determine the robustness of PySpark, in comparison with the classification and regression models. We used ”SUSY,” ”HIGGS,” ”BANK,” and ”HEPMASS” dataset from the UCI data repository. We also talk about recent developments in the research into Big Data machines and provide future research directions.

Read full abstract

The object of the research is modern online services and machine learning libraries for predicting the probability of the bank client's consent to the provision of the proposed services. One of the most problematic areas is the high unpredictability of the result in the field of banking marketing using the most common technique of introducing new services for clients – the so-called cold calling. Therefore, the question of assessing the probability and predicting the behavior of a potential client when promoting new banking services and services using cold calling is particularly relevant. In the course of the study, libraries of machine learning methods and data analysis of the Python programming language were used. A program was developed to build a model for predicting the behavior of bank customers using data processing methods using gradient boosting, regularization of gradient boosting, random forest algorithm and recurrent neural networks. Analogous models were built using cloud machine learning services Azure ML, BigML and the Auto-sklearn library. Data analysis and prediction models built using Python language libraries have a fairly high quality – an average of 94.5 %. Using the Azure ML cloud service, a predictive model with an accuracy of 88.6 % was built. The BigML machine learning service made it possible to build a model with an accuracy of 88.8 %. Machine learning methods from the Auto-sklearn library made it possible to obtain a model with a higher quality – 94.9 %. This is due to the fact that the proposed libraries of the Python programming language allow better customization of data processing methods and machine learning to obtain more accurate models than free cloud services that do not provide such capabilities. Thanks to this, it is possible to obtain a predictive model of the behavior of bank customers with a fairly high degree of accuracy. It is worth noting that in order to make a prediction (forecast), it is necessary to study the context of the task, process the data, build various machine learning algorithms, evaluate the quality of the models and choose the best of them.

Read full abstract

Machine Learning Library Research Articles

Related Topics

Articles published on Machine Learning Library

Quantification of Deep Neural Network Prediction Uncertainties for VVUQ of Machine Learning Models

Priority Evasion Attack: An Adversarial Example That Considers the Priority of Attack on Each Classifier

The study of the quality of multi-step time series forecasting

MULTIMODAL SPEECH RECOGNITION BASED ON AUDIO AND TEXT DATA

Large scale K-means clustering using GPUs

Evaluating and testing neural-network algorithm capabilities for automating image data analysis for remote sensing of the Earth

Impact of data quality on supervised machine learning: Case study on drilling vibrations

DeePKS-kit: A package for developing machine learning-based chemically accurate energy and density functional models

Supervised Machine Learning for Predicting Carbohydrate Malabsorptions Using Hydrogen Breath Tests

Machine Learning in Pansharpening: A benchmark, from shallow to deep networks

Webcam Based Gesture Detection and Recognition to Navigate in the Application

Performance Evaluation of Data-driven Intelligent Algorithms for Big data Ecosystem.

Toward Backdoor Attacks for Image Captioning Model in Deep Neural Networks

Consideration of the possibilities of applying machine learning methods for data analysis when promoting services to bank's clients

New Drug Development and Clinical Trial Design by Applying Genomic Information Management.

Intelligent ensembling of auto-ML system outputs for solving classification problems

Scalable and Fast Characteristic Mode Analysis using GPUs

Restricted-Area Adversarial Example Attack for Image Captioning Model

Vehicle Detection and Count in the Captured Stream Video Using Opencv in Machine Learning

Prediction of COVID-19 Using Some Machine Learning Models and Its Comparison with a Deep Learning Model

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Machine Learning Library Research Articles

Related Topics

Articles published on Machine Learning Library

Quantification of Deep Neural Network Prediction Uncertainties for VVUQ of Machine Learning Models

Priority Evasion Attack: An Adversarial Example That Considers the Priority of Attack on Each Classifier

The study of the quality of multi-step time series forecasting

MULTIMODAL SPEECH RECOGNITION BASED ON AUDIO AND TEXT DATA

Large scale K-means clustering using GPUs

Evaluating and testing neural-network algorithm capabilities for automating image data analysis for remote sensing of the Earth

Impact of data quality on supervised machine learning: Case study on drilling vibrations

DeePKS-kit: A package for developing machine learning-based chemically accurate energy and density functional models

Supervised Machine Learning for Predicting Carbohydrate Malabsorptions Using Hydrogen Breath Tests

Machine Learning in Pansharpening: A benchmark, from shallow to deep networks

Webcam Based Gesture Detection and Recognition to Navigate in the Application

Performance Evaluation of Data-driven Intelligent Algorithms for Big data Ecosystem.

Toward Backdoor Attacks for Image Captioning Model in Deep Neural Networks

Consideration of the possibilities of applying machine learning methods for data analysis when promoting services to bank's clients

New Drug Development and Clinical Trial Design by Applying Genomic Information Management.

Intelligent ensembling of auto-ML system outputs for solving classification problems

Scalable and Fast Characteristic Mode Analysis using GPUs

Restricted-Area Adversarial Example Attack for Image Captioning Model

Vehicle Detection and Count in the Captured Stream Video Using Opencv in Machine Learning

Prediction of COVID-19 Using Some Machine Learning Models and Its Comparison with a Deep Learning Model