An efficient predictive analytics system for high dimensional big data

Myat Cho Mon Oo,Thandar Thein

doi:10.1016/j.jksuci.2019.09.001

Myat Cho Mon Oo, Thandar Thein

Open Access

https://doi.org/10.1016/j.jksuci.2019.09.001

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

The excessive growth of high dimensional big data has resulted in a greater challenge for data scientists to efficiently obtain valuable knowledge from these data. Traditional data mining techniques are not fit to process big data. Predictive analytics has grown in prominence alongside the emergence of big data. In this paper, an efficient predictive analytics system for high dimensional big data is proposed by enhancing scalable random forest (SRF) algorithm on the Apache Spark platform. SRF is enhanced by optimizing the hyperparameters and prediction performance is improved by reducing the dimensions. The effectiveness of the proposed system is examined on five real-world datasets. Experimental results demonstrated that the proposed system achieves the highly competitive performance compared with RF algorithm implemented by Spark MLlib.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of King Saud University - Computer and Information Sciences	Publication Date: Sep 7, 2019
Citations: 18	License type: cc-by-nc-nd

R Discovery Prime

An efficient predictive analytics system for high dimensional big data

Abstract

Published Version

Talk to us

Similar Papers

More From: Journal of King Saud University - Computer and Information Sciences

Lead the way for us

Similar Papers

HB-File: An efficient and effective high-dimensional big data storage structure based on US-ELM
Linlin Ding ... Baoyan Song
Neurocomputing | VOL. 261
Linlin Ding, et. al.Linlin Ding ... Baoyan Song
16 Feb 2017
Neurocomputing | VOL. 261

Novel Incremental Ranking Framework for Biomedical Data Analytics and Dimensionality Reduction: Big Data Challenges and Opportunities
...
Journal of Computer Science & Systems Biology | VOL. 8
, et. al. ...
01 Jan 2015
Journal of Computer Science & Systems Biology | VOL. 8

A comprehensive survey of anomaly detection techniques for high dimensional big data
Srikanth Thudumu ... Philip Branch
Journal of Big Data | VOL. 7
Srikanth Thudumu, et. al.Srikanth Thudumu ... Philip Branch
02 Jul 2020
Journal of Big Data | VOL. 7

QoE-Based Big Data Analysis with Deep Learning in Pervasive Edge Environment
Qianyu Meng ... Bo Liu
-
Qianyu Meng, et. al.Qianyu Meng ... Bo Liu
01 May 2018
01 May 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

An efficient predictive analytics system for high dimensional big data

Abstract

Published Version

Talk to us

Similar Papers

More From: Journal of King Saud University - Computer and Information Sciences