Development of Multiple Big Data Analytics Platforms with Rapid Response

Bao Rong Chang, Yun-Da Lee, Po-Hao Liao

doi:10.1155/2017/6972461

Open Access

https://doi.org/10.1155/2017/6972461

Journal: Scientific Programming
Publication Date: Jan 1, 2017
Citations: 5
License type: CC BY 4.0

Affiliation: National University of Kaohsiung

Abstract

The crucial problem in integrating multiple platforms is how to adapt to each platform's computing features so as to execute assignments most efficiently and obtain the best outcome. This paper introduces two new approaches to big data platforms, RHadoop and SparkR, and integrates them into a high-performance, multi-platform big data analytics system that serves as part of business intelligence (BI), carrying out rapid data retrieval and analytics with R programming. The paper aims to optimize job scheduling using the MSHEFT algorithm and to implement optimized platform selection based on computing features, so as to improve system throughput significantly. In addition, users simply issue R commands, rather than running Java or Scala programs, to perform data retrieval and analytics on the proposed platforms. According to the performance index calculated for the various methods, optimized platform selection significantly reduces the execution time for data retrieval and analytics, and scheduling optimization further increases system efficiency.

Highlights

Big data [1] has advanced at an unprecedented pace in recent years and is changing both business operations and enterprise decision-making
The second method is an optimized platform selection (PS), which chooses an appropriate platform for execution according to the amount of memory remaining in a virtual machine but still serves jobs with the first-come-first-serve (FCFS) algorithm; it is denoted FCFS-PS
The third method optimizes job scheduling with the Memory-Sensitive Heterogeneous Earliest Finish Time (MSHEFT) algorithm, which reschedules all input queries in the job queue in ascending order of data file size, smallest first
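
The two ideas in the highlights above, smallest-file-first ordering of the job queue and memory-based platform selection, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The `Query` type, the function names, the `headroom` parameter, and in particular the rule "use SparkR when the data file fits in the remaining memory, otherwise RHadoop" are assumptions made for illustration only; the text above states just that selection depends on the remaining memory of a virtual machine.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Query:
    """Hypothetical job record: a query and the size of the file it reads."""
    name: str
    data_size_gb: float


def order_smallest_first(queue):
    """Reorder a job queue in ascending order of data file size.

    sorted() is stable, so queries with equal file sizes keep their
    arrival (first-come-first-serve) order.
    """
    return sorted(queue, key=lambda q: q.data_size_gb)


def select_platform(query, free_memory_gb, headroom=1.0):
    """Pick a platform from the memory remaining in the virtual machine.

    ASSUMPTION (not from the source): the in-memory platform (SparkR) is
    chosen when the file, scaled by a safety factor, fits in free memory;
    otherwise the disk-based platform (RHadoop) is chosen.
    """
    if query.data_size_gb * headroom <= free_memory_gb:
        return "SparkR"
    return "RHadoop"


# Queries listed in arrival order, with illustrative file sizes.
arrivals = [Query("q1", 8.0), Query("q2", 0.5), Query("q3", 2.0), Query("q4", 0.5)]

schedule = order_smallest_first(arrivals)
plan = [(q.name, select_platform(q, free_memory_gb=4.0)) for q in schedule]
print(plan)
```

Under plain FCFS the large query `q1` would run first and delay the three small ones; sorting by file size lets the short jobs finish early, which is the usual motivation for smallest-first ordering. A real scheduler would also re-read the free memory before dispatching each job rather than using a fixed value.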

Summary

Introduction

Big data [1] has advanced at an unprecedented pace in recent years and is changing both business operations and enterprise decision-making. Big data is characterized by high volume, high velocity, and high variety, and as the amount of data expands enormously, storage, backup [2], management, processing, search [3], analytics, practical application, and other data-handling abilities all face new challenges. These cannot be solved with traditional methods, and it is worthwhile to continue exploring how to extract valuable information from huge amounts of data. According to the latest survey reported by American CIO magazine, 70% of IT operations in business are done by batch processing, which makes it “unable to control processing resources for operation as well as loading” [4]. This has become one of the biggest challenges for big data applications.
