Abstract
Internet-of-Things (IoT) devices generate data at high speed and in large volumes. These data often require real-time processing to keep the system responsive, which localised Cloud and/or Fog computing paradigms can provide. However, many large IoT deployments, such as sensor networks in remote areas, have only sparse Internet connectivity, challenging the localised Cloud and/or Fog computing paradigms. With the advent of the Raspberry Pi, a credit-card-sized single-board computer, there is a great opportunity to construct a low-cost, low-power portable cloud that supports real-time data processing next to IoT deployments. In this paper, we extend our previous work on constructing a Raspberry Pi Cloud to study its feasibility for real-time big data analytics under realistic application-level workloads in both native and virtualised environments. We have extensively tested the performance of a single Raspberry Pi 2 Model B node with httperf and of a 12-node cluster with Apache Spark and HDFS (Hadoop Distributed File System). Our results demonstrate that our portable cloud is useful for supporting real-time big data analytics. On the other hand, our results also reveal that the overhead for CPU-bound workloads in the virtualised environment is surprisingly high, at 67.2%. We have found that, for big data applications, the virtualisation overhead is fractional for small jobs but becomes more significant for large jobs, up to 28.6%.
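The overhead figures quoted above can be read as relative slowdowns of the virtualised run against the native run. A minimal sketch of that conventional calculation, using illustrative runtimes rather than the paper's raw measurements:

```python
# Hedged sketch: virtualisation overhead expressed as a relative slowdown,
# in percent. The runtimes below are illustrative placeholders, not the
# paper's measured values.
def overhead_pct(native_time: float, virtualised_time: float) -> float:
    """Relative slowdown of the virtualised run versus the native run, in %."""
    return (virtualised_time - native_time) / native_time * 100.0

# Illustrative: a workload taking 100 s natively and 167.2 s virtualised
# corresponds to the 67.2% CPU-bound overhead figure quoted above.
print(round(overhead_pct(100.0, 167.2), 1))  # 67.2
```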
Highlights
Low-cost, low-power embedded devices are ubiquitous, part of the Internet-of-Things (IoT). These devices, or things, include RFID tags, sensors, actuators, smartphones, etc., which have a substantial impact on our everyday life and behaviour [1]
We extend our previous work on constructing a Raspberry Pi Cloud to study its feasibility for real-time big data analytics under realistic application-level workloads in both native and virtualised environments
Spark can be deployed in three ways: (1) Standalone mode, where Spark interacts with HDFS directly, and MapReduce can run alongside it on the same cluster; (2) Hadoop YARN, where Spark runs on top of YARN, Hadoop's distributed resource manager; (3) Spark in MapReduce (SIMR), where Spark jobs are launched from within MapReduce, in addition to the standalone deployment
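The first two modes differ mainly in which cluster manager a job is submitted to, selected by the master URL. A minimal sketch of that mapping, where the hostname is a placeholder and 7077 is the standalone master's default port:

```python
# Hedged sketch: the master URL typically used when submitting a Spark job
# under each deployment mode. "master-host" is a placeholder hostname; SIMR
# is omitted because it is launched through its own wrapper script.
DEPLOYMENT_MODES = {
    "standalone": "spark://master-host:7077",  # Spark's built-in cluster manager
    "yarn": "yarn",                            # resources managed by Hadoop YARN
}

def master_url(mode: str) -> str:
    """Return the master URL for a given deployment mode."""
    try:
        return DEPLOYMENT_MODES[mode]
    except KeyError:
        raise ValueError(f"unknown deployment mode: {mode}")
```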
Summary
Low-cost, low-power embedded devices are ubiquitous, part of the Internet-of-Things (IoT). This calls for a radically new computing paradigm which: (1) is capable of processing data efficiently; (2) has the agility of Cloud Computing; (3) is portable to support on-demand physical mobility; and (4) is low-cost and low-power for sustainable computing in remote areas. This new computing paradigm has been made possible by the emergence of a low-cost, low-power, credit-card-sized single-board computer: the Raspberry Pi [5]. We designed and conducted a set of experiments to test the performance of a single node and of a cluster of 12 Raspberry Pi 2 boards with realistic network- and CPU-bound workloads in both native and virtualised environments.