Abstract

Scientific computation and data-intensive analyses are ever more frequent. On the one hand, the MapReduce programming model has gained a lot of attention for its applicability to large parallel data analyses and Big Data applications. On the other hand, Cloud computing is increasingly attractive for solving computing problems that demand large amounts of resources. This paper explores the potential symbiosis between MapReduce and Cloud computing in order to create a robust and scalable environment for executing MapReduce workflows regardless of the underlying infrastructure. The main goal of this work is to provide an easy-to-install interface, so that non-expert scientists can deploy a suitable testbed for their MapReduce experiments on the local resources of their institution. Test cases were run to evaluate the time required for the whole execution process on a real cluster.

Highlights

  • Scientific Computing enables new kinds of experiments that would have been impossible only a decade ago

  • The MapReduce programming model abstracts away the common difficulties of distributed processing on large clusters by offering a simple and efficient way of processing large data sets with a parallel, distributed algorithm

  • Although it has been argued that MapReduce is not well suited to many scientific algorithms, a recent work [2] studied how to adapt different classes of algorithms to the MapReduce model and concluded that the programming model can be used successfully even for solving complex scientific computing problems


Introduction

Scientific Computing enables new kinds of experiments that would have been impossible only a decade ago. Big Data science is generating datasets that are increasing exponentially in both complexity and volume, making their analysis a big challenge. Two issues must be addressed: finding an effective method to tackle such challenging problems, and obtaining the necessary resources to solve them. MapReduce [1] may help address the first issue. The MapReduce programming model abstracts away the common difficulties of distributed processing on large clusters by offering a simple and efficient way of processing large data sets with a parallel, distributed algorithm. Although it has been argued that MapReduce is not well suited to many scientific algorithms, a recent work [2] studied how to adapt different classes of algorithms to the MapReduce model and concluded that the programming model can be used successfully even for solving complex scientific computing problems.
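
To make the model concrete, the sketch below shows the classic word-count example expressed as a map and a reduce function. It is a minimal, single-process illustration in plain Python, not the authors' system and not tied to any particular MapReduce framework: the map step emits intermediate key-value pairs, a shuffle step groups them by key (work a real framework performs across the cluster), and the reduce step aggregates each group.

    from collections import defaultdict

    def map_phase(document):
        # Map: emit an intermediate (word, 1) pair for every word.
        for word in document.split():
            yield word.lower(), 1

    def shuffle(pairs):
        # Shuffle: group intermediate values by key, as the framework would.
        groups = defaultdict(list)
        for key, value in pairs:
            groups[key].append(value)
        return groups.items()

    def reduce_phase(key, values):
        # Reduce: aggregate all counts observed for a single word.
        return key, sum(values)

    documents = ["the quick brown fox", "the lazy dog", "the fox"]
    pairs = (pair for doc in documents for pair in map_phase(doc))
    counts = dict(reduce_phase(k, vs) for k, vs in shuffle(pairs))
    print(counts)  # {'the': 3, 'quick': 1, 'brown': 1, 'fox': 2, ...}

Because the map and reduce functions are side-effect free, a framework can run many instances of each in parallel on different partitions of the input, which is what makes the model attractive for large clusters.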
