Abstract

Cloud computing, with its promise of virtually infinite resources, seems to suit well in solving resource greedy scientific computing problems. To study the effects of moving parallel scientific applications onto the cloud, we deployed several benchmark applications like matrix–vector operations and NAS parallel benchmarks, and DOUG (Domain decomposition On Unstructured Grids) on the cloud. DOUG is an open source software package for parallel iterative solution of very large sparse systems of linear equations. The detailed analysis of DOUG on the cloud showed that parallel applications benefit a lot and scale reasonable on the cloud. We could also observe the limitations of the cloud and its comparison with cluster in terms of performance. However, for efficiently running the scientific applications on the cloud infrastructure, the applications must be reduced to frameworks that can successfully exploit the cloud resources, like the MapReduce framework. Several iterative and embarrassingly parallel algorithms are reduced to the MapReduce model and their performance is measured and analyzed. The analysis showed that Hadoop MapReduce has significant problems with iterative methods, while it suits well for embarrassingly parallel algorithms. Scientific computing often uses iterative methods to solve large problems. Thus, for scientific computing on the cloud, this paper raises the necessity for better frameworks or optimizations for MapReduce.

Highlights

  • Scientific computing is a field of study that applies computer science to solve typical scientific problems

  • Cloud computing is a style of computing in which, typically, resources scalable on demand are provided “as a service” over the Internet to users who need not have knowledge of, expertise in, or control over the cloud infrastructure that supports them

  • In the SciCloud project we are mainly studying at adapting some of the scientific computing problems to the MapReduce [11] framework

Read more

Summary

Introduction

Scientific computing is a field of study that applies computer science to solve typical scientific problems. DOUG is an open source software package for parallel iterative solution of very large sparse systems of linear equations with up to several millions of unknowns While running these parallel applications on the SciCloud, it was realized that the transmission delays in the cloud environment to be the major problem for adapting HPC problems on the cloud. In the SciCloud project we are mainly studying at adapting some of the scientific computing problems to the MapReduce [11] framework.

SciCloud
Scientific computing on the cloud
Parallel scientific applications on the SciCloud
DOUG on the SciCloud
Adapting scientific computing problems to the clouds using MapReduce
SciCloud Hadoop framework
Scientific computing problems reduced to MapReduce framework
Related work
Findings
Conclusions and future research directions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call