Abstract

Cloud technology has the potential for widening access to high-performance computational resources for e-science research, but barriers to engagement with the technology remain high for many scientists. Workflows help overcome barriers by hiding details of underlying computational infrastructure and are portable between various platforms including cloud; they are also increasingly accepted within e-science research communities. Issues arising from the range of workflow systems available and the complexity of workflow development have been addressed by focusing on workflow interoperability, and providing customised support for different science communities. However, the deployments of such environments can be challenging, even where user requirements are comparatively modest. RESWO (Reconfigurable Environment Service for Workflow Orchestration) is a virtual platform-as-a-service cloud model that allows leaner customised environments to be assembled and deployed within a cloud. Suitable distributed computation resources are not always easily affordable and can present a further barrier to engagement by scientists. Desktop grids that use the spare CPU cycles available within an organisation are an attractively inexpensive type of infrastructure for many, and have been effectively virtualised as a cloud-based resource. However, hosts in this environment are volatile: leading to the tail problem, where some tasks become randomly delayed, affecting overall performance. To solve this problem, new algorithms have been developed to implement a cloudbursting scheduler in which durable cloud-based CPU resources may execute replicas of jobs that have become delayed. This paper describes experiences in the development of a RESWO instance in which a desktop grid is buttressed with CPU resources in the cloud to support the aspirations of bioscience researchers. A core component of the architecture, the cloudbursting scheduler, implements an algorithm to perform late job detection, cloud resource management and job monitoring. The experimental results obtained demonstrate significant performance improvements and benefits illustrated by use cases in bioscience research.

Highlights

  • Cloud technology has the potential to increase access to a much wider range of powerful computational resources for advancing e-science research, but in practice, rapid uptake by scientists has been impeded by a number of factors, most notably that cloud computing, as with grid computing in former years, typically requires advanced understanding of underlying computingWorkflows are not without their own challenges

  • Interoperability solutions combined with portal technology offer solutions but are typically cumbersome; Reconfigurable environment service for workflow orchestration (RESWO) provides a model for efficient deployment of such environments

  • distributed computing infrastructure (DCI) based on desktop grid (DG) are cheap, powerful and attractive to scientists

Read more

Summary

Introduction

Cloud technology has the potential to increase access to a much wider range of powerful computational resources for advancing e-science research, but in practice, rapid uptake by scientists has been impeded by a number of factors, most notably that cloud computing, as with grid computing in former years, typically requires advanced understanding of underlying computingWorkflows are not without their own challenges. RESWO (Reconfigurable Environment Service for Workflow Orchestration) is an emerging virtual platformas-a-service cloud model based on technologies developed originally in projects such as SHIWA and SCI-BUS.

Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call