InfiniCloud 2.0: distributing High Performance Computing across continents

Jakub Chrzęszczyk, Ben Swift, Tin Wee Tan, Kenneth Ban, Howard Andrew, A Chrzeszczyk, Jonathan Low, Peter Davis

doi:10.14529/jsfi160204

Abstract

InfiniCloud 2.0 is the world's first native InfiniBand High Performance Cloud distributed across four continents, spanning Asia, Australia, Europe and North America. The project provides researchers with instant access to computational, storage and network resources distributed around the globe. These resources are then used to build a geographically distributed, virtual supercomputer, complete with a globally accessible parallel file system and job scheduling. This paper describes the high-level design and the implementation details of InfiniCloud 2.0. A gene sequencing pipeline and a plasma physics simulation code are used to demonstrate the system's capabilities.

Highlights

The original InfiniCloud system, presented at Supercomputing Frontiers Singapore in March 2015, enabled researchers to quickly and efficiently copy large volumes of data between Singapore and Australia, and to process that data using two discrete, native InfiniBand High Performance Clouds [8].
While the unique capabilities of InfiniCloud enabled new ways of processing data, they also inspired a whole new range of research questions: Can the entire capacity of the system be aggregated? Do entire data collections need to be copied for processing, or can data be accessed in place? How does the InfiniCloud design scale to an arbitrary number of sites? How do we ensure a consistent state across all InfiniCloud clusters? And can the resources across four continents be joined together using the InfiniCortex fabric to create a Galaxy of Supercomputers [14]?
In Sections 1 and 2 we demonstrated the concept, design and implementation of a geographically distributed High Performance Cloud system capable of aggregating high performance computing resources available across four continents.

Summary

Introduction

The original InfiniCloud system, presented at Supercomputing Frontiers Singapore in March 2015, enabled researchers to quickly and efficiently copy large volumes of data between Singapore and Australia, and to process that data using two discrete, native InfiniBand High Performance Clouds [8]. While the unique capabilities of InfiniCloud enabled new ways of processing data, they also inspired a whole new range of research questions: Can the entire capacity of the system be aggregated? In this paper we aim to explore these research questions and propose new ways of utilizing distributed computation, storage and network resources, using a variety of novel tools and techniques. We take advantage of the expansion and enhancement of the InfiniCortex fabric that took place in 2015 [12], which brought full support for InfiniBand subnets and routing, greater available bandwidth and, last but not least, a growing number of participating sites.

The Network

Connecting to Europe and additional US based facilities

Routable InfiniBand

The Cloud

InfiniCloud rationale and the existing solutions

Cloud Architecture

Cloud implementation

Cloud Controller

Compute Nodes

Resource scheduling

Availability zones

Instance types

Communication patterns

Bandwidth and latency considerations

BeeGFS

BeeGFS configuration for high bandwidth-delay products

Optimizing data access patterns

ElastiCluster

Implementation of variant calling genome analysis pipeline

Geopipeline performance analysis

Extempore

Findings

Conclusions


Journal: Supercomputing Frontiers and Innovations
Publication Date: Sep 1, 2016
Citations: 5
License type: cc-by
