Twister:Net - Communication Library for Big Data Processing in HPC and Cloud Environments

Supun Kamburugamuve,Gurhan Gunduz,Geoffrey Fox,Kannan Govindarajan,Vibhatha Abeykoon,Ahmet Uyar,Pulasthi Wickramasinghe

doi:10.1109/cloud.2018.00055

Abstract

Streaming processing and batch data processing are the dominant forms of big data analytics today, with numerous systems such as Hadoop, Spark, and Heron designed to process the ever-increasing explosion of data. Generally, these systems are developed as single projects with aspects such as communication, task management, and data management integrated together. By contrast, we take a component-based approach to big data by developing the essential features of a big data system as independent components with polymorphic implementations to support different requirements. Consequently, we recognize the requirements of both dataflow used in popular Apache Systems and the Bulk Synchronous Processing communication style common in High-Performance Computing (HPC) for different applications. Message Passing Interface (MPI) implementations are dominant in HPC but there are no such standard libraries available for big data. Twister:Net is a stand-alone, highly optimized dataflow style parallel communication library which can be used by big data systems or advanced users. Twister:Net can work both in cloud environments using TCP or HPC environments using MPI implementations. This paper introduces Twister:Net and compares it with existing systems to highlight its design and performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Twister:Net - Communication Library for Big Data Processing in HPC and Cloud Environments

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A survey on key roles of optical switching and labeling technologies on big data traffic of Data Centers and HPC environments
Efthymios N Lallas
AIMS Electronics and Electrical Engineering | VOL. 3
Efthymios N LallasEfthymios N Lallas
01 Jan 2019
AIMS Electronics and Electrical Engineering | VOL. 3

The Importance of Non-Data-Communication Overheads in MPI
Pavan Balaji ... Anthony Chan
The International Journal of High Performance Computing Applications | VOL. 24
Pavan Balaji, et. al.Pavan Balaji ... Anthony Chan
11 Jan 2010
The International Journal of High Performance Computing Applications | VOL. 24

Performance evaluation of some MPI implementations on workstation clusters
Zhixin Ba ... Zhenxiao Yang
-
Zhixin Ba, et. al. Zhixin Ba ... Zhenxiao Yang
01 Jan 1999
01 Jan 1999

Performance evaluation of some MPI implementations on workstation clusters
N Nupairoj ... L.M Ni
-
N Nupairoj, et. al.N Nupairoj ... L.M Ni
12 Oct 1994
12 Oct 1994

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Twister:Net - Communication Library for Big Data Processing in HPC and Cloud Environments

Abstract

Talk to us

Similar Papers