Abstract

Current IT technologies face a strong need to scale high-performance analysis up to large-scale datasets. The volume and complexity of data gathered in both public (e.g. on the web) and enterprise (e.g. digitised internal document bases) domains have grown tremendously over the last few years, posing new challenges to providers of high-performance computing (HPC) infrastructures; this is recognised in the community as the Big Data problem. In contrast to typical HPC applications, Big Data applications are not oriented towards reaching the peak performance of the infrastructure and thus fit the capacity infrastructure model better than the capability one, which makes Cloud infrastructures preferable to HPC. However, given the increasingly vanishing difference between these two infrastructure types, i.e. Cloud and HPC, it makes sense to investigate the ability of traditional HPC infrastructures to execute Big Data applications as well, despite their relatively poor efficiency compared with traditional, highly optimised HPC applications. This paper discusses the main state-of-the-art parallelisation techniques utilised in both the Cloud and HPC domains and evaluates them with an exemplary text processing application on a testbed HPC cluster.
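The data-parallel, capacity-style processing described above can be illustrated with a minimal sketch. The paper's actual application, framework, and cluster setup are not specified here; this hypothetical example merely shows the map/reduce pattern common to both Cloud and HPC parallelisation of text workloads, using Python's standard-library process pool as a stand-in for a cluster scheduler.

```python
# Hypothetical illustration only: a data-parallel word count in the
# map/reduce style typical of Big Data text processing. Each worker
# counts words in one chunk (map); partial counts are merged (reduce).
from collections import Counter
from multiprocessing import Pool


def count_words(chunk):
    # Map step: word-occurrence counts for one text chunk.
    return Counter(chunk.split())


def parallel_word_count(chunks, workers=2):
    # Scatter chunks across worker processes, then merge partial counts.
    with Pool(workers) as pool:
        partials = pool.map(count_words, chunks)
    return sum(partials, Counter())


if __name__ == "__main__":
    docs = ["big data on hpc", "hpc and cloud", "big data and cloud"]
    print(parallel_word_count(docs))
```

On a real HPC cluster the same pattern would typically be expressed with MPI or a Big Data framework such as Hadoop or Spark; the process pool here only stands in for that distribution layer.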
