Keddah

Jie Deng,Steve Uhlig,Felix Cuadrado,Gareth Tyson

doi:10.1145/3301503

Abstract

As a distributed system, Hadoop heavily relies on the network to complete data-processing jobs. While the traffic generated by Hadoop jobs is critical for job execution performance, the actual behaviour of Hadoop network traffic is still poorly understood. This lack of understanding greatly complicates research relying on Hadoop workloads. In this article, we explore Hadoop traffic through empirical traces. We analyse the generated traffic of multiple types of MapReduce jobs, with varying input sizes, and cluster configuration parameters. We present Keddah, a toolchain for capturing, modelling, and reproducing Hadoop traffic, for use with network simulators to better capture the behaviour of Hadoop. By imitating the Hadoop traffic generation process and considering the YARN resource allocation, Keddah can be used to create Hadoop traffic workloads, enabling reproducible Hadoop research in more realistic scenarios.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Keddah

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Modeling and Computer Simulation

Lead the way for us

Journal: ACM Transactions on Modeling and Computer Simulation	Publication Date: Jun 19, 2019
Citations: 1

Similar Papers

Keddah: Capturing Hadoop Network Behaviour
Jie Deng ... Steve Uhlig
-
Jie Deng, et. al.Jie Deng ... Steve Uhlig
01 Jun 2017
01 Jun 2017

Scalable software architecture for distributed MMORPG traffic generation based on integration of UrBBaN-Gen and IMUNES
Valter Vasić ... Maja Matijašević
Journal of Communications Software and Systems | VOL. 8
Valter Vasić, et. al.Valter Vasić ... Maja Matijašević
21 Dec 2012
Journal of Communications Software and Systems | VOL. 8

Characterization of Hadoop Jobs Using Unsupervised Learning
Sonali Aggarwal ... Shashank Phadke
-
Sonali Aggarwal, et. al.Sonali Aggarwal ... Shashank Phadke
01 Nov 2010
01 Nov 2010

Comparison of SUMO’s vehicular demand generators in vehicular communications via graph-theory metrics
Luis Urquiza-Aguiar ... Xavier Calderón-Hinojosa
Ad Hoc Networks | VOL. 106
Luis Urquiza-Aguiar, et. al.Luis Urquiza-Aguiar ... Xavier Calderón-Hinojosa
03 Jul 2020
Ad Hoc Networks | VOL. 106

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Keddah

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Modeling and Computer Simulation