Analysis of a Network IO Bottleneck in Big Data Environments Based on Docker Containers

P China Venkanna Varma,V Valli Kumari,Venkata Kalyan Chakravarthy K,S Viswanadha Raju

doi:10.1016/j.bdr.2015.12.002

Abstract

Abstract We live in a world increasingly driven by data with more information about individuals, companies and governments available than ever before. Now, every business is powered by Information Technology and generating Big data. Future Business Intelligence can be extracted from the big data. NoSQL [1] and Map-Reduce [2] technologies find an efficient way to store, organize and process the big data using Virtualization and Linux Container (a.k.a. Container) [3] technologies. Provisioning containers on top of virtual machines is a better model for high resource utilization. As the more containers share the same CPU, the context switch latency for each container increases significantly. Such increase leads to a negative impact on the network IO throughput and creates a bottleneck in the big data environments. As part of this paper, we studied container networking and various factors of context switch latency. We evaluate Hadoop benchmarks [5] against the number of containers and virtual machines. We observed a bottleneck where Hadoop [4] cluster throughput is not linear with the number of nodes sharing the same CPU. This bottleneck is due to virtual network layers which adds a significant delay to Round Trip Time (RTT) of data packets. Future work of this paper can be extended to analyze the practical implications of virtual network layers and a solution to improve the performance of big data environments based on containers.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Analysis of a Network IO Bottleneck in Big Data Environments Based on Docker Containers

Abstract

Talk to us

Similar Papers

More From: Big Data Research

Lead the way for us

Journal: Big Data Research	Publication Date: Jan 4, 2016
Citations: 9

Similar Papers

Analysis of Network IO Performance in Hadoop Cluster Environments Based on Docker Containers
P China Venkanna Varma ... V Valli Kumari
-
P China Venkanna Varma, et. al.P China Venkanna Varma ... V Valli Kumari
01 Jan 2015
01 Jan 2015

Performance Optimization of Hypervisor’s Network Bridge by Reducing Latency in Virtual Layers
Ponnamanda China Venkanna Varma ... V Valli Kumari
-
Ponnamanda China Venkanna Varma, et. al.Ponnamanda China Venkanna Varma ... V Valli Kumari
12 Dec 2018
12 Dec 2018

The Disclosure of Social Responsibility Information of Coal Enterprises in Big Data Environment
Jing-Jing Li ... Lin Zhu
DEStech Transactions on Economics, Business and Management | VOL. -
Jing-Jing Li, et. al.Jing-Jing Li ... Lin Zhu
03 Jul 2018
DEStech Transactions on Economics, Business and Management | VOL. -

Discussion on geological science big data and its applications
Chonglong Wu ... Zhiting Zhang
Chinese Science Bulletin | VOL. 61
Chonglong Wu, et. al.Chonglong Wu ... Zhiting Zhang
16 May 2016
Chinese Science Bulletin | VOL. 61

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Analysis of a Network IO Bottleneck in Big Data Environments Based on Docker Containers

Abstract

Talk to us

Similar Papers

More From: Big Data Research