TATA: Throughput-Aware TAsk Placement in Heterogeneous Stream Processing with Deep Reinforcement Learning

Xiao Huang, Huayun Tang, Jin Jin, Yiping Wang, Hai Wan, Yu Jiang, Xibin Zhao, Hao Fan

doi:10.1109/ispa-bdcloud-socialcom-sustaincom52081.2021.00021

Abstract

Data Stream Processing (DSP) applications, which generate real-time analytics on continuous data flows, have become prevalent in recent years. Task placement is an essential part of deploying DSP applications. Because determining the optimal task placement is an NP-hard problem, several efficient heuristics have been designed, and Deep Reinforcement Learning (DRL) has been used to train scheduling agents. Current DRL-based approaches assume that all resources, including CPU, memory, and networking, are homogeneous. However, the available computation and network resources are heterogeneous in many scenarios. To address this, we devise a general DRL-based resource-aware framework that models resources using graph embedding and an attention mechanism to predict the placement. Furthermore, to accelerate the training process and improve throughput, we propose an efficient throughput estimation tool that estimates throughput with high accuracy. We integrated our heuristic scheduling framework into Apache Flink and conducted comprehensive tests using multiple synthetic and real DSP applications. The experimental results show that our framework increases throughput by 64%, 42%, and 29% on average compared with three state-of-the-art strategies, respectively.

Similar Papers

Placement of distributed stream processing over heterogeneous infrastructures
Matteo Nardelli
13 Jun 2016

QoS-aware deployment of data streaming applications over distributed infrastructures
Matteo Nardelli
01 May 2016

Elastic stateful stream processing in Storm
Valeria Cardellini ... Matteo Nardelli
01 Jul 2016

On QoS-aware scheduling of data stream applications over fog computing infrastructures
Valeria Cardellini ... Francesco Lo Presti
01 Jul 2015
