Application-Oriented Network Scheduling With Metaflow

Yang Shi, Jiawei Fei, Mei Wen, Chunyuan Zhang

doi:10.1109/access.2019.2957765

Abstract

Distributed applications usually feature a set of correlated flows between two consecutive computation stages. The scheduling of these flows has a crucial influence on job completion time. Coflow improves performance by optimizing the finish time of the entire set of flows. However, the flows and computing tasks in one application have more complex relationships that exceed the coflow's barrier assumption. In this context, scheduling via the coflow abstraction may hurt application performance. Accordingly, we propose metaflow, a traffic abstraction derived from the computation graph of the application. Metaflow reveals the detailed flow requirements of the application and makes it easier to reduce the job completion time. Based on metaflow, we first develop a mathematical model and formulate the scheduling problem as an integer linear programming (ILP) problem. To solve this ILP problem efficiently, we further prove through rigorous theoretical analysis that it has an equivalent linear programming (LP) problem. To demonstrate the effectiveness of scheduling with metaflow, we have conducted extensive simulations with both synthetic single jobs and production traces containing multiple jobs. The simulation results verify that our new scheduler adapts well to different jobs and achieves a significant average speedup of 2.87× on a real-life workload compared to the state-of-the-art coflow scheduler.

Highlights

Datacenter networks are critical to the performance of distributed applications.
We propose an algorithm to calculate the metaflow completion time (MCT) and formulate the metaflow scheduling problem (MSP) as an integer linear programming (ILP) model with optimal solutions (§IV).
Using workload traces from real datacenters, we show that distributed applications can be accelerated significantly through network scheduling with metaflow, compared with the state-of-the-art coflow scheduler (§VI).

Summary

INTRODUCTION

Datacenter networks are critical to the performance of distributed applications. It is reported that, at times, 50% of the time taken to complete a job is spent on transferring data across the network [1]. Traditional scheduling algorithms focus on reducing flow completion time (FCT) [3]–[6] or improving per-flow fairness [7], [8]. Since they are based on the abstraction of flows, they cannot capture the semantics of communication in a distributed application; the optimization of flow-level objectives can be at odds with application-level goals. The coflow abstraction instead treats the set of correlated flows between two consecutive stages as a unit and optimizes the coflow completion time (CCT). Coflow assumes that a job cannot begin to process the next stage until all flows within the coflow have finished; that is to say, a barrier exists between two consecutive stages. Under this condition, minimizing the average CCT usually aligns with application-level performance, thereby decreasing job completion time (JCT). However, the flows and computing tasks in one application have more complex relationships that exceed this barrier assumption, and coflow cannot convey these application semantics to the network controller. To address this problem, in this paper we propose metaflow, a new application-oriented traffic abstraction that leverages the computation dependency graph to guide the network transfer. We compare the metaflow scheduler against three other schedulers in extensive experiments.
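To make the barrier argument concrete, the following toy sketch compares job completion time for two flows on one link under two assumptions: a barrier after the whole set of flows, versus starting each task as soon as its own flow arrives. It is only an illustration and is not the paper's ILP/LP scheduler; the single shared link, the single worker, the flow sizes, the compute times, and all function names are hypothetical choices made for this example.

```python
def fair_share_finish(sizes, capacity=1.0):
    """Finish times when all active flows share one link equally."""
    remaining = dict(sizes)
    finish = {}
    now = 0.0
    while remaining:
        rate = capacity / len(remaining)
        # The flow with the least remaining data finishes next.
        name = min(remaining, key=remaining.get)
        dt = remaining[name] / rate
        now += dt
        for other in remaining:
            remaining[other] -= rate * dt
        finish[name] = now
        del remaining[name]
    return finish


def sequential_finish(sizes, order, capacity=1.0):
    """Finish times when flows use the full link one after another."""
    now = 0.0
    finish = {}
    for name in order:
        now += sizes[name] / capacity
        finish[name] = now
    return finish


def job_completion_time(flow_finish, compute_time, barrier):
    """JCT on a single worker that runs one task per flow.

    With barrier=True, no task may start before the last flow finishes
    (the coflow assumption). With barrier=False, each task may start as
    soon as its own flow has arrived.
    """
    ready = dict(flow_finish)
    if barrier:
        last = max(flow_finish.values())
        ready = {name: last for name in flow_finish}
    worker_free = 0.0
    for name in sorted(ready, key=ready.get):
        start = max(worker_free, ready[name])
        worker_free = start + compute_time[name]
    return worker_free


# Hypothetical job: two unit-size flows, each feeding a 1-second task.
sizes = {"A": 1.0, "B": 1.0}
compute = {"A": 1.0, "B": 1.0}

barrier_jct = job_completion_time(
    fair_share_finish(sizes), compute, barrier=True)
dependency_jct = job_completion_time(
    sequential_finish(sizes, order=["A", "B"]), compute, barrier=False)

print(barrier_jct)     # 4.0: both flows end at t=2, then two tasks run
print(dependency_jct)  # 3.0: task A overlaps with the transfer of flow B
```

In this toy case both schedules finish the whole set of flows at t=2, so they are equally good when judged by the completion time of the set alone, yet the dependency-aware ordering finishes the job one second earlier because computation overlaps with the remaining transfer.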

RELATED WORK

METAFLOW SCHEDULING PROBLEM

TRANSFORMATION INTO A NONLINEAR PROGRAMMING PROBLEM

TRANSFORMING THE NONLINEAR PROGRAMMING PROBLEM INTO AN LP

EXPERIMENTAL EVALUATION

COMPUTATIONAL COST

Findings

CONCLUSION

Journal: IEEE Access
Publication Date: Jan 1, 2019
License type: CC BY 4.0