Scheduling In-Band Network Telemetry With Convergence-Preserving Federated Learning

Yibo Jin,Ning Chen,Sheng Zhang,Mingtao Ji,Lei Jiao,Sanglu Lu,Zhuzhong Qian

doi:10.1109/tnet.2023.3253302

Abstract

Conducting federated learning across distributed sites with In-Band Network Telemetry (INT) based data collection faces critical challenges, including control decisions of different frequencies, convergence of the models being trained, and resource provisioning coupled over time. To study this problem, we formulate a non-linear mixed-integer program to optimize the long-term INT overhead, resource cost, and federated learning cost. We then design polynomial-time online algorithms to solve this problem with only observable inputs on the fly, featuring laziness-aware resource adaption, online-learning-based INT flow selection and model aggregation control, as well as expectation-preserving randomized dependent rounding. We rigorously prove the parameterized-constant competitive ratio of our approach against the offline optimum, and the time-averaged constraint violation that vanishes in the long run. With extensive trace-driven evaluations, we confirm the superiority of our approach over other alternative approaches for reducing total cost and the efficacy of our trained models for solving real machine learning problems, reducing the real-time cost by 34% on average.
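The abstract names expectation-preserving randomized dependent rounding but does not spell out the procedure. As a rough illustration only, the following sketch shows a generic pairwise dependent rounding scheme of the kind commonly used to turn fractional decisions into 0/1 decisions. It is not the authors' algorithm: the function name `dependent_round`, the tolerance `eps`, and the example vector are all hypothetical. The two properties it is meant to illustrate are that each entry is rounded to 1 with probability equal to its fractional value, and that the sum of the entries is kept unchanged whenever that sum is an integer.

```python
import random


def dependent_round(x, rng=None, eps=1e-9):
    """Round a vector in [0, 1]^n to {0, 1}^n (hypothetical helper).

    Generic pairwise dependent rounding: E[result[i]] == x[i], and the
    sum is preserved whenever sum(x) is an integer.
    """
    rng = rng or random.Random()
    x = list(x)

    def fractional():
        # Indices whose values are still strictly between 0 and 1.
        return [i for i, v in enumerate(x) if eps < v < 1 - eps]

    frac = fractional()
    while len(frac) >= 2:
        i, j = frac[0], frac[1]
        a = min(1 - x[i], x[j])  # mass that can move from j to i
        b = min(x[i], 1 - x[j])  # mass that can move from i to j
        # Probabilities are chosen so the expected change of each entry is 0.
        if rng.random() < b / (a + b):
            x[i] += a
            x[j] -= a
        else:
            x[i] -= b
            x[j] += b
        # At least one of the two entries is now integral.
        frac = fractional()

    if frac:
        # A single leftover fractional entry is rounded independently.
        i = frac[0]
        x[i] = 1.0 if rng.random() < x[i] else 0.0

    return [int(round(v)) for v in x]


if __name__ == "__main__":
    # Example fractional decisions that sum to 2; the output is random,
    # but it always contains exactly two ones.
    print(dependent_round([0.5, 0.5, 0.3, 0.7], random.Random(0)))
```

Moving mass between pairs of entries, rather than rounding each entry independently, is what couples the decisions: independent rounding would also match the marginals, but could select too many or too few items in any single draw.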

Journal: IEEE/ACM transactions on networking : a joint publication of the IEEE Communications Society, the IEEE Computer Society, and the ACM with its Special Interest Group on Data Communication
Publication Date: Oct 1, 2023
Citations: 9

