Reliability-Aware Online Scheduling for DNN Inference Tasks in Mobile-Edge Computing

Huirong Ma,Zhi Zhou,Xu Chen,Xiaoxi Zhang,Rui Li

doi:10.1109/jiot.2023.3243266

Abstract

Mobile Edge Computing (MEC) is widely envisioned as a promising technique for provisioning artificial intelligence (AI) capability for resource-limited Internet of Things (IoT) devices by leveraging edge servers (ESs) for executing Deep Neural Network (DNN) inference tasks in proximity. However, scheduling DNN inference tasks at the network edge under unknown system dynamics (e.g., uncertain availability of ESs) may suffer from failures, making it difficult to guarantee reliable services for the IoT device. To overcome this challenge, we propose a reliability-aware online scheduling scheme for DNN inference tasks in MEC by leveraging both online feedback and offline data to learn the uncertain availability of ESs to maximize both the inference accuracy and service reliability of DNN inference tasks (i.e., the number of DNN inference tasks processed during the system span). We first formulate the reliability-aware DNN inference tasks scheduling problem as a novel constrained combinatorial multi-armed bandit (CMAB) problem. Then by integrating the Lyapunov optimization technique, bandit learning, approximated submodular maximization, and historical data organically, we design a Reliability-Aware Task scheduling scheme with Bandit Learning (RTBL) algorithm to solve this problem. Unfortunately, even with an accurate prediction of the system uncertainties, the task scheduling problem is still NP-hard. To deal with it, we therefore design an advanced approximation algorithm based on the submodularity of the scheduling problem which obtains a near-optimal solution and provides a satisfactory performance guarantee. Finally, we conduct rigorous theoretical analysis and race-driven simulations to show RTBL’s brilliant performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Reliability-Aware Online Scheduling for DNN Inference Tasks in Mobile-Edge Computing

Abstract

Talk to us

Similar Papers

More From: IEEE Internet of Things Journal

Lead the way for us

Journal: IEEE Internet of Things Journal	Publication Date: Jul 1, 2023
Citations: 14

Similar Papers

Throughput Maximization of Delay-Aware DNN Inference in Edge Computing by Exploring DNN Model Partitioning and Inference Parallelism
Jing Li ... Weifa Liang
IEEE Transactions on Mobile Computing | VOL. 22
Jing Li, et. al.Jing Li ... Weifa Liang
01 May 2023
IEEE Transactions on Mobile Computing | VOL. 22

Delay-Aware DNN Inference Throughput Maximization in Edge Computing via Jointly Exploring Partitioning and Parallelism
Jing Li ... Weifa Liang
-
Jing Li, et. al.Jing Li ... Weifa Liang
04 Oct 2021
04 Oct 2021

Joint Optimization of DNN Partition and Continuous Task Scheduling for Digital Twin-Aided MEC Network With Deep Reinforcement Learning
Siyu Yuan ... Qin Li
IEEE Access | VOL. 11
Siyu Yuan, et. al.Siyu Yuan ... Qin Li
01 Jan 2023
IEEE Access | VOL. 11

MagicBatch: An Energy-Aware Scheduling Framework for DNN Inference on Heterogeneous Edge Servers in Space-Air-Ground Computation
Di Liu ... Aolin Zhang
-
Di Liu, et. al.Di Liu ... Aolin Zhang
01 Jan 2023
01 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reliability-Aware Online Scheduling for DNN Inference Tasks in Mobile-Edge Computing

Abstract

Talk to us

Similar Papers

More From: IEEE Internet of Things Journal