Two-Level Scheduling Algorithms for Deep Neural Network Inference in Vehicular Networks

Yalan Wu,Mianyang Yao,Long Chen,Siew Kei Lam,Jigang Wu,Bosheng Liu

doi:10.1109/tits.2023.3266795

Abstract

In vehicular networks, task scheduling at the microarchitecture-level and network-level offers tremendous potential to improve the quality of computing services for deep neural network (DNN) inference. However, existing task scheduling works only focus on either one of the two levels, which results in inefficient utilization of computing resources. This paper aims to fill this gap by formulating a two-level scheduling problem for DNN inference tasks in a vehicular network, with an objective of minimizing total weighted sum of response time and energy consumption for all tasks under the following constraints: per task response time, per vehicle energy consumption, per vehicle storage capacity. We first formulate the problem and prove that it is NP-hard. A group transformation based algorithm, called GTA, is proposed. GTA makes scheduling decisions at the network-level using the group transformation based approach, and at the microarchitecture-level using a greedy strategy. In addition, an algorithm, denoted as DRL, is proposed to decrease total weighted sum of response time and energy consumption for all tasks. DRL trains two models with deep reinforcement learning to achieve two-level scheduling. The proposed algorithms are evaluated on a platform consisting of a desktop, Raspberry Pi, Eyeriss, OSM, SUMO, NS-3. Simulation results show that DRL outperforms the state-of-the-art methods for all cases, while the proposed GTA outperforms the state-of-the-art methods for most cases, in terms of total weighted sum of response time and energy consumption. Compared with four baseline algorithms, GTA and DRL reduce the total weighted sum of response time and energy consumption by <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$41.49\%$</tex-math> </inline-formula> and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$62.38\%$</tex-math> </inline-formula> , on average respectively, for different numbers of tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Two-Level Scheduling Algorithms for Deep Neural Network Inference in Vehicular Networks

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Intelligent Transportation Systems

Lead the way for us

Journal: IEEE Transactions on Intelligent Transportation Systems	Publication Date: Sep 1, 2023
Citations: 3

Similar Papers

Throughput Maximization of Delay-Aware DNN Inference in Edge Computing by Exploring DNN Model Partitioning and Inference Parallelism
Jing Li ... Weifa Liang
IEEE Transactions on Mobile Computing | VOL. 22
Jing Li, et. al.Jing Li ... Weifa Liang
01 May 2023
IEEE Transactions on Mobile Computing | VOL. 22

Delay-Aware DNN Inference Throughput Maximization in Edge Computing via Jointly Exploring Partitioning and Parallelism
Jing Li ... Weifa Liang
-
Jing Li, et. al.Jing Li ... Weifa Liang
04 Oct 2021
04 Oct 2021

A Strategy to Accelerate the Inference of a Complex Deep Neural Network
P Haseena Rahmath ... Kuldeep Chaurasia
-
P Haseena Rahmath, et. al.P Haseena Rahmath ... Kuldeep Chaurasia
01 Jan 2023
01 Jan 2023

MagicBatch: An Energy-Aware Scheduling Framework for DNN Inference on Heterogeneous Edge Servers in Space-Air-Ground Computation
Di Liu ... Aolin Zhang
-
Di Liu, et. al.Di Liu ... Aolin Zhang
01 Jan 2023
01 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Two-Level Scheduling Algorithms for Deep Neural Network Inference in Vehicular Networks

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Intelligent Transportation Systems