Abstract

As the primary driver of intelligent mobile applications, deep neural networks (DNNs) have gradually been deployed to millions of mobile devices, generating massive numbers of latency-sensitive and computation-intensive tasks daily. Mobile edge computing places computing resources at the network edge, which enables fine-grained offloading of DNN inference tasks from mobile devices to edge nodes. However, most existing studies have not systematically considered three crucial performance aspects: scheduling multiple streams of DNN inference tasks, leveraging multi-exit models to accelerate task processing, and partitioning inference models for partial offloading. To this end, this paper proposes an adaptive inference framework for mobile edge computing that dynamically selects the exit point and partition point for multiple inference task streams. We design a dynamic programming algorithm to obtain an efficient solution under the ideal condition that task arrival information is known in advance. Furthermore, we design a learning-based algorithm for online scheduling, whose training efficiency is improved through historical experience initialization and prioritized experience replay. Experimental results show that, compared with the greedy algorithm, the online algorithm improves performance under two varying environmental parameters by an average of 5.9% and 32%, respectively.
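To make the online scheduling ingredients concrete, the sketch below illustrates a prioritized replay buffer that is pre-filled with historical experience before online training, as mentioned in the abstract. This is a minimal illustration under our own assumptions, not the paper's implementation; the class name, the alpha/beta hyperparameters, and the placeholder transitions are all hypothetical.

```python
# Minimal sketch (assumed, not the paper's code) of prioritized experience
# replay with historical-experience initialization.
import numpy as np


class PrioritizedReplayBuffer:
    def __init__(self, capacity, alpha=0.6):
        self.capacity = capacity
        self.alpha = alpha          # how strongly priorities bias sampling
        self.buffer = []            # stored transitions
        self.priorities = []        # one priority per transition
        self.pos = 0

    def add(self, transition, priority=1.0):
        """Insert a transition; new samples get the current max priority."""
        p = max(self.priorities, default=priority)
        if len(self.buffer) < self.capacity:
            self.buffer.append(transition)
            self.priorities.append(p)
        else:
            self.buffer[self.pos] = transition
            self.priorities[self.pos] = p
            self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4):
        """Sample proportionally to priority^alpha and return IS weights."""
        probs = np.array(self.priorities) ** self.alpha
        probs /= probs.sum()
        idxs = np.random.choice(len(self.buffer), batch_size, p=probs)
        weights = (len(self.buffer) * probs[idxs]) ** (-beta)
        weights /= weights.max()    # normalize for stable updates
        batch = [self.buffer[i] for i in idxs]
        return batch, idxs, weights

    def update_priorities(self, idxs, td_errors, eps=1e-6):
        """Refresh priorities from the latest TD errors."""
        for i, err in zip(idxs, td_errors):
            self.priorities[i] = abs(float(err)) + eps


# Historical-experience initialization: pre-fill the buffer with logged
# (state, action, reward, next_state) tuples before online training starts.
buf = PrioritizedReplayBuffer(capacity=10_000)
historical = [((0,), 1, 0.5, (1,)), ((1,), 0, 0.2, (2,))]   # placeholder data
for tr in historical:
    buf.add(tr)
```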
