Abstract

Deep neural network (DNN) inference with low-delay and high-accuracy requirements is usually computation-intensive. Collaboration between mobile devices and the network edge is a promising way to support such inference. Moreover, the sampling rates of mobile devices can be dynamically configured to adapt to network conditions, which can be exploited to minimize the inference service delay. In this chapter, we first introduce the concept of DNN inference and its two underlying technologies, i.e., mobile edge computing and machine learning. We then present a case study on collaborative DNN inference via device-edge orchestration. Specifically, taking channel variation and task-arrival randomness into consideration, we formulate the DNN inference delay minimization problem as a constrained Markov decision process (CMDP), in which sampling rate adaptation, inference task offloading, and edge computing resource allocation are jointly optimized while guaranteeing the long-term accuracy requirements of different inference services. To solve the problem, we propose a learning-based solution with three steps. First, the CMDP is transformed into an MDP by leveraging the Lyapunov optimization technique. Second, a deep reinforcement learning (RL) algorithm is proposed to solve the transformed MDP. Third, an optimization subroutine is embedded in the deep RL algorithm to directly obtain the optimal edge computing resource allocation, thereby expediting the training process. Simulation results demonstrate that the proposed algorithm reduces the average service delay while preserving the long-term inference accuracy with high probability.

Keywords: DNN inference, Mobile edge computing, Reinforcement learning, Lyapunov optimization, Constrained Markov decision process, Adaptive rate sampling
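To make the first step concrete, the following is a minimal sketch of how a Lyapunov drift-plus-penalty argument typically converts a long-term accuracy constraint into a per-slot objective. The symbols here (virtual queue Z_i(t), achieved accuracy A_i(t), requirement A_i^req, service delay D(t), and tradeoff weight V) are illustrative assumptions, not notation taken from the chapter.

% Accuracy-deficit virtual queue for service i
Z_i(t+1) = \max\{ Z_i(t) + A_i^{\mathrm{req}} - A_i(t),\, 0 \}
% Quadratic Lyapunov function and one-slot conditional drift
L(t) = \tfrac{1}{2} \sum_i Z_i(t)^2, \qquad \Delta(t) = \mathbb{E}\left[ L(t+1) - L(t) \mid \mathbf{Z}(t) \right]
% Per-slot drift-plus-penalty objective replacing the long-term constraint
\min_{\text{action}} \; \Delta(t) + V \, \mathbb{E}\left[ D(t) \mid \mathbf{Z}(t) \right]

Keeping every Z_i(t) stable enforces the corresponding long-term accuracy requirement, so minimizing the drift-plus-penalty term in each slot yields an unconstrained MDP whose reward is the negative of this objective.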

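The third step can be illustrated similarly. Below is a small, self-contained Python/numpy sketch (our own illustration, not the chapter's code) of embedding an optimization subroutine in an RL step: once the agent's discrete action fixes the sampling rates and offloading decisions, and hence the offloaded workloads, the edge CPU allocation minimizing the total processing delay has a closed form by the KKT conditions and can be computed directly rather than learned. The names optimal_allocation and slot_reward, and the linear delay model w_k/f_k, are assumptions made for this sketch.

import numpy as np

def optimal_allocation(workloads, f_total):
    """Closed-form minimizer of sum_k w_k / f_k subject to sum_k f_k = f_total.

    By the KKT conditions, the optimal f_k is proportional to sqrt(w_k).
    """
    s = np.sqrt(workloads)
    return f_total * s / s.sum()

def slot_reward(workloads, f_total, z, acc_req, acc, v=10.0):
    """Per-slot drift-plus-penalty reward seen by the RL agent.

    The discrete action determines `workloads` and the achieved accuracies
    `acc`; the continuous resource allocation is resolved optimally by the
    subroutine above, shrinking the space the agent must explore.
    """
    f = optimal_allocation(workloads, f_total)
    delay = np.sum(workloads / f)           # total edge processing delay
    drift = np.dot(z, acc_req - acc)        # virtual-queue (accuracy) term
    return -(v * delay + drift)

# Toy usage with made-up numbers.
rng = np.random.default_rng(0)
w = rng.uniform(1.0, 5.0, size=3)           # offloaded workloads (cycles)
print(optimal_allocation(w, f_total=10.0))  # optimal CPU split
print(slot_reward(w, 10.0, z=np.ones(3), acc_req=np.full(3, 0.9),
                  acc=np.array([0.88, 0.92, 0.90])))

Because the allocation is recovered in closed form inside the reward evaluation, the agent only explores the discrete action dimensions, which is one plausible reading of why the embedded subroutine expedites training.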