Edge Intelligence

En Li,Xu Chen,Zhi Zhou

doi:10.1145/3229556.3229562

Abstract

As the backbone technology of machine learning, deep neural networks (DNNs) have have quickly ascended to the spotlight. Running DNNs on resource-constrained mobile devices is, however, by no means trivial, since it incurs high performance and energy overhead. While offloading DNNs to the cloud for execution suffers unpredictable performance, due to the uncontrolled long wide-area network latency. To address these challenges, in this paper, we propose Edgent, a collaborative and on-demand DNN co-inference framework with device-edge synergy. Edgent pursues two design knobs: (1) DNN partitioning that adaptively partitions DNN computation between device and edge, in order to leverage hybrid computation resources in proximity for real-time DNN inference. (2) DNN right-sizing that accelerates DNN inference through early-exit at a proper intermediate DNN layer to further reduce the computation latency. The prototype implementation and extensive evaluations based on Raspberry Pi demonstrate Edgent's effectiveness in enabling on-demand low-latency edge intelligence.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Edge Intelligence

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

An adaptive DNN inference acceleration framework with end–edge–cloud collaborative computing
Guozhi Liu ... Muhammad Bilal
Future Generation Computer Systems | VOL. 140
Guozhi Liu, et. al.Guozhi Liu ... Muhammad Bilal
04 Nov 2022
Future Generation Computer Systems | VOL. 140

Throughput Maximization of Delay-Aware DNN Inference in Edge Computing by Exploring DNN Model Partitioning and Inference Parallelism
Jing Li ... Weifa Liang
IEEE Transactions on Mobile Computing | VOL. 22
Jing Li, et. al.Jing Li ... Weifa Liang
01 May 2023
IEEE Transactions on Mobile Computing | VOL. 22

Delay-Aware DNN Inference Throughput Maximization in Edge Computing via Jointly Exploring Partitioning and Parallelism
Jing Li ... Weifa Liang
-
Jing Li, et. al.Jing Li ... Weifa Liang
04 Oct 2021
04 Oct 2021

Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing
En Li ... Zhi Zhou
IEEE Transactions on Wireless Communications | VOL. 19
En Li, et. al.En Li ... Zhi Zhou
25 Oct 2019
IEEE Transactions on Wireless Communications | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Edge Intelligence

Abstract

Talk to us

Similar Papers