Abstract
Artificial intelligence models deployed on power-efficient Internet-of-Things (IoT) devices suffer accuracy degradation because of the devices' limited power budgets. To mitigate this accuracy loss, an edge-server joint inference system is introduced. In such a system, allocating more workloads to the server side can reduce the accuracy loss, but the required data transmission adds to the power consumption of the edge device. Thus, in this article, we present a novel two-stage method for allocating workloads to the server or the edge that maximizes inference accuracy under a power constraint. In the first stage, we present a clusterwise threshold-based method for estimating the trustworthiness of a prediction made at the edge. In the second stage, we further determine the workload allocation of a trustworthy image based on the probability of the top-1 prediction and the power constraint. In addition, we propose a fine-tuning process for the pretrained model at the edge to achieve better accuracy. In the experiments, we apply the proposed method to several well-known deep neural network models. The results show that the proposed method can improve inference accuracy by up to 3.93% under a specific power constraint compared with previous methods.
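To illustrate the two-stage allocation idea described in the abstract, the sketch below shows how a top-1 softmax probability check could route a sample to the edge or the server under a transmission power budget. This is not the authors' implementation; the per-cluster thresholds (`cluster_thresholds`), the second-stage cutoff (`offload_threshold`), and the per-image transmission cost (`tx_cost`) are hypothetical names introduced here for illustration only.

```python
import numpy as np

def route_sample(edge_probs, cluster_id, cluster_thresholds,
                 offload_threshold, remaining_budget, tx_cost):
    """Decide whether to keep a prediction at the edge or offload it.

    edge_probs         -- softmax output of the edge model for one image
    cluster_id         -- cluster the image is assigned to (hypothetical)
    cluster_thresholds -- per-cluster trustworthiness thresholds (stage 1)
    offload_threshold  -- top-1 probability cutoff for stage 2
    remaining_budget   -- transmission power still available
    tx_cost            -- assumed power cost of sending one image to the server
    """
    top1_prob = float(np.max(edge_probs))

    # Stage 1: a prediction below its cluster threshold is treated as
    # untrustworthy and offloaded if the power budget still allows it.
    if top1_prob < cluster_thresholds[cluster_id]:
        if remaining_budget >= tx_cost:
            return "server", remaining_budget - tx_cost
        return "edge", remaining_budget  # budget exhausted, keep edge result

    # Stage 2: even a trustworthy prediction with a modest top-1 probability
    # may still be offloaded while the remaining budget permits.
    if top1_prob < offload_threshold and remaining_budget >= tx_cost:
        return "server", remaining_budget - tx_cost

    return "edge", remaining_budget
```

In this sketch, the power constraint is modeled simply as a running budget decremented by each transmission; the actual constraint formulation and threshold selection follow the method described in the paper.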