Model and system robustness in distributed CNN inference at the edge

Xiaotian Guo,Quan Jiang,Andy D Pimentel,Todor Stefanov

doi:10.1016/j.vlsi.2024.102299

Abstract

Prevalent large CNN models pose a significant challenge in terms of computing resources for resource-constrained devices at the Edge. Distributing the computations and coefficients over multiple edge devices collaboratively has been well studied but these works generally do not consider the presence of device failures (e.g., due to temporary connectivity issues, overload, discharged battery of edge devices). Such unpredictable failures can compromise the reliability of edge devices, inhibiting the proper execution of distributed CNN inference. In this paper, we present a novel partitioning method, called RobustDiCE, for robust distribution and inference of CNN models over multiple edge devices. Our method can tolerate intermittent and permanent device failures in a distributed system at the Edge, offering a tunable trade-off between robustness (i.e., retaining model accuracy after failures) and resource utilization. We verify the system’s robustness by validating the overall end-to-end latency under failures. We evaluate RobustDiCE using the ImageNet-1K dataset on several representative CNN models under various device failure scenarios and compare it with several state-of-the-art partitioning methods as well as an optimal robustness approach (i.e., full neuron replication). In addition, we demonstrate RobustDiCE’s advantages in terms of memory usage and energy consumption per device, and system throughput for various system setups with different device counts.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Model and system robustness in distributed CNN inference at the edge

Abstract

Talk to us

Similar Papers

More From: Integration

Lead the way for us

Similar Papers

Efficient Computer Vision on Edge Devices with Pipeline-Parallel Hierarchical Neural Networks
Abhinav Goel ... George K Thiruvathukal
-
Abhinav Goel, et. al.Abhinav Goel ... George K Thiruvathukal
17 Jan 2022
17 Jan 2022

EC-SNN: Splitting Deep Spiking Neural Networks for Edge Devices
Di Yu ... Xin Du
-
Di Yu, et. al.Di Yu ... Xin Du
01 Aug 2024
01 Aug 2024

Risk Assessment Edge Contract for Efficient Resource Allocation
Minghui Sheng ... Maode Ma
Mathematics | VOL. 12
Minghui Sheng, et. al.Minghui Sheng ... Maode Ma
26 Mar 2024
Mathematics | VOL. 12

Automated Exploration and Implementation of Distributed CNN Inference at the Edge
Xiaotian Guo ... Andy D Pimentel
IEEE Internet of Things Journal | VOL. 10
Xiaotian Guo, et. al.Xiaotian Guo ... Andy D Pimentel
01 Apr 2023
IEEE Internet of Things Journal | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Model and system robustness in distributed CNN inference at the edge

Abstract

Talk to us

Similar Papers

More From: Integration