Abstract

Efforts to leverage the benefits of Deep Learning (DL) models for inference on resource-constrained embedded devices have become widespread. Researchers worldwide are developing software and hardware accelerators that make pre-trained DL models suitable for running the inference phase on these devices. Beyond software and hardware acceleration, partitioning DL models and offloading parts of them to Cloud or Edge network servers is becoming increasingly practicable as Edge Computing gains importance. Partitioning and offloading the DL inference workflow can augment software/hardware acceleration, improving latency and energy efficiency in resource-constrained embedded systems. The efficacy of a computation offloading system depends on accurate profiling of the time and energy required to process DL algorithms. In this work we implement a DL inference offloading system on a Raspberry Pi 3-based robot vehicle with an Intel Neural Compute Stick hardware accelerator. We report the workload partitioning approach, detailed experimental results, and the performance improvements achieved. We demonstrate that the current approach of profiling DL execution without considering the dynamic system load of the edge device results in sub-optimal partitioning of the DL algorithm, and we propose a solution approach to address this.
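To make the partitioning idea concrete, the following is a minimal sketch (not the paper's implementation) of load-aware partition-point selection for a linear chain of DL layers. The layer names, profile figures, bandwidth value, and the linear load-inflation model are all illustrative assumptions.

```python
# Minimal sketch of load-aware DNN partition-point selection.
# All numbers and the load model below are hypothetical, for illustration only.

from dataclasses import dataclass

@dataclass
class LayerProfile:
    name: str
    device_ms: float   # profiled on-device execution time when idle
    server_ms: float   # profiled execution time on the edge server
    output_kb: float   # activation size to transmit if the model is cut here

def best_partition(layers, input_kb, bandwidth_kbps, device_load=0.0):
    """Pick the cut index minimizing estimated end-to-end latency.

    Layers [0, cut) run locally and [cut, n) run on the server.
    device_load in [0, 1) inflates local latency to model the dynamic
    system load that static, idle-time profiling ignores (a simple
    linear model assumed here).
    """
    inflate = 1.0 / (1.0 - device_load)
    n = len(layers)
    best_cut, best_ms = 0, float("inf")
    for cut in range(n + 1):
        local_ms = inflate * sum(l.device_ms for l in layers[:cut])
        remote_ms = sum(l.server_ms for l in layers[cut:])
        if cut == n:                      # fully local: nothing to transmit
            tx_ms = 0.0
        else:                             # send raw input or the cut activation
            kb = input_kb if cut == 0 else layers[cut - 1].output_kb
            tx_ms = kb / bandwidth_kbps * 1000.0
        total = local_ms + tx_ms + remote_ms
        if total < best_ms:
            best_cut, best_ms = cut, total
    return best_cut, best_ms

if __name__ == "__main__":
    # Hypothetical three-layer profile: the fully connected layer is
    # expensive on the embedded device but cheap on the server.
    net = [
        LayerProfile("conv1", device_ms=40.0,  server_ms=4.0,  output_kb=800.0),
        LayerProfile("pool1", device_ms=5.0,   server_ms=0.5,  output_kb=200.0),
        LayerProfile("fc1",   device_ms=200.0, server_ms=20.0, output_kb=4.0),
    ]
    for load in (0.0, 0.5):
        cut, ms = best_partition(net, input_kb=600.0,
                                 bandwidth_kbps=1000.0, device_load=load)
        print(f"load={load:.1f}: cut after layer index {cut}, est. {ms:.1f} ms")
```

With these assumed numbers, the idle device runs the whole network locally, but at 50% load the inflated on-device cost moves the optimal cut earlier so the expensive layer is offloaded; this is exactly the effect a static execution profile misses.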
