Online Learning for Orchestration of Inference in Multi-user End-edge-cloud Networks

Sina Shahhosseini, Sung-Soo Lim, Anil Kanduri, Nikil Dutt, Amir M Rahmani, Bryan Donyanavard, Tianyi Hu, Dongjoo Seo

doi:10.1145/3520129

Abstract

Deep-learning-based intelligent services have become prevalent in cyber-physical applications, including smart cities and healthcare. Deploying deep-learning-based intelligence near the end-user enhances privacy protection, responsiveness, and reliability. However, resource-constrained end-devices must be carefully managed to meet the latency and energy requirements of computationally intensive deep learning services. Collaborative end-edge-cloud computing for deep learning provides a range of performance and efficiency that can address application requirements through computation offloading. The decision to offload computation is a communication-computation co-optimization problem that varies with both system parameters (e.g., network condition) and workload characteristics (e.g., inputs). In addition, deep learning model optimization provides another source of tradeoff, between latency and model accuracy. An end-to-end decision-making solution that considers this computation-communication problem is required to synergistically find the optimal offloading policy and model for deep learning services. To this end, we propose a reinforcement-learning-based computation offloading solution that learns the optimal offloading policy, taking deep learning model selection techniques into account, to minimize response time while providing sufficient accuracy. We demonstrate the effectiveness of our solution for edge devices in an end-edge-cloud system and evaluate it with a real-setup implementation using multiple AWS and ARM core configurations. Our solution provides a 35% speedup in average response time compared to the state-of-the-art with less than 0.9% accuracy reduction, demonstrating the promise of our online learning framework for orchestrating DL inference in end-edge-cloud systems.
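
The joint choice of offloading target and model variant described in the abstract can be illustrated with a toy learner. The sketch below is not the authors' algorithm: it swaps in an epsilon-greedy bandit, a stateless simplification of reinforcement learning, and every name and number in it (`ACTIONS`, `MIN_ACCURACY`, `PENALTY_MS`, the latency and accuracy profiles) is a hypothetical value chosen for illustration, not a measurement from the paper.

```python
import random

# Hypothetical action space: (execution tier, model variant).
# Latency and accuracy values are made up for illustration only.
ACTIONS = {
    ("end", "small"):   {"latency_ms": 120.0, "accuracy": 0.70},
    ("end", "large"):   {"latency_ms": 400.0, "accuracy": 0.76},
    ("edge", "large"):  {"latency_ms": 90.0,  "accuracy": 0.76},
    ("cloud", "large"): {"latency_ms": 150.0, "accuracy": 0.76},
}
MIN_ACCURACY = 0.75   # assumed accuracy requirement
PENALTY_MS = 1000.0   # assumed penalty for violating the requirement


def reward(action, rng):
    """Negative simulated response time, penalized if accuracy is too low."""
    profile = ACTIONS[action]
    # Simulate +/-10% variation in response time (e.g. network jitter).
    latency = profile["latency_ms"] * rng.uniform(0.9, 1.1)
    if profile["accuracy"] < MIN_ACCURACY:
        latency += PENALTY_MS
    return -latency


def learn_policy(steps=2000, epsilon=0.1, alpha=0.1, seed=0):
    """Epsilon-greedy value learning over the (tier, model) actions."""
    rng = random.Random(seed)
    actions = list(ACTIONS)
    q = {a: 0.0 for a in actions}  # running value estimate per action
    for _ in range(steps):
        if rng.random() < epsilon:
            a = rng.choice(actions)        # explore
        else:
            a = max(actions, key=q.get)    # exploit the current best estimate
        # Move the estimate a fraction alpha toward the observed reward.
        q[a] += alpha * (reward(a, rng) - q[a])
    return max(actions, key=q.get), q


if __name__ == "__main__":
    best, q_values = learn_policy()
    print("selected action:", best)
    for action, value in q_values.items():
        print(action, round(value, 1))
```

With these made-up profiles the learner settles on running the large model at the edge, because that action has the lowest simulated response time among those meeting the accuracy requirement. A fuller treatment would add state (network condition, workload, number of users) to the decision, which this sketch deliberately leaves out.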


Journal: ACM Transactions on Embedded Computing Systems
Publication Date: Nov 30, 2022
Citations: 10

Similar Papers

Hybrid Learning for Orchestrating Deep Learning Inference in Multi-user Edge-cloud Networks
Sina Shahhosseini ... Nikil Dutt
06 Apr 2022

Abstract 184: The utility of deep metric learning for breast cancer identification on mammographic images
Justin Du ... Sanjay Aneja
Cancer Research | VOL. 81
01 Jul 2021

Explainable artificial intelligence (XAI) for predicting the need for intubation in methanol-poisoned patients: a study comparing deep and machine learning models
Khadijeh Moulaei ... Mitra Rahimi
Scientific Reports | VOL. 14
08 Jul 2024

P–260 Towards better explainable deep learning models for embryo selection in ART
...
Human Reproduction | VOL. 36
06 Aug 2021
