Seek Common While Shelving Differences: Orchestrating Deep Neural Networks for Edge Service Provisioning

Lixing Chen, Jie Xu

doi:10.1109/jsac.2020.3036953

Open Access

Journal: IEEE Journal on Selected Areas in Communications
Publication Date: Dec 16, 2020
Citations: 47
License type: publisher-specific-oa

Affiliation: University of Miami

Abstract

Edge computing (EC) platforms, which enable Application Service Providers (ASPs) to deploy applications in close proximity to users, provide ultra-low latency and location awareness to a rich portfolio of services. Because renting computing resources on edge servers incurs monetary costs, an ASP must carefully decide where to deploy its application and how many resources are needed to deliver satisfactory performance. However, the service provisioning problem exhibits complex correlations with many factors in EC systems, ranging from user behavior to computation offloading. These correlations are difficult to capture fully with mathematical modeling, and the high-dimensional state space they induce also defeats traditional machine learning techniques. The recent success of deep learning (DL) offers new tools for addressing this problem. While previous works provide valuable insights on applying DL techniques in EC systems, e.g., distributed DL, deep reinforcement learning (DRL), and multi-agent DL, none of these techniques alone can handle the distributed and heterogeneous nature of EC systems. To address these limitations, we propose a novel framework based on multi-agent DRL, distributed neural network orchestration (N2O), and knowledge distilling. Multi-agent DRL enables edge servers to learn deep neural networks that shelve the distinct features learned from local edge sites, and hence caters to the heterogeneity of EC systems. N2O coordinates edge servers in a fully distributed manner toward the common goal of maximizing the ASP’s reward. It requires only local communications during execution and provides provable performance guarantees. Knowledge distilling is further used to distill the N2O policy, reducing the communication overhead and stabilizing decision-making. We also carry out systematic experiments to show the advantages of our method over state-of-the-art alternatives.
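
As a rough illustration of the knowledge-distilling idea mentioned in the abstract, the sketch below computes a temperature-softened KL divergence between a "teacher" policy and a "student" policy over a small set of actions. The abstract does not state the distillation objective used by the authors, so the loss form (a common Hinton-style choice), the function names, the temperature, and the example logits are all assumptions made for illustration only.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax with temperature scaling."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) over temperature-softened action distributions.

    Assumed objective, not taken from the paper: a student trained to
    minimize this value imitates the teacher's action preferences.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


# Hypothetical logits over three candidate provisioning actions.
teacher = [2.0, 0.5, -1.0]
close_student = [1.8, 0.6, -0.9]   # roughly agrees with the teacher
far_student = [-1.0, 0.5, 2.0]     # prefers the opposite action

loss_close = distillation_loss(teacher, close_student)
loss_far = distillation_loss(teacher, far_student)
print(f"close student: {loss_close:.4f}, far student: {loss_far:.4f}")
```

A student whose preferences match the teacher's yields a loss near zero, while a student with reversed preferences yields a larger one; in a distillation setup, the student's parameters would be updated to drive this quantity down.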

Full Text