Abstract
The aim of this paper is to propose a resource allocation strategy for dynamic training and inference of machine learning tasks at the edge of the wireless network, with the goal of exploring the trade-off between energy, delay and learning accuracy. The scenario of interest is composed of a set of devices sending a continuous flow of data to an edge server that extracts relevant information running online learning algorithms, within the emerging framework known as Edge Machine Learning (EML). Taking into account the limitations of the edge servers, with respect to a cloud, and the scarcity of resources of mobile devices, we focus on the efficient allocation of radio (e.g., data rate, quantization) and computation (e.g., CPU scheduling) resources, to strike the best trade-off between energy consumption and quality of the EML service, including service end-to-end (E2E) delay and accuracy of the learning task. To this aim, we propose two different dynamic strategies: (i) The first method aims to minimize the system energy consumption, under constraints on E2E service delay and accuracy; (ii) the second method aims to optimize the learning accuracy, while guaranteeing an E2E delay and a bounded average energy consumption. Then, we present a dynamic resource allocation framework for EML based on stochastic Lyapunov optimization. Our low-complexity algorithms do not require any prior knowledge on the statistics of wireless channels, data arrivals, and data probability distributions. Furthermore, our strategies can incorporate prior knowledge regarding the model underlying the observed data, or can work in a totally data-driven fashion. Several numerical results on synthetic and real data assess the performance of the proposed approach.
Highlights
We live at the edge of a new revolution, characterized by a massive growth of data traffic and a pervasive introduction of artificial intelligence tools aimed to extract meaning from the data. 5G networks provide an efficient way to enable many different new services using a single communication platform
Since our work focuses on dynamic resource allocation strategies for computation offloading of machine learning tasks, in the sequel we review the general literature on dynamic computation offloading and the recent advances in Edge Machine Learning (EML)
We present some numerical results obtained with computer simulations, for the resource allocation strategy devised in Sec
Summary
We live at the edge of a new revolution, characterized by a massive growth of data traffic and a pervasive introduction of artificial intelligence tools aimed to extract meaning from the data. 5G networks provide an efficient way to enable many different new services using a single communication platform. Main pillars: ultra-reliable and low-latency communications (URLLC), enhanced mobile broadband (EMBB), and Massive Machine Type Communications (mMTC) [2]; ii) a pervasive deployment of cloud capabilities at the wireless network edge, to enable a plethora of services for different sectors (verticals), such as Industry 4.0, Internet of Things (IoT), autonomous driving, remote surgery, etc. This paradigm is well-known under different names in the literature, such as Edge Computing, or Fog Computing.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.