Abstract

Currently, deploying machine learning workloads in the Cloud–Edge–IoT continuum is challenging due to the wide variety of available hardware platforms, stringent performance requirements, and the heterogeneity of the workloads themselves. To alleviate this, a novel, flexible approach to machine learning inference is introduced, suitable for deployment in diverse environments, including edge devices. The proposed solution has a modular design and is compatible with a wide range of user-defined machine learning pipelines. To improve energy efficiency and scalability, a high-performance communication protocol for inference is proposed, along with a scale-out mechanism based on a load balancer. The inference service plugs into the ASSIST-IoT reference architecture, thus taking advantage of its other components. The solution was evaluated in two scenarios closely emulating real-life use cases, with demanding workloads and requirements spanning several different deployment configurations. The evaluation results show that the proposed software meets the use cases' high-throughput and low-latency inference requirements while adapting effectively to the available hardware. The code and documentation, along with the data used in the evaluation, have been open-sourced to foster adoption of the solution.
