Edge AI as a Service: Configurable Model Deployment and Delay-Energy Optimization With Result Quality Constraints

Wenyu Zhang,Victor C M Leung,Sherali Zeadally,Wei Li,Jingyi Hou,Haijun Zhang

doi:10.1109/tcc.2022.3175725

Abstract

We propose a configurable model deployment architecture (CMDA) for edge AIaaS and present a flexible working mechanism by enabling the joint configuration of data quality ratios (DQRs) and model complexity ratios (MCRs) for the AI tasks. Along with commonly used resource allocation operations, the manager can improve the energy and delay performance of AI services with the desired quality of results (QoRs). We develop an energy-delay minimization problem under the framework of CMDA and propose a polynomial regression based relaxing method to solve the task configuration subproblem. We conduct experiments and simulations on the ImageNet classification and the common objects in context (COCO) object detection tasks using state-of-the-art deep learning models. We present the corresponding result quality tables (RQTs) and QoR regression models to illustrate the proposed method. The results of single task configuration and multi-task configuration and resource allocation on ImageNet classification and COCO object detection tasks demonstrate that the proposed method can achieve over 5× HDEC improvement compared with non-optimization schemes, and also show that joint configuration of DQR and MCR can achieve over 1.2× HDEC improvement compared with the methods that only configure DQR or MCR.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Edge AI as a Service: Configurable Model Deployment and Delay-Energy Optimization With Result Quality Constraints

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Cloud Computing

Lead the way for us

Journal: IEEE Transactions on Cloud Computing	Publication Date: Apr 1, 2023
Citations: 6

Similar Papers

How Low Can You Go? Using Synthetic 3D Imagery to Drastically Reduce Real-World Training Data for Object Detection
Zoe Gastelum ... Timothy Shead
-
Zoe Gastelum, et. al.Zoe Gastelum ... Timothy Shead
01 Sep 2020
01 Sep 2020

A New Perspective for Mining COCO Dataset
-
Iraqi Journal of Computer, Communication, Control and System Engineering | VOL. -
--
28 Sep 2023
Iraqi Journal of Computer, Communication, Control and System Engineering | VOL. -

A 3D-CAE-CNN model for Deep Representation Learning of 3D images
Emmanuel Pintelas ... Panagiotis Pintelas
Engineering Applications of Artificial Intelligence | VOL. 113
Emmanuel Pintelas, et. al.Emmanuel Pintelas ... Panagiotis Pintelas
27 May 2022
Engineering Applications of Artificial Intelligence | VOL. 113

The role of deep learning for periapical lesion detection on panoramic radiographs.
Berrin Çelik ... Mahmut Emin Çelik
Dento maxillo facial radiology | VOL. 52
Berrin Çelik, et. al.Berrin Çelik ... Mahmut Emin Çelik
18 Oct 2023
Dento maxillo facial radiology | VOL. 52

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Edge AI as a Service: Configurable Model Deployment and Delay-Energy Optimization With Result Quality Constraints

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Cloud Computing