Differentiate Quality of Experience Scheduling for Deep Learning Inferences With Docker Containers in the Cloud

Ying Mao,Ming Chen,Long Cheng,Weifeng Yan,Yun Song,Qingzhi Liu,Yue Zeng

doi:10.1109/tcc.2022.3154117

Abstract

With the prevalence of big-data-driven applications, such as face recognition on smartphones and tailored recommendations from Google Ads, we are on the road to a lifestyle with significantly more intelligence than ever before. Various neural network powered models are running at the back end of their intelligence to enable quick responses to users. Supporting those models requires lots of cloud-based computational resources, e.g., CPUs and GPUs. The cloud providers charge their clients by the amount of resources that they occupy. Clients have to balance the budget and quality of experiences (e.g., response time). The budget leans on individual business owners, and the required Quality of Experience (QoE) depends on usage scenarios of different applications. For instance, an autonomous vehicle requires an real-time response, but unlocking your smartphone can tolerate delays. However, cloud providers fail to offer a QoE-based option to their clients. In this paper, we propose DQoES, differentiated quality of experience scheduler for deep learning inferences. DQoES accepts clients' specifications on targeted QoEs, and dynamically adjusts resources to approach their targets. Through the extensive cloud-based experiments, DQoES demonstrates that it can schedule multiple concurrent jobs with respect to various QoEs and achieve up to 8x times more satisfied models when compared to the existing system.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Differentiate Quality of Experience Scheduling for Deep Learning Inferences With Docker Containers in the Cloud

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Cloud Computing

Lead the way for us

Journal: IEEE Transactions on Cloud Computing	Publication Date: Apr 1, 2023
Citations: 12

Similar Papers

Identifying Persistent and Recurrent QoE Anomalies for DASH Streaming in the Cloud
Chen Wang ... Ricardo Morla
-
Chen Wang, et. al.Chen Wang ... Ricardo Morla
01 Dec 2017
01 Dec 2017

QoE Analysis of the Setup of Different Internet Services for FIFO Server Systems
Tobias Hoßfeld ... Lea Skorin-Kapov
-
Tobias Hoßfeld, et. al.Tobias Hoßfeld ... Lea Skorin-Kapov
01 Jan 2018
01 Jan 2018

Insight based dynamic QoE management in LTE
Norbert Radics ... Csaba Vulkan
-
Norbert Radics, et. al.Norbert Radics ... Csaba Vulkan
01 Aug 2015
01 Aug 2015

Users Know Better: A QoE Based Adaptive Control System for VoD in the Cloud
Chen Wang ... Ricardo Morla
-
Chen Wang, et. al.Chen Wang ... Ricardo Morla
01 Dec 2015
01 Dec 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Differentiate Quality of Experience Scheduling for Deep Learning Inferences With Docker Containers in the Cloud

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Cloud Computing