Abstract

Due to the high complexity of artificial intelligence (AI) algorithms, performing the AI tasks on the resource-limited Internet-of-Things (IoT) devices has been proved to be inadvisable. Edge computing provides an effective computing paradigm for executing AI tasks, where large numbers of AI tasks can be offloaded to the edge servers. Most of the existing works focus on achieving efficient computing offload through improving the Quality of Service (QoS), such as reducing the average server-side delay. However, we show that those efforts are inefficient due to the heterogeneous impact of delays on users’ Quality of Experience (QoE). Inspired by the observations, in this article, we reconsider the scheduling method from an orthometric perspective, i.e., improving the QoE by designing a QoE-aware service-enhancement strategy for edge AI applications. Besides, multiple AI algorithms are utilized in our service model to execute the same type of tasks concurrently, thus meeting users’ heterogeneity requirements of accuracy and delays. Specifically, for the online arriving AI tasks, we optimize the task allocation and scheduling strategy according to the QoE sensitivity of each task. The model can be formulated as the mixed-integer nonlinear programming problem, which is known to be NP-hard. Hence, we then propose an efficient two-phase scheduling strategy for this problem. The results of comprehensive emulations validate that our model can effectively improve the average QoE of users and achieve a higher task completion ratio.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call