Abstract

Adaptive bitrate (ABR) streaming algorithms play an important role in ensuring a high Quality of Experience (QoE) for the consumer. However, many ABR algorithms are ad hoc heuristics. In response, methods based on a Markov Decision Process (MDP) offer more principled models; in particular, Reinforcement Learning (RL) methods optimize QoE metrics directly. However, RL methods suffer from high complexity and long convergence times due to their model-free nature. This paper proposes qMDP, an RL method whose MDP is partially modeled by an M/D/1/K queue. Our study shows that qMDP achieves higher QoE and faster convergence than a QoE-only, model-free variant.
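The abstract does not detail how the M/D/1/K queue models the playback buffer, so as a rough illustration only: an M/D/1/K queue has Poisson arrivals (rate lam), a deterministic service time, a single server, and at most `capacity` customers in the system. The sketch below is a minimal event-driven simulation of such a queue (the function name and parameters are hypothetical, not from the paper), estimating the blocking probability, i.e. the fraction of arrivals lost because the buffer is full:

```python
import random

def simulate_md1k(lam, service, capacity, n_arrivals, seed=0):
    """Estimate the blocking probability of an M/D/1/K queue
    (Poisson arrivals at rate `lam`, fixed service time, single
    server, at most `capacity` customers in the system)."""
    rng = random.Random(seed)
    t = 0.0
    departures = []  # departure times of customers currently in system
    blocked = 0
    for _ in range(n_arrivals):
        t += rng.expovariate(lam)  # exponential inter-arrival time
        # remove customers that have departed by time t
        while departures and departures[0] <= t:
            departures.pop(0)
        if len(departures) >= capacity:
            blocked += 1  # system full: arrival is lost
        else:
            # service starts when the server frees up (or now, if idle)
            start = departures[-1] if departures else t
            departures.append(start + service)
    return blocked / n_arrivals
```

Under heavy load (arrival rate well above the service rate) the estimated blocking probability grows toward the overload fraction, while under light load it stays near zero, which is the qualitative behavior one would expect a queue-based buffer model to capture.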
