

Title: Index-aware reinforcement learning for adaptive video streaming at the wireless edge
We study adaptive video streaming for multiple users in wireless access edge networks with unreliable channels. The key challenge is to jointly optimize video bitrate adaptation and resource allocation so that the users' cumulative quality of experience is maximized. This problem is a finite-horizon restless multi-armed multi-action bandit problem and is provably hard to solve. To overcome this challenge, we propose a computationally appealing index policy, called the Quality Index Policy, which is well defined without the Whittle indexability condition and provably asymptotically optimal without the global attractor condition. Both conditions are required by most existing index policies yet are difficult to establish in general. Since the wireless access edge environment is highly dynamic, with system parameters unknown and time-varying, we further develop an index-aware reinforcement learning (RL) algorithm dubbed QA-UCB. We show that QA-UCB achieves sub-linear regret with low complexity, since it fully exploits the structure of the Quality Index Policy when making decisions. Extensive simulations using real-world traces demonstrate significant gains of the proposed policies over conventional approaches. We note that the proposed framework for designing index policies and index-aware RL algorithms is of independent interest and could be useful for other large-scale multi-user problems.
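To make the decision pattern concrete, here is a minimal Python sketch of an index-then-UCB scheduler in the spirit the abstract describes: each slot, every user gets an optimistic (UCB-inflated) quality index, and the budgeted number of users with the largest indices are served. The class name, the per-user reward model, and the bonus form are illustrative assumptions; the paper's Quality Index is derived from a restless-bandit relaxation and is richer than this.

```python
import numpy as np

class IndexUCBScheduler:
    """Toy index-then-UCB scheduler (illustrative, not the paper's QA-UCB).
    Each slot: compute an optimistic quality index per user, serve the
    budgeted top-B users, then refine the per-user QoE estimates."""

    def __init__(self, n_users, budget):
        self.n, self.B = n_users, budget
        self.counts = np.zeros(n_users)      # times each user was served
        self.mean_qoe = np.zeros(n_users)    # empirical per-user QoE reward

    def select(self, t):
        # UCB bonus shrinks as a user's statistics become reliable
        bonus = np.sqrt(2.0 * np.log(t + 1) / np.maximum(self.counts, 1.0))
        index = self.mean_qoe + bonus        # optimistic quality index
        return np.argsort(index)[-self.B:]   # serve the top-B users

    def update(self, served, rewards):
        for u, r in zip(served, rewards):
            self.counts[u] += 1
            self.mean_qoe[u] += (r - self.mean_qoe[u]) / self.counts[u]

# usage: serve 2 of 5 users per slot against a synthetic QoE model
rng = np.random.default_rng(0)
true_qoe = rng.uniform(0.2, 0.9, size=5)
sched = IndexUCBScheduler(n_users=5, budget=2)
for t in range(1, 1001):
    served = sched.select(t)
    sched.update(served, rng.binomial(1, true_qoe[served]))
```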
Award ID(s):
2148309
PAR ID:
10410843
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
MobiHoc '22: Proceedings of the Twenty-Third International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing
Page Range / eLocation ID:
81 to 90
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
1. We present Revel, a partially neural reinforcement learning (RL) framework for provably safe exploration in continuous state and action spaces. A key challenge for provably safe deep RL is that repeatedly verifying neural networks within a learning loop is computationally infeasible. We address this challenge using two policy classes: a general, neurosymbolic class with approximate gradients and a more restricted class of symbolic policies that allows efficient verification. Our learning algorithm is a mirror descent over policies: in each iteration, it safely lifts a symbolic policy into the neurosymbolic space, performs safe gradient updates to the resulting policy, and projects the updated policy back into the safe symbolic subset, all without requiring explicit verification of neural networks. Our empirical results show that Revel enforces safe exploration in many scenarios in which Constrained Policy Optimization does not, and that it can discover policies that outperform those learned through prior approaches to verified exploration.
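A toy Python sketch of the lift / safe-update / project loop described above, under simplifying assumptions: the "symbolic" class is linear policies, the "neurosymbolic" lift adds a one-layer tanh correction, the objective is a synthetic quadratic, and finite differences stand in for Revel's approximate gradients. Nothing here is the actual Revel implementation; it only illustrates the mirror-descent structure.

```python
import numpy as np

rng = np.random.default_rng(0)
S, A, N = 3, 1, 512                    # state dim, action dim, batch size

def J(actions, s):
    """Synthetic objective: higher is better when actions track -s[:, :A]."""
    return -np.mean((actions + s[:, :A]) ** 2)

W = np.zeros((S, A))                   # symbolic (linear) policy: a = s @ W

for _ in range(30):
    # 1) lift: wrap the symbolic policy with a small tanh correction
    V = rng.normal(0.0, 0.1, (S, A))   # fixed random features of the lift
    c = np.zeros(A)                    # trainable scale of the correction

    def lifted(s, cc):
        return s @ W + np.tanh(s @ V) * cc

    # 2) gradient step on the lifted policy's free parameter (finite
    #    differences stand in for Revel's approximate gradients; the
    #    safety machinery of the real system is not modeled here)
    s, eps = rng.normal(size=(N, S)), 1e-3
    grad = (J(lifted(s, c + eps), s) - J(lifted(s, c - eps), s)) / (2 * eps)
    c = c + 0.5 * grad

    # 3) project: least-squares fit of a linear policy to the lifted one,
    #    returning to the class that admits efficient verification
    W, *_ = np.linalg.lstsq(s, lifted(s, c), rcond=None)

print("final linear policy:", W.ravel())
```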
2. We study the dynamic cache dimensioning problem, where the objective is to decide how much storage to place in the cache so as to minimize the total cost of storage and content delivery latency. We formulate this problem as a Markov decision process, which turns out to be a restless multi-armed bandit problem and is provably hard to solve. For given dimensioning decisions, it is possible to develop solutions based on the celebrated Whittle index policy. However, the Whittle index policy has not been studied for dynamic cache dimensioning, mainly because dimensioning needs to be repeatedly solved and jointly optimized with content caching. To overcome this difficulty, we propose a low-complexity fluid Whittle index policy that jointly determines dimensioning and content caching, and we show that this policy is asymptotically optimal. We further develop a lightweight reinforcement-learning-augmented algorithm dubbed fW-UCB for the case where content request and delivery rates are unavailable. fW-UCB achieves sub-linear regret because it fully exploits the structure of the near-optimal fluid Whittle index policy, and it can therefore be easily implemented. Extensive simulations using real traces support our theoretical results.
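The joint "dimension and cache with one index" idea can be illustrated with a small Python sketch: UCB-estimate each content's request rate, score contents by an optimistic rate-versus-cost index, and cache every content with a positive score, so the cached set and the cache size are decided together. The index form and parameters here are illustrative assumptions, not the paper's fluid Whittle index.

```python
import numpy as np

class FluidIndexCache:
    """Toy cache dimensioning via a fluid-style index with UCB estimates
    (illustrative; the paper derives the actual fluid Whittle index)."""

    def __init__(self, n_contents, latency_saved=1.0, storage_cost=0.3):
        self.req = np.zeros(n_contents)    # request counts per content
        self.t = 0                         # slots observed so far
        self.w, self.c = latency_saved, storage_cost

    def decide(self):
        self.t += 1
        rate = self.req / self.t           # empirical request rate
        ucb = rate + np.sqrt(np.log(self.t + 1.0) / (2.0 * self.t))
        score = ucb * self.w - self.c      # fluid-style per-content index
        return np.flatnonzero(score > 0)   # cached set; its size = dimension

    def observe(self, requested):
        self.req[requested] += 1           # indices requested this slot

# usage: 100 contents, 5 uniform requests per slot
cache = FluidIndexCache(n_contents=100)
rng = np.random.default_rng(1)
for _ in range(500):
    cached = cache.decide()
    cache.observe(rng.integers(0, 100, size=5))
```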
3. We study off-dynamics Reinforcement Learning (RL), where the policy is trained on a source domain and deployed to a distinct target domain. We aim to solve this problem via online distributionally robust Markov decision processes (DRMDPs), where the learning algorithm actively interacts with the source domain while seeking optimal performance under the worst possible dynamics within an uncertainty set around the source domain's transition kernel. We provide the first study of online DRMDPs with function approximation for off-dynamics RL. We find that the dual formulation of DRMDPs can induce nonlinearity, even when the nominal transition kernel is linear, leading to error propagation. By designing a $d$-rectangular uncertainty set based on the total variation distance, we remove this additional nonlinearity and bypass the error propagation. We then introduce DR-LSVI-UCB, the first provably efficient online DRMDP algorithm for off-dynamics RL with function approximation, and establish a polynomial suboptimality bound that is independent of the state and action space sizes. Our work takes a first step toward a deeper understanding of the provable efficiency of online DRMDPs with linear function approximation. Finally, we substantiate the performance and robustness of DR-LSVI-UCB through numerical experiments.
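To illustrate the robustness notion involved, below is a small Python sketch of the worst-case expectation that a total-variation-ball robust Bellman backup evaluates: the adversary may move up to rho probability mass, and it is optimal to shift it from the highest-value next states onto the lowest-value one. This is a generic TV-robust building block, not DR-LSVI-UCB itself, which embeds such backups in least-squares value iteration with exploration bonuses.

```python
import numpy as np

def tv_robust_expectation(p, v, rho):
    """Worst-case expectation  min_{q : TV(q, p) <= rho}  q . v ."""
    val = float(p @ v)
    v_min = v.min()
    budget = rho
    for i in np.argsort(v)[::-1]:          # states from best to worst
        if v[i] <= v_min or budget <= 0:
            break
        move = min(p[i], budget)           # mass the adversary shifts away
        val -= move * (v[i] - v_min)
        budget -= move
    return val

# usage: a nominal model that mostly reaches a high-value state
p = np.array([0.7, 0.2, 0.1])
v = np.array([10.0, 5.0, 0.0])
print(tv_robust_expectation(p, v, rho=0.1))   # nominal 8.0 -> robust 7.0
```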
4. The adaptive bitrate selection (ABR) mechanism, which decides the bitrate for each video chunk, is an important part of video streaming. There has been significant interest in developing Reinforcement-Learning (RL) based ABR algorithms because of their ability to learn efficient bitrate actions from past data and their demonstrated improvements over wired, 3G, and 4G networks. However, the Quality of Experience (QoE), and especially the video stall time, of state-of-the-art ABR algorithms, including the RL-based approaches, falls short of expectations over commercial mmWave 5G networks due to widely and wildly fluctuating throughput. These algorithms find optimal policies for a multi-objective unconstrained problem, where the policies inherently depend on predefined weight parameters for the multiple objectives (e.g., bitrate maximization, stall-time minimization). Our empirical evaluation suggests that such a policy cannot adequately adapt to the high variations of 5G throughput, resulting in long stall times. To address these issues, we formulate the ABR selection problem as a constrained Markov decision process whose objective is to maximize QoE subject to a stall-time constraint. The strength of this formulation is that it helps mitigate stall time while maintaining high bitrates. We propose COREL, a primal-dual actor-critic RL algorithm that, compared to existing RL-based approaches, incorporates an additional critic network to estimate stall time and can tune the optimal dual variable, or weight, to guide the policy toward minimizing stall time. Our experimental results across various commercial mmWave 5G traces reveal that COREL reduces the average stall time by a factor of 4 and the 95th percentile by a factor of 2.
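The primal-dual mechanics are easy to sketch in Python. Below, the actor-critic networks and the extra stall-time critic are abstracted away, leaving only how the Lagrangian reward and the dual-variable update interact; the class name and learning rates are illustrative assumptions, not COREL's actual code.

```python
class PrimalDualABR:
    """Toy primal-dual wrapper in the spirit of a constrained-MDP ABR agent.
    The actor-critic optimizes the Lagrangian reward qoe - lam * stall,
    while projected dual ascent on lam pushes the long-run stall time
    toward the target budget."""

    def __init__(self, stall_budget, lr_dual=0.05):
        self.budget = stall_budget     # allowed stall seconds per chunk
        self.alpha = lr_dual
        self.lam = 0.0                 # dual variable weighting the constraint

    def shaped_reward(self, qoe, stall):
        # what the actor-critic actually maximizes for each chunk
        return qoe - self.lam * stall

    def dual_step(self, avg_stall):
        # projected dual ascent: raise lam while the constraint is violated,
        # relax it once stalls fall below the budget
        self.lam = max(0.0, self.lam + self.alpha * (avg_stall - self.budget))
```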