

Title: PnP-DRL: A Plug-and-Play Deep Reinforcement Learning Approach for Experience-Driven Networking
While deep reinforcement learning (DRL) has emerged as a de facto approach to many complex experience-driven networking problems, deploying DRL in real systems remains challenging. Because of random exploration and half-trained deep neural networks during online training, a DRL agent may make unexpected decisions that degrade system performance or even crash the system. In this paper, we propose PnP-DRL, an offline-trained, plug-and-play DRL solution that leverages batch reinforcement learning to learn the best control policy from pre-collected transition samples, without interacting with the system. Once trained without system interaction, our plug-and-play DRL agent starts working seamlessly, with no additional exploration and no disruption of the running system. We implement and evaluate PnP-DRL on a prevalent experience-driven networking problem, Dynamic Adaptive Streaming over HTTP (DASH). Extensive experimental results show that 1) the existing batch reinforcement learning method has its limits; 2) PnP-DRL significantly outperforms classical adaptive bitrate algorithms in average user Quality of Experience (QoE); and 3) unlike state-of-the-art online DRL methods, PnP-DRL can be off and running without learning gaps, while achieving comparable performance.
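As a heavily simplified illustration of the batch training the abstract describes, the sketch below learns a bitrate policy purely from logged transitions, with no environment interaction. It is a tabular toy with invented states, actions, and rewards; the paper itself uses deep networks and a real QoE-based objective.

```python
# Tabular toy of the offline ("batch") training PnP-DRL relies on:
# learn Q purely from logged (state, action, reward, next_state)
# transitions, never touching the live system. States, actions, and
# rewards here are invented, not the paper's DASH formulation.

def batch_q_learning(transitions, n_states, n_actions,
                     alpha=0.1, gamma=0.9, epochs=50):
    """Fit a Q-table from a fixed batch of transitions."""
    q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(epochs):
        for s, a, r, s2 in transitions:
            q[s][a] += alpha * (r + gamma * max(q[s2]) - q[s][a])
    return q

# Toy log: the high bitrate (action 1) pays off only when the buffer
# is full (state 1); the safe bitrate (action 0) wins otherwise.
logged = [(0, 0, 1.0, 1), (0, 1, -1.0, 0),
          (1, 1, 2.0, 1), (1, 0, 0.5, 1)] * 25

q = batch_q_learning(logged, n_states=2, n_actions=2)
policy = [max(range(2), key=lambda a: q[s][a]) for s in range(2)]
print(policy)  # [0, 1]: safe bitrate when draining, high when full
```

The essential property, mirroring the abstract's claim, is that nothing in the training loop queries a live system; the batch is fixed before learning starts.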
Award ID(s):
1704662
NSF-PAR ID:
10300535
Author(s) / Creator(s):
Date Published:
Journal Name:
IEEE Journal on Selected Areas in Communications
Volume:
39
Issue:
8
ISSN:
1558-0008
Page Range / eLocation ID:
2476-2486
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Mostafa Sahraei-Ardakani; Mingxi Liu (Eds.)
    This paper explores the application of deep reinforcement learning (DRL) to create a coordinating mechanism between synchronous generators (SGs) and distributed energy resources (DERs) for improved primary frequency regulation. Renewable energy sources, such as wind and solar, can aid in frequency regulation of the grid. Without proper coordination among the sources, however, their participation only delays the SG governor response and prolongs frequency deviation. The proposed DRL application uses a deep deterministic policy gradient (DDPG) agent to create a generalized coordinating signal for DERs. The coordinating signal communicates the degree of distributed participation to the SG governor, resolving delayed governor response and reducing the system rate of change of frequency (ROCOF). The validity of the coordinating signal is demonstrated on a single-machine finite-bus system, and the use of DRL for signal creation is explored in an under-frequency event. While further validation in large systems is needed, the development of this concept shows promising results toward improved power grid stability.
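To make the coordination idea concrete, here is a toy, hypothetical swing-equation simulation. It is not the paper's model or its DDPG agent: a droop governor with first-order lag responds to a load step while DERs supply a fixed share, and a feedforward "coordinating signal" tells the governor that share so it does not back off as DER support masks the frequency error. All parameters are illustrative.

```python
# Toy swing-equation model (illustrative parameters, not the paper's):
# frequency deviation f integrates the power imbalance, and a droop
# governor with a first-order lag supplies mechanical power pg.
# DERs inject a fixed share `der` of the load step; when `coordinated`,
# a feedforward signal adds that share to the governor's target.

def worst_dip(load=0.1, der=0.05, coordinated=True,
              H=5.0, R=0.05, Tg=0.5, dt=0.01, steps=3000):
    """Return the deepest frequency dip (p.u.) after a load step."""
    f = pg = 0.0
    dip = 0.0
    for _ in range(steps):
        target = -f / R + (der if coordinated else 0.0)
        pg += dt * (target - pg) / Tg          # governor lag
        f += dt * (pg + der - load) / (2 * H)  # swing equation
        dip = min(dip, f)
    return -dip

print(worst_dip(coordinated=True) < worst_dip(coordinated=False))  # True
```

In this toy, the coordinated governor acts on the full imbalance from the start, so the frequency nadir is shallower, which is the qualitative effect the abstract attributes to the coordinating signal.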
  2. Network slicing allows mobile network operators to virtualize infrastructures and provide customized slices supporting various use cases with heterogeneous requirements. Online deep reinforcement learning (DRL) has shown promising potential in solving network problems and eliminating the simulation-to-reality discrepancy. Optimizing cross-domain resources with online DRL is, however, challenging, as the random exploration of DRL violates the service level agreements (SLAs) of slices and the resource constraints of infrastructures. In this paper, we propose OnSlicing, an online end-to-end network slicing system, to achieve minimal resource usage while satisfying slices' SLAs. OnSlicing allows individualized learning for each slice and maintains its SLA by using a novel constraint-aware policy update method and a proactive baseline switching mechanism. OnSlicing complies with the resource constraints of infrastructures through a unique design of action modification in slices and parameter coordination in infrastructures. OnSlicing further mitigates the poor performance of online learning during the early learning stage by offline imitating a rule-based solution. In addition, we design four new domain managers to enable dynamic resource configuration in radio access, transport, core, and edge networks, respectively, at a sub-second timescale. We implement OnSlicing on an end-to-end slicing testbed built on OpenAirInterface with both 4G LTE and 5G NR, the OpenDayLight SDN platform, and the OpenAir-CN core network. The experimental results show that OnSlicing achieves a 61.3% usage reduction compared with the rule-based solution and maintains a nearly zero violation rate (0.06%) throughout the online learning phase. Once online learning converges, OnSlicing reduces usage by 12.5% without any violations compared with the state-of-the-art online DRL solution.
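A minimal sketch of the "action modification" idea, under our own assumptions: before per-slice resource actions reach the infrastructure, they are proportionally scaled so their total never exceeds capacity. OnSlicing's actual coordination between slices and infrastructure is more involved; the scaling rule and numbers below are illustrative.

```python
# Hypothetical sketch of the "action modification" safeguard, under
# our own assumptions: per-slice resource requests are proportionally
# scaled down whenever their total would exceed the shared capacity,
# so the applied actions can never violate the infrastructure limit.

def modify_actions(requested, capacity):
    """Scale per-slice requests so their sum fits within capacity."""
    total = sum(requested)
    if total <= capacity:
        return list(requested)
    scale = capacity / total
    return [r * scale for r in requested]

# Three slices ask for 120 units on a 100-unit infrastructure.
safe = modify_actions([30.0, 50.0, 40.0], capacity=100.0)
print([round(s, 1) for s in safe])  # [25.0, 41.7, 33.3]
```

Enforcing constraints in the action path, rather than hoping the policy learns them, is what lets such a system explore online without violating infrastructure limits.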
  3. In this work, we propose an energy-adaptive monitoring system for a solar-sensor-based smart animal farm (e.g., cattle). The proposed smart farm system aims to maintain high-quality monitoring services by solar sensors with limited and fluctuating energy against a full set of cyberattack behaviors, including false data injection, message dropping, and protocol non-compliance. We leverage Subjective Logic (SL) as the belief model to consider different types of uncertainties in opinions about sensed data. We develop two deep reinforcement learning (DRL) schemes leveraging the design concept of uncertainty maximization in SL for DRL agents running on gateways to collect high-quality sensed data with low uncertainty and high freshness. We assess the performance of the proposed energy-adaptive smart farm system in terms of accumulated reward, monitoring error, system overload, and battery maintenance level. We compare the performance of the two DRL schemes developed (i.e., multi-agent deep Q-learning, MADQN, and multi-agent proximal policy optimization, MAPPO) with greedy and random baseline schemes in choosing the set of sensed data to be updated, with the goal of collecting high-quality sensed data and achieving resilience against attacks. Our experiments demonstrate that MAPPO with the uncertainty maximization technique outperforms its counterparts.
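For readers unfamiliar with Subjective Logic, a binomial SL opinion maps evidence counts to belief, disbelief, and uncertainty mass that sum to one. The sketch below shows that mapping and how an "uncertainty maximization" agent might rank sensors; the sensor names and selection rule are our illustrative assumptions, not the paper's gateway logic.

```python
# A minimal binomial Subjective Logic (SL) opinion: positive/negative
# evidence counts map to belief, disbelief, and uncertainty mass that
# sum to one (W=2 is the usual non-informative prior weight). The
# sensor names and selection rule below are illustrative assumptions.

def sl_opinion(pos, neg, W=2.0):
    """Return (belief, disbelief, uncertainty) from evidence counts."""
    k = pos + neg + W
    return pos / k, neg / k, W / k

# An "uncertainty maximization" agent would poll the sensor whose
# current opinion carries the most uncertainty mass.
opinions = {"sensor_a": sl_opinion(8, 0), "sensor_b": sl_opinion(1, 1)}
most_uncertain = max(opinions, key=lambda s: opinions[s][2])
print(most_uncertain)  # sensor_b (fewer observations)
```

Note how the lightly observed sensor carries more uncertainty mass (0.5 vs. 0.2), so polling it yields the largest expected reduction in uncertainty.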
  4.
    In recent years, reinforcement learning (RL), especially deep RL (DRL), has shown outstanding performance in video games from Atari and Mario to StarCraft. However, little evidence has shown that DRL can be successfully applied to real-life human-centric tasks such as education or healthcare. Unlike classic game-playing, where the RL goal is to make an agent smart, in human-centric tasks the ultimate RL goal is to make the human-agent interactions productive and fruitful. Additionally, in many real-life human-centric tasks, data can be noisy and limited. As a sub-field of RL, batch RL is designed for handling situations where data is limited yet noisy and building simulations is challenging. In two consecutive classroom studies, we investigated applying batch DRL to the task of pedagogical policy induction for an Intelligent Tutoring System (ITS) and empirically evaluated the effectiveness of the induced pedagogical policies. In Fall 2018 (F18), the DRL policy was compared against an expert-designed baseline policy, and in Spring 2019 (S19), we examined the impact of explaining the batch DRL-induced policy, compared with student decisions and the expert baseline policy. Our results showed that 1) while no significant difference was found between the batch RL-induced policy and the expert policy in F18, the batch RL-induced policy with simple explanations improved students' learning performance significantly more than the expert policy alone in S19; and 2) no significant differences were found between student decision making and the expert policy. Overall, our results suggest that pairing simple explanations with induced RL policies can be an important and effective technique for applying RL to real-life human-centric tasks.
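One standard batch-RL tool for gauging an induced policy's value from logged data alone is ordinary importance sampling. The sketch below is our illustrative addition (the studies above evaluated policies in live classrooms), with toy episodes and made-up policy tables.

```python
# Hypothetical sketch: ordinary importance sampling (OIS), a standard
# batch-RL estimator of what a newly induced policy would earn, using
# only interaction data logged under another (behavior) policy. The
# toy episodes and both policy tables are illustrative assumptions.

def ois_value(episodes, target, behavior):
    """Importance-weighted average return over logged episodes."""
    total = 0.0
    for ep in episodes:
        weight, ret = 1.0, 0.0
        for state, action, reward in ep:
            weight *= target[(state, action)] / behavior[(state, action)]
            ret += reward
        total += weight * ret
    return total / len(episodes)

# Logged by a uniform-random tutor policy; the induced policy strongly
# prefers the action that produced a learning gain in state 0.
episodes = [[(0, 1, 1.0)], [(0, 0, 0.0)]]
behavior = {(0, 0): 0.5, (0, 1): 0.5}
induced  = {(0, 0): 0.1, (0, 1): 0.9}

print(ois_value(episodes, induced, behavior))  # 0.9, vs 0.5 logged mean
```

Such off-policy estimates are one way to vet an induced pedagogical policy before deploying it in a classroom, since classroom trials are expensive.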
  5.
    In this paper, we present the design, implementation, and evaluation of a control framework, EXTRA (EXperience-driven conTRol frAmework), for scheduling in general-purpose Distributed Stream Data Processing Systems (DSDPSs). Our design is novel for the following reasons. First, EXTRA enables a DSDPS to dynamically change the number of threads on the fly according to system states and demands. Most existing methods, in contrast, use a fixed number of threads to carry the workload (for each processing unit of an application), which is specified by a user in advance and does not change at runtime. Our design thus introduces a whole new dimension of control in DSDPSs, which has great potential to significantly improve system flexibility and efficiency, but makes the scheduling problem much harder. Second, EXTRA leverages an experience/data-driven, model-free approach to dynamic control using the emerging Deep Reinforcement Learning (DRL), which enables a DSDPS to learn the best way to control itself from its own experience, just as a human learns a skill (such as driving or swimming) without any accurate and mathematically solvable model. We implemented EXTRA on a widely used DSDPS, Apache Storm, and evaluated its performance with three representative Stream Data Processing (SDP) applications: continuous queries, word count (stream version), and log stream processing. In particular, we performed experiments under realistic settings (where multiple application instances are mixed together), rather than the simplified setting (a single application instance) used in most related works. Extensive experimental results show that: 1) compared to Storm's default scheduler and the state-of-the-art model-based method, EXTRA substantially reduces average end-to-end tuple processing time, by 39.6% and 21.6% on average, respectively; 2) EXTRA leads to more flexible and efficient stream data processing by enabling the use of a variable number of threads; and 3) EXTRA is robust in highly dynamic environments with significant workload changes.
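To illustrate the new control dimension (choosing an operator's thread count at runtime), here is a toy model in which parallelism helps until contention dominates. The latency model and the exhaustive greedy choice stand in for EXTRA's DRL agent; both are purely illustrative assumptions.

```python
# Toy illustration of the control dimension EXTRA introduces: picking
# an operator's thread count at runtime. The latency model (parallel
# speedup until contention overhead dominates) and the greedy choice
# stand in for EXTRA's DRL agent; both are illustrative assumptions.

def latency(threads, workload=8.0, overhead=0.3):
    """Modeled tuple latency: parallel share plus contention cost."""
    return workload / threads + overhead * (threads - 1)

def pick_threads(max_threads=10):
    # A trained agent would map system state to a thread count; this
    # toy simply takes the count with the lowest modeled latency.
    return min(range(1, max_threads + 1), key=latency)

best = pick_threads()
print(best)  # 5: beyond this, contention outweighs extra parallelism
```

The point of the toy is that the best thread count is workload-dependent, which is why a fixed, user-specified setting leaves performance on the table and why learning it online is attractive.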