Adaptive Auxiliary Task Weighting for Reinforcement Learning

Lin, Xingyu; Baweja, Harjatin Singh; Kantor, George; Held, David

Citation Details

Reinforcement learning is known to be sample inefficient, preventing its application to many real-world problems, especially with high dimensional observations like images. Transferring knowledge from other auxiliary tasks is a powerful tool for improving the learning efficiency. However, the usage of auxiliary tasks has been limited so far due to the difficulty in selecting and combining different auxiliary tasks. In this work, we propose a principled online learning algorithm that dynam- ically combines different auxiliary tasks to speed up training for reinforcement learning. Our method is based on the idea that auxiliary tasks should provide gradient directions that, in the long term, help to decrease the loss of the main task. We show in various environments that our algorithm can effectively combine a variety of different auxiliary tasks and achieves significant speedup compared to previous heuristic approaches of adapting auxiliary task weights. more »

Award ID(s):: 1849154

PAR ID:: 10159738

Author(s) / Creator(s):: Lin, Xingyu; Baweja, Harjatin Singh; Kantor, George; Held, David

Date Published:: 2019-12-01

Journal Name:: Advances in neural information processing systems

Volume:: 32

ISSN:: 1049-5258

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this