Learning Task Informed Abstractions

Fu, Xiang; Yang, Ge; Agrawal, Pulkit; Jaakkola, Tommi

Citation Details

Current model-based reinforcement learning methods struggle when operating from complex visual scenes due to their inability to prioritize task-relevant features. To mitigate this prob- lem, we propose learning Task Informed Ab- stractions (TIA) that explicitly separates reward- correlated visual features from distractors. For learning TIA, we introduce the formalism of Task Informed MDP (TiMDP) that is realized by train- ing two models that learn visual features via coop- erative reconstruction, but one model is adversari- ally dissociated from the reward signal. Empirical evaluation shows that TIA leads to significant per- formance gains over state-of-the-art methods on many visual control tasks where natural and un- constrained visual distractions pose a formidable challenge. Project page: https://xiangfu.co/tia more »

Award ID(s):: 2019786

PAR ID:: 10280772

Author(s) / Creator(s):: Fu, Xiang; Yang, Ge; Agrawal, Pulkit; Jaakkola, Tommi

Date Published:: 2021-01-01

Journal Name:: Proceedings of the 38th International Conference on Machine Learning

Volume:: 139

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this