Title: Provably Efficient Third-Person Imitation from Offline Observation
Domain adaptation in imitation learning represents an essential step towards improving generalizability. However, even in the restricted setting of third-person imitation where transfer is between isomorphic Markov Decision Processes, there are no strong guarantees on the performance of transferred policies. We present problem-dependent, statistical learning guarantees for third-person imitation from observation in an offline setting, and a lower bound on performance in the online setting.
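As a purely illustrative sketch of the offline imitation-from-observation setting (not the paper's method or its guarantees), the snippet below recovers a tabular policy from expert state sequences alone, assuming known dynamics and a known state isomorphism; the names `phi`, `T_hat`, and the synthetic data are all hypothetical.

```python
import numpy as np

# Minimal sketch of imitation from observation in a tabular MDP, under
# assumptions not taken from the paper: the learner sees only expert state
# sequences (no actions), knows its own dynamics P[a, s, s'], and the expert's
# MDP relates to the learner's by a known state permutation `phi` (the
# "isomorphism"). Everything here is illustrative, not the authors' code.

rng = np.random.default_rng(0)
n_states, n_actions = 5, 3

# Known learner dynamics: P[a, s, :] is a distribution over next states.
P = rng.dirichlet(np.ones(n_states), size=(n_actions, n_states))

# Hypothetical state isomorphism from the expert's MDP to the learner's.
phi = rng.permutation(n_states)

# Offline expert observations: (state, next_state) pairs in the expert's own
# state space, generated randomly here just for the demo.
expert_pairs = [(s, rng.integers(n_states))
                for s in rng.integers(n_states, size=200)]

# Empirical expert transition frequencies, mapped through the isomorphism.
T_hat = np.zeros((n_states, n_states))
for s, s_next in expert_pairs:
    T_hat[phi[s], phi[s_next]] += 1.0
T_hat /= np.maximum(T_hat.sum(axis=1, keepdims=True), 1.0)

# Recover a policy by transition matching: in each state, pick the action
# whose known dynamics best explain the observed expert transitions.
policy = np.zeros((n_states, n_actions))
for s in range(n_states):
    errs = [np.abs(P[a, s] - T_hat[s]).sum() for a in range(n_actions)]
    policy[s, int(np.argmin(errs))] = 1.0

print(policy)
```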
Award ID(s):
1816753
PAR ID:
10185412
Author(s) / Creator(s):
;
Date Published:
Journal Name:
Uncertainty in Artificial Intelligence
ISSN:
1525-3384
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1.
    Domain adaptation in imitation learning represents an essential step towards improving generalizability. However, even in the restricted setting of third-person imitation where transfer is between isomorphic Markov Decision Processes, there are no strong guarantees on the performance of transferred policies. We present problem-dependent, statistical learning guarantees for third-person imitation from observation in an offline setting, and a lower bound on performance in an online setting.
    Egocentric and exocentric perspectives of human action differ significantly, yet overcoming this extreme viewpoint gap is critical in augmented reality and robotics. We propose ViewpointRosetta, an approach that unlocks large-scale unpaired ego and exo video data to learn clip-level viewpoint-invariant video representations. Our framework introduces (1) a diffusion-based Rosetta Stone Translator (RST), which, leveraging a moderate amount of synchronized multi-view videos, serves as a translator in feature space to decipher the alignment between unpaired ego and exo data, and (2) a dual encoder that aligns unpaired data representations through contrastive learning with RST-based synthetic feature augmentation and soft alignment. To evaluate the learned features in a standardized setting, we construct a new cross-view benchmark using Ego-Exo4D, covering cross-view retrieval, action recognition, and skill assessment tasks. Our framework demonstrates superior cross-view understanding compared to previous view-invariant learning and ego video representation learning approaches, and opens the door to bringing vast amounts of traditional third-person video to bear on the more nascent first-person setting.
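    As a loose illustration of the contrastive alignment idea (not the authors' implementation), the sketch below pairs each ego clip with a synthetic exo positive produced by a stand-in translator and applies an InfoNCE loss; the encoders, `rst_translate`, and the temperature are all assumed placeholders.

    ```python
    import torch
    import torch.nn.functional as F

    # Illustrative sketch of dual-encoder alignment with synthetic feature
    # augmentation. `rst_translate` is a stand-in for the diffusion-based RST:
    # here just a linear map used to generate cross-view pseudo-positives.
    dim, batch = 128, 32
    ego_encoder = torch.nn.Linear(dim, dim)    # placeholder encoders
    exo_encoder = torch.nn.Linear(dim, dim)
    rst_translate = torch.nn.Linear(dim, dim)  # stand-in for the RST

    ego_feats = torch.randn(batch, dim)        # unpaired ego clip features
    z_ego = F.normalize(ego_encoder(ego_feats), dim=-1)

    # Synthetic augmentation: translate ego features into the exo view so each
    # ego clip gains a pseudo-paired exo positive, then encode and normalize.
    z_exo_synth = F.normalize(exo_encoder(rst_translate(ego_feats)), dim=-1)

    # InfoNCE: the i-th ego clip should match its own synthetic exo feature
    # and repel the rest of the batch (tau is an assumed hyperparameter).
    tau = 0.07
    logits = z_ego @ z_exo_synth.t() / tau
    labels = torch.arange(batch)
    loss = F.cross_entropy(logits, labels)
    loss.backward()
    ```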
  3. Storkel, Holly L; Mills, Monique M (Ed.)
    Purpose: The purpose of this assessment-focused clinical focus article is to increase familiarity with African American English (AAE)–speaking children’s pattern of language use in third-person singular contexts and to discuss implications for speech-language assessments of developing AAE-speaking children. Method: The clinical focus draws on descriptive case study data from four typically developing child speakers of AAE who are between the ages of 3 and 5 years. The children’s data from three different sources—sentence imitation, story retell, and play-based language samples—were subjected to linguistic analyses. Results: The three sources of linguistic data offered different insights into the children’s production of –s and other linguistic patterns in third-person singular contexts. Conclusions: This study underscores the importance of exploring developing child AAE from a descriptive approach to reveal different types of information about patterns of morphological marking in different linguistic contexts, which is crucial in assessing developing AAE. Implications for language assessment are discussed.
    We consider the imitation learning problem of learning a policy in a Markov Decision Process (MDP) setting where the reward function is not given, but demonstrations from experts are available. Although the goal of imitation learning is to learn a policy that produces behaviors nearly as good as the experts’ for a desired task, assumptions of consistent optimality for demonstrated behaviors are often violated in practice. Finding a policy that is distributionally robust against noisy demonstrations, based on an adversarial construction, potentially solves this problem by avoiding optimistic generalizations of the demonstrated data. This paper studies Distributionally Robust Imitation Learning (DRoIL) and establishes a close connection between DRoIL and Maximum Entropy Inverse Reinforcement Learning. We show that DRoIL can be seen as a framework that maximizes a generalized concept of entropy. We develop a novel approach to transform the objective function into a convex optimization problem over a polynomial number of variables for a class of loss functions that are additive over state and action spaces. Our approach lets us optimize both stationary and non-stationary policies and, unlike prevalent previous methods, does not require repeatedly solving an inner reinforcement learning problem. We experimentally show the significant benefits of DRoIL’s new optimization method on synthetic data and a highway driving environment.
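    To ground the maximum-entropy connection, here is a hedged sketch (not DRoIL itself) of a convex program over occupancy measures: maximize entropy subject to Bellman flow and expert feature-matching constraints. The dynamics `P`, features `F_feat`, and the "expert" are synthetic stand-ins.

    ```python
    import cvxpy as cp
    import numpy as np

    # Maximum-entropy occupancy matching in a small MDP: an illustration of
    # the entropy-maximization view mentioned above, not DRoIL's objective.
    rng = np.random.default_rng(1)
    nS, nA, nF, gamma = 4, 2, 3, 0.9

    P = rng.dirichlet(np.ones(nS), size=(nS, nA))   # P[s, a, s'] transitions
    F_feat = rng.standard_normal((nS, nA, nF))      # per-(s, a) features
    rho0 = np.full(nS, 1.0 / nS)                    # initial distribution

    # Build a feasible "expert": occupancy of a random policy pi, solving
    # d = (1 - gamma) * rho0 + gamma * P_pi^T d, then mu(s, a) = d(s) pi(s, a).
    pi = rng.dirichlet(np.ones(nA), size=nS)
    P_pi = np.einsum('sa,sap->sp', pi, P)
    d = np.linalg.solve(np.eye(nS) - gamma * P_pi.T, (1 - gamma) * rho0)
    mu_expert = d[:, None] * pi
    f_expert = np.tensordot(mu_expert, F_feat, axes=([0, 1], [0, 1]))

    mu = cp.Variable((nS, nA), nonneg=True)         # occupancy measure

    # Bellman flow: total mass leaving s' equals discounted mass flowing in.
    constraints = [
        cp.sum(mu[sp, :]) ==
        (1 - gamma) * rho0[sp] + gamma * cp.sum(cp.multiply(mu, P[:, :, sp]))
        for sp in range(nS)
    ]
    # Match the expert's feature expectations, one equality per feature.
    for k in range(nF):
        constraints.append(
            cp.sum(cp.multiply(mu, F_feat[:, :, k])) == f_expert[k])

    problem = cp.Problem(cp.Maximize(cp.sum(cp.entr(mu))), constraints)
    problem.solve()
    print("status:", problem.status)
    ```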
    When applying imitation learning techniques to fit a policy from expert demonstrations, one can take advantage of prior stability/robustness assumptions on the expert's policy and incorporate such control-theoretic prior knowledge explicitly into the learning process. In this paper, we formulate the imitation learning of linear policies as a constrained optimization problem, and present efficient methods which can be used to enforce stability and robustness constraints during the learning process. Specifically, we show that one can guarantee the closed-loop stability and robustness by posing linear matrix inequality (LMI) constraints on the fitted policy. Then both the projected gradient descent method and the alternating direction method of multipliers (ADMM) method can be applied to solve the resultant constrained policy fitting problem. Finally, we provide numerical results to demonstrate the effectiveness of our methods in producing linear policies with various stability and robustness guarantees.
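    For flavor, a minimal sketch of constrained policy fitting under assumptions not taken from the paper: we fit a linear gain to synthetic expert data while enforcing sigma_max(A + BK) <= 1 - eps, a conservative spectral-norm bound that is itself expressible as an LMI via a Schur complement and certifies closed-loop Schur stability. The matrices `A`, `B` and the data `(X, U)` are stand-ins.

    ```python
    import cvxpy as cp
    import numpy as np

    # Fit a linear state-feedback gain K to expert (state, action) data while
    # enforcing a convex stability certificate: sigma_max(A + B K) < 1 implies
    # the spectral radius of A + B K is below 1, so x_{t+1} = (A + B K) x_t is
    # Schur stable. This is a conservative surrogate, not the paper's exact
    # formulation, and all data below are synthetic.
    rng = np.random.default_rng(2)
    n, m, T = 3, 2, 50
    A = 0.9 * rng.standard_normal((n, n)) / np.sqrt(n)
    B = rng.standard_normal((n, m))

    K_true = 0.3 * rng.standard_normal((m, n))       # pretend expert gain
    X = rng.standard_normal((T, n))                  # observed states
    U = X @ K_true.T + 0.05 * rng.standard_normal((T, m))  # noisy actions

    K = cp.Variable((m, n))
    eps = 1e-3
    fit = cp.norm(X @ K.T - U, 'fro')                # policy fitting loss
    stability = [cp.sigma_max(A + B @ K) <= 1 - eps] # convex stability bound

    problem = cp.Problem(cp.Minimize(fit), stability)
    problem.solve()

    rho = max(abs(np.linalg.eigvals(A + B @ K.value)))
    print("closed-loop spectral radius:", rho)       # certified < 1
    ```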