f-GAIL: Learning f-Divergence for Generative Adversarial Imitation Learning

Zhang, Xin; Li, Yanhua; Zhang, Ziming; Zhang, Zhi-Li

Citation Details

Imitation learning (IL) aims to learn a policy from expert demonstrations that minimizes the discrepancy between the learner and expert behaviors. Various imitation learning algorithms have been proposed with different pre-determined divergences to quantify the discrepancy. This naturally gives rise to the following question: Given a set of expert demonstrations, which divergence can recover the expert policy more accurately with higher data efficiency? In this work, we propose f-GAIL – a new generative adversarial imitation learning model – that automatically learns a discrepancy measure from the f-divergence family as well as a policy capable of producing expert-like behaviors. Compared with IL baselines with various predefined divergence measures, f-GAIL learns better policies with higher data efficiency in six physics-based control tasks. more »

Award ID(s):: 1942680 1952085

PAR ID:: 10225172

Author(s) / Creator(s):: Zhang, Xin; Li, Yanhua; Zhang, Ziming; Zhang, Zhi-Li

Date Published:: 2020-01-01

Journal Name:: Advances in neural information processing systems

ISSN:: 1049-5258

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this