We propose State Matching Offline Distribution Correction Estimation (SMODICE), a novel and versatile regression-based offline imitation learning (IL) algorithm derived via state-occupancy matching. We show that the SMODICE objective admits a simple optimization procedure through an application of Fenchel duality, as well as an analytic solution in tabular MDPs. Without requiring access to expert actions, SMODICE can be effectively applied to three offline IL settings: (i) imitation from observations (IfO), (ii) IfO with a dynamics- or morphologically-mismatched expert, and (iii) example-based reinforcement learning, which we show can be formulated as a state-occupancy matching problem. We extensively evaluate SMODICE on both gridworld environments and high-dimensional offline benchmarks. Our results demonstrate that SMODICE is effective for all three problem settings and significantly outperforms the prior state of the art.
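As a rough sketch of the state-occupancy matching idea behind SMODICE (our notation; the paper's exact derivation may differ), the KL instantiation can be upper-bounded by an offline-regularized RL problem whose Fenchel dual is an unconstrained objective over a value function, trained only on the offline dataset and a discriminator-based reward:

$$
D_{\mathrm{KL}}\big(d^\pi(s)\,\|\,d^E(s)\big) \;\le\; \mathbb{E}_{s \sim d^\pi}\Big[\log \tfrac{d^O(s)}{d^E(s)}\Big] \;+\; D_{\mathrm{KL}}\big(d^\pi(s,a)\,\|\,d^O(s,a)\big),
$$

where $d^\pi$, $d^E$, $d^O$ denote the imitator, expert, and offline-dataset occupancies. Treating $R(s) := \log \tfrac{d^E(s)}{d^O(s)}$ (estimated by a discriminator over states alone, hence no expert actions are needed) as a reward, Fenchel duality yields, for the KL case,

$$
\min_V \;(1-\gamma)\,\mathbb{E}_{s \sim \mu_0}[V(s)] \;+\; \log \mathbb{E}_{(s,a,s') \sim d^O}\big[\exp\big(R(s) + \gamma V(s') - V(s)\big)\big],
$$

after which a policy can be recovered by regression weighted by the resulting optimal importance ratios.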
ROIL: Robust Offline Imitation Learning without Trajectories
We study the problem of imitation learning via inverse reinforcement learning, where the agent attempts to learn an expert's policy from a dataset of collected state-action tuples. We derive a new Robust model-based Offline Imitation Learning method (ROIL) that mitigates covariate shift by avoiding estimating the expert's occupancy frequency. In offline settings there is frequently insufficient data to reliably estimate the expert's occupancy frequency, which leads to models that do not generalize well. ROIL is guaranteed to recover the expert's occupancy frequency and is efficiently solvable as an LP. We demonstrate ROIL's ability to achieve minimal regret in large environments under covariate shift, such as when the state visitation frequency of the demonstrations does not come from the expert.
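As a loose illustration of the "efficiently solvable as an LP" claim (this is not ROIL's formulation; it only shows the standard occupancy-measure polytope that such LPs are built on, with a made-up toy MDP and surrogate reward), the sketch below solves for an occupancy measure under Bellman flow constraints with scipy.optimize.linprog:

```python
# Toy illustration (not ROIL itself): occupancy measures d(s, a) of a tabular MDP
# form a polytope defined by Bellman flow constraints, so any objective that is
# linear in d can be handed to an off-the-shelf LP solver.
import numpy as np
from scipy.optimize import linprog

n_states, n_actions, gamma = 2, 2, 0.95
# P[s, a, s'] = transition probability; mu0 = initial state distribution.
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.8, 0.2], [0.1, 0.9]]])
mu0 = np.array([1.0, 0.0])

# Hypothetical surrogate reward favouring state-action pairs seen in demonstrations.
r = np.array([[1.0, 0.0],
              [0.0, 1.0]])

# Flow constraints: for every s,
#   sum_a d(s, a) - gamma * sum_{s', a'} P(s | s', a') d(s', a') = (1 - gamma) * mu0(s).
A_eq = np.zeros((n_states, n_states * n_actions))
for s in range(n_states):
    for sp in range(n_states):
        for a in range(n_actions):
            A_eq[s, sp * n_actions + a] -= gamma * P[sp, a, s]
    for a in range(n_actions):
        A_eq[s, s * n_actions + a] += 1.0
b_eq = (1 - gamma) * mu0

# linprog minimises, so negate the linear objective <r, d>.
res = linprog(c=-r.flatten(), A_eq=A_eq, b_eq=b_eq, bounds=(0, None))
d = res.x.reshape(n_states, n_actions)
policy = d / d.sum(axis=1, keepdims=True)   # recover a policy from the occupancy
print("occupancy:\n", d, "\npolicy:\n", policy)
```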
- Award ID(s):
- 2416761
- PAR ID:
- 10609078
- Publisher / Repository:
- Reinforcement Learning Journal
- Date Published:
- Journal Name:
- Reinforcement Learning Journal
- ISSN:
- 2996-8577
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Offline goal-conditioned reinforcement learning (GCRL) promises general-purpose skill learning in the form of reaching diverse goals from purely offline datasets. We propose Goal-conditioned f-Advantage Regression (GoFAR), a novel regression-based offline GCRL algorithm derived from a state-occupancy matching perspective; the key intuition is that the goal-reaching task can be formulated as a state-occupancy matching problem between a dynamics-abiding imitator agent and an expert agent that directly teleports to the goal. In contrast to prior approaches, GoFAR does not require any hindsight relabeling and enjoys uninterleaved optimization for its value and policy networks. These distinct features confer GoFAR with much better offline performance and stability, as well as a statistical performance guarantee that is unattainable for prior methods. Furthermore, we demonstrate that GoFAR's training objectives can be re-purposed to learn an agent-independent goal-conditioned planner from purely offline source-domain data, which enables zero-shot transfer to new target domains. Through extensive experiments, we validate GoFAR's effectiveness in various problem settings and tasks, significantly outperforming the prior state of the art. Notably, on a real robotic dexterous manipulation task, while no other method makes meaningful progress, GoFAR acquires complex manipulation behavior that successfully accomplishes diverse goals.
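Roughly, and in our own notation rather than the paper's (the exact objectives should be checked against the original), the two uninterleaved stages can be sketched as a dual value objective over offline data followed by an f-advantage-weighted policy regression:

$$
\min_V \;(1-\gamma)\,\mathbb{E}_{g,\,s \sim \mu_0}\big[V(s; g)\big] \;+\; \mathbb{E}_{g,\,(s,a,s') \sim d^O}\Big[f^{*}\big(R(s; g) + \gamma V(s'; g) - V(s; g)\big)\Big],
$$
$$
\max_\pi \;\mathbb{E}_{g,\,(s,a,s') \sim d^O}\Big[(f^{*})'\big(R(s; g) + \gamma V(s'; g) - V(s; g)\big)\,\log \pi(a \mid s, g)\Big],
$$

where $f^{*}$ is the convex conjugate of the chosen $f$-divergence and $R(s; g)$ is a goal-conditioned reward (e.g. discriminator-based); the value stage never queries the policy, which is why the two optimizations need not be interleaved.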
-
When applying imitation learning techniques to fit a policy from expert demonstrations, one can take advantage of prior stability/robustness assumptions on the expert's policy and incorporate such control-theoretic prior knowledge explicitly into the learning process. In this paper, we formulate the imitation learning of linear policies as a constrained optimization problem, and present efficient methods which can be used to enforce stability and robustness constraints during the learning process. Specifically, we show that one can guarantee closed-loop stability and robustness by posing linear matrix inequality (LMI) constraints on the fitted policy. Then both the projected gradient descent method and the alternating direction method of multipliers (ADMM) can be applied to solve the resultant constrained policy fitting problem. Finally, we provide numerical results to demonstrate the effectiveness of our methods in producing linear policies with various stability and robustness guarantees.
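As a deliberately conservative toy sketch of the LMI idea (not the paper's formulation or solvers; the system matrices, synthetic demonstrations, and the use of cvxpy are assumptions for illustration), fixing the Lyapunov matrix to the identity turns a sufficient continuous-time stability condition into a single LMI that is linear in the policy gain K, so the least-squares policy fit stays convex:

```python
# Conservative toy sketch (not the paper's method): fit a linear policy u = K x
# to demonstrations (X, U) by least squares while enforcing the LMI
#   (A + B K) + (A + B K)^T < 0,
# which (with Lyapunov matrix fixed to I) guarantees A + B K is Hurwitz.
import numpy as np
import cvxpy as cp

A = np.array([[-1.0,  0.2,  0.1],      # made-up system matrices
              [ 0.0, -0.8,  0.2],
              [ 0.1,  0.0, -0.9]])
B = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [0.5, 0.5]])
n, m = B.shape

rng = np.random.default_rng(0)
T = 200
X = rng.standard_normal((T, n))                           # demonstrated states
K_expert = np.array([[0.1, -0.2, 0.0],
                     [0.0,  0.3, -0.1]])
U = X @ K_expert.T + 0.01 * rng.standard_normal((T, m))   # demonstrated actions

K = cp.Variable((m, n))
S = cp.Variable((n, n), symmetric=True)                   # symmetric closed-loop term
Acl = A + B @ K
eps = 1e-3
constraints = [S == Acl + Acl.T,                          # S = Acl + Acl^T
               S << -eps * np.eye(n)]                     # LMI on the fitted policy
objective = cp.Minimize(cp.sum_squares(X @ K.T - U))      # policy fitting loss
cp.Problem(objective, constraints).solve(solver=cp.SCS)

print("fitted K:\n", K.value)
print("closed-loop eigenvalues:", np.linalg.eigvals(A + B @ K.value))
```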
-
Covariate shift is a prevalent setting for supervised learning in the wild, where the training and test data are drawn from different time periods, from different but related domains, or via different sampling strategies. This paper addresses a transfer learning setting with covariate shift between source and target domains. Most existing methods for correcting covariate shift exploit density ratios of the features to reweight the source-domain data; when the features are high-dimensional, the estimated density ratios may suffer from large estimation variance, leading to poor prediction performance under covariate shift. In this work, we investigate how covariate shift correction performance depends on the dimensionality of the features, and propose a correction method that finds a low-dimensional representation of the features, taking into account the features relevant to the target Y, and exploits the density ratio of this representation for importance reweighting. We discuss the factors that affect the performance of our method, and demonstrate its capabilities on both pseudo-real data and real-world applications.
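As a generic illustration of density-ratio importance reweighting under covariate shift (a standard recipe, not the paper's representation-learning method; the data below is synthetic and the scikit-learn usage is only a sketch), a probabilistic domain classifier estimates the ratio p_target(x)/p_source(x) up to a class-prior constant, and those ratios become per-sample weights when fitting the predictor on source data:

```python
# Generic density-ratio reweighting under covariate shift (illustrative only; the
# paper's method additionally learns a low-dimensional representation first).
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Synthetic source/target domains with shifted input distributions but the
# same conditional p(y | x).
X_src = rng.normal(loc=0.0, scale=1.0, size=(2000, 5))
X_tgt = rng.normal(loc=1.0, scale=1.0, size=(2000, 5))
true_w = np.array([1.5, -2.0, 0.5, 0.0, 1.0])
y_src = (X_src @ true_w + 0.1 * rng.standard_normal(2000) > 0).astype(int)

# 1) Train a domain classifier: label 0 = source, 1 = target.
X_dom = np.vstack([X_src, X_tgt])
d_dom = np.r_[np.zeros(len(X_src)), np.ones(len(X_tgt))]
domain_clf = LogisticRegression(max_iter=1000).fit(X_dom, d_dom)

# 2) Density ratio p_tgt(x) / p_src(x) is proportional to P(target | x) / P(source | x)
#    (the class-prior factor is 1 here because both domains have equal sample sizes).
p = domain_clf.predict_proba(X_src)
weights = p[:, 1] / p[:, 0]
weights *= len(weights) / weights.sum()        # normalize to mean 1

# 3) Fit the actual predictor on source data, importance-weighted.
model = LogisticRegression(max_iter=1000).fit(X_src, y_src, sample_weight=weights)
```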