Actor-Critic PAC Robust Policy Search

Sheckells, Matthew; Garimella, Gowtham; Michra, Subhransu; Kobilarov, Marin

Citation Details

This work studies an approach for computing provably robust control laws for robotic systems operating in uncertain environments. We develop an actor-critic style policy search algorithm based on the idea of minimizing an upper confidence bound on the negative expected advantage of a control policy at each policy update iteration. This new algorithm is a reformulation of Probably-Approximately-Correct Robust Policy Search (PROPS) and, unlike PROPS, allows for both step-based evaluation and step-based sampling strategies in policy parameter space, enabled by the use of Generalized Advantage Estimation and Generalized Exploration. As a result, the new algorithm is more data efficient and is expected to compute higher quality policies faster. We empirically evaluate the algorithm in simulation on a challenging robot navigation task using a high-fidelity deep stochastic model of an agile ground vehicle and compare its performance to the original trajectory-based PROPS more »

Award ID(s):: 1637949

PAR ID:: 10136847

Author(s) / Creator(s):: Sheckells, Matthew; Garimella, Gowtham; Michra, Subhransu; Kobilarov, Marin

Date Published:: 2019-05-01

Journal Name:: ICRA 2019

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this