Policy-Conditioned Uncertainty Sets for Robust Markov Decision Processes

Tirinzoni, Andrea; Petrik, Marek; Chen, Xiangli; Ziebart, Brian D

Citation Details

What policy should be employed in a Markov decision process with uncertain parameters? Robust optimization answer to this question is to use rectangular uncertainty sets, which independently reflect available knowledge about each state, and then obtains a decision policy that maximizes expected reward for the worst-case decision process parameters from these uncertainty sets. While this rectangularity is convenient computationally and leads to tractable solutions, it often produces policies that are too conservative in practice, and does not facilitate knowledge transfer between portions of the state space or across related decision processes. In this work, we propose non-rectangular uncertainty sets that bound marginal moments of state-action features defined over entire trajectories through a decision process. This enables generalization to different portions of the state space while retaining appropriate uncertainty of the decision process. We develop algorithms for solving the resulting robust decision problems, which reduce to finding an optimal policy for a mixture of decision processes, and demonstrate the benefits of our approach experimentally. more »

Award ID(s):: 1652530

PAR ID:: 10098117

Author(s) / Creator(s):: Tirinzoni, Andrea; Petrik, Marek; Chen, Xiangli; Ziebart, Brian D

Date Published:: 2018-12-15

Journal Name:: Neural Information Processing Systems

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this