NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Optimizing Percentile Criterion Using Robust MDPs

Bahram Behzadian, Reazul Hasan (January 2021, Proceedings of Machine Learning Research)

Full Text Available
Fast Algorithms for L∞-Constrained S-Rectangular Robust MDPs

Bahram Behzadian, Marek Petrik (January 2021, Advances in neural information processing systems)

Robust Markov decision processes (RMDPs) are a useful building block of robust reinforcement learning algorithms but can be hard to solve. This paper proposes a fast, exact algorithm for computing the Bellman operator for S-rectangular robust Markov decision processes with L∞-constrained rectangular ambiguity sets. The algorithm combines a novel homotopy continuation method with a bisection method to solve S-rectangular ambiguity in quasi-linear time in the number of states and actions. The algorithm improves on the cubic time required by leading general linear programming methods. Our experimental results confirm the practical viability of our method and show that it outperforms a leading commercial optimization package by several orders of magnitude.
more » « less
Full Text Available
Policy Gradient Bayesian Robust Optimization for Imitation Learning

Zaynah Javed, Daniel S. (January 2021, International Conference on Machine Learning)

Full Text Available
Robust Behavior Cloning with Adversarial Demonstration Detection

Mostafa Hussein, Brendan Crowe (January 2021, Proceedings of the IEEERSJ International Conference on Intelligent Robots and Systems)

Full Text Available
Partial Policy Iteration for L1-Robust Markov Decision Processes

Chin Pang Ho, Marek Petrik (January 2021, Journal of machine learning research)

Robust Markov decision processes (MDPs) compute reliable solutions for dynamic decision problems with partially-known transition probabilities. Unfortunately, accounting for uncertainty in the transition probabilities significantly increases the computational complexity of solving robust MDPs, which limits their scalability. This paper describes new, efficient algorithms for solving the common class of robust MDPs with s- and sa-rectangular ambiguity sets defined by weighted L1 norms. We propose partial policy iteration, a new, efficient, flexible, and general policy iteration scheme for robust MDPs. We also propose fast methods for computing the robust Bellman operator in quasi-linear time, nearly matching the ordinary Bellman operator's linear complexity. Our experimental results indicate that the proposed methods are many orders of magnitude faster than the state-of-the-art approach, which uses linear programming solvers combined with a robust value iteration.
more » « less
Full Text Available
Bayesian Robust Optimization for Imitation Learning

Daniel S. Brown, Scott Niekum (January 2020, Advances in neural information processing systems)

Full Text Available
Local management in a regional context: Simulations with process-based species distribution models

https://doi.org/10.1016/j.ecolmodel.2019.108827

Szewczyk, Tim M.; Lee, Tom; Ducey, Mark J.; Aiello-Lammens, Matthew E.; Bibaud, Hayley; Allen, Jenica M. (December 2019, Ecological Modelling)

Full Text Available
Fast Feature Selection for Linear Value Function Approximation

Bahram Behzadian, Soheil Gharatappeh (July 2019, International Conference on Autonomous Planning and Scheduling)

Full Text Available
Fast Feature Selection for Linear Value Function Approximation

Bahram Behzadian, Soheil Gharatappeh (July 2019, ICAPS)

Full Text Available
Inverse Reinforcement Learning of Interaction Dynamics from Demonstrations

Mostafa Hussein, Momotaz Begum (April 2019, International Conference on Robotics and Automation (ICRA))

Inverse Reinforcement Learning of Interaction Dynamics fromDemonstrations
more » « less
Full Text Available

« Prev Next »

Search for: All records