NSF PAR Search | NSF Public Access Repository


MABL: Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent Reinforcement Learning

Venugopal, Aravind; Milani, Stephanie; Fang, Fei; Ravindran, Balaraman (May 2024, AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems)

Full Text Available
Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games

https://doi.org/10.1145/3544548.3581348

Milani, Stephanie; Juliani, Arthur; Momennejad, Ida; Georgescu, Raluca; Rzepecki, Jaroslaw; Shaw, Alison; Costello, Gavin; Fang, Fei; Devlin, Sam; Hofmann, Katja (April 2023, ACM)

Full Text Available
Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods

Topin, Nicholay; Milani, Stephanie; Fang, Fei; Veloso, Manuela (May 2021, Proceedings of the AAAI Conference on Artificial Intelligence)

Full Text Available
Planning with Abstract Learned Models While Learning Transferable Subtasks

https://doi.org/10.1609/aaai.v34i06.6555

Winder, John; Milani, Stephanie; Landen, Matthew; Oh, Erebus; Parr, Shane; Squire, Shawn; desJardins, Marie; Matuszek, Cynthia (February 2020, Proceedings of the AAAI Conference on Artificial Intelligence)

We introduce an algorithm for model-based hierarchical reinforcement learning to acquire self-contained transition and reward models suitable for probabilistic planning at multiple levels of abstraction. We call this framework Planning with Abstract Learned Models (PALM). By representing subtasks symbolically using a new formal structure, the lifted abstract Markov decision process (L-AMDP), PALM learns models that are independent and modular. Through our experiments, we show how PALM integrates planning and execution, facilitating rapid and efficient learning of abstract, hierarchical models. We also demonstrate the increased potential for learned models to be transferred to new and related tasks.
Full Text Available
