NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Robust Individualistic Learning in Many-Agent Systems

https://doi.org/10.1007/978-3-031-77367-9_22

He, Keyang; Doshi, Prashant; Banerjee, Bikramjit (November 2024, Springer Nature Switzerland)

Full Text Available
Open Human-Robot Collaboration Systems (OHRCS): A Research Perspective

https://doi.org/10.1109/CogMI62246.2024.00023

Suresh, Prasanth Sengadu; Romeres, Diego; Doshi, Prashant; Jain, Siddarth (October 2024, IEEE)

Full Text Available
Open Human-Robot Collaboration using Decentralized Inverse Reinforcement Learning

https://doi.org/10.1109/IROS58592.2024.10801889

Suresh, Prasanth Sengadu; Jain, Siddarth; Doshi, Prashant; Romeres, Diego (October 2024, IEEE)

Full Text Available
Marginal MAP Estimation for Inverse RL under Occlusion with Observer Noise

Suresh, Prasanth Sengadu; Doshi, Prashant (August 2022, 38th Conference on Uncertainty in Artificial Intelligence (UAI))

Full Text Available
A Hierarchical Bayesian Process for Inverse RL in Partially-Controlled Environments

Bogert, Kenneth; Doshi, Prashant (May 2022, AAMAS Conference proceedings)

Full Text Available
Decision making in open agent systems

https://doi.org/10.1002/aaai.12131

Eck, Adam; Soh, Leen‐Kiat; Doshi, Prashant (October 2023, AI Magazine)

Abstract In many real‐world applications of AI, the set of actors and tasks are not constant, but instead change over time. Robots tasked with suppressing wildfires eventually run out of limited suppressant resources and need to temporarily disengage from the collaborative work in order to recharge, or they might become damaged and leave the environment permanently. In a large business organization, objectives and goals change with the market, requiring workers to adapt to perform different sets of tasks across time. We call these multiagent systems (MAS)open agent systems(OASYS), and theopennessof the sets of agents and tasks necessitates new capabilities and modeling for decision making compared to planning and learning inclosedenvironments. In this article, we discuss three notions of openness: agent openness, task openness, and type openness. We also review the past and current research on addressing the novel challenges brought about by openness in OASYS. We share lessons learned from these efforts and suggest directions for promising future work in this area. We also encourage the community to engage and participate in this area of MAS research to address critical real‐world problems in the application of AI to enhance our daily lives.
more » « less
Decision-Theoretic Planning with Communication in Open Multiagent Systems

Kakarlapudi, Anirudh; Anil, Gayathri; Eck, Adam; Doshi, Prashant; Soh, Leen-Kiat (August 2022, Uncertainty in artificial intelligence)

In open multiagent systems, the set of agents operating in the environment changes over time and in ways that are nontrivial to predict. For example, if collaborative robots were tasked with fighting wildfires, they may run out of suppressants and be temporarily unavailable to assist their peers. Because an agent's optimal action depends on the actions of others, each agent must not only predict the actions of its peers, but, before that, reason whether they are even present to perform an action. Addressing openness thus requires agents to model each other’s presence, which can be enhanced through agents communicating about their presence in the environment. At the same time, communicative acts can also incur costs (e.g., consuming limited bandwidth), and thus an agent must tradeoff the benefits of enhanced coordination with the costs of communication. We present a new principled, decision-theoretic method in the context provided by the recent communicative interactive POMDP framework for planning in open agent settings that balances this tradeoff. Simulations of multiagent wildfire suppression problems demonstrate how communication can improve planning in open agent environments, as well as how agents tradeoff the benefits and costs of communication under different scenarios.
more » « less
Full Text Available
Anytime Learning of Sum-Product and Sum-Product-Max Networks

Pawar, Swaraj; Doshi, Prashant (January 2022, The 11th International Conference on Probabilistic Graphical Models)
Salmerón, Antonio; Rumı́, Rafael (Ed.)
Full Text Available
A survey of inverse reinforcement learning: Challenges, methods and progress

https://doi.org/10.1016/j.artint.2021.103500

Arora, Saurabh; Doshi, Prashant (August 2021, Artificial Intelligence)
null (Ed.)
Full Text Available
Min-Max Entropy Inverse RL of Multiple Tasks

Arora, Saurabh; Doshi, Prashant; Banerjee, Bikramjit (July 2021, IEEE International Conference on Robotics and Automation)
null (Ed.)
Multi-task IRL recognizes that expert(s) could be switching between multiple ways of solving the same problem, or interleaving demonstrations of multiple tasks. The learner aims to learn the reward functions that individually guide these distinct ways. We present a new method for multi-task IRL that generalizes the well-known maximum entropy approach by combining it with a Dirichlet process based minimum entropy clustering of the observed data. This yields a single nonlinear optimization problem, called MinMaxEnt Multi-task IRL (MME-MTIRL), which can be solved using the Lagrangian relaxation and gradient descent methods. We evaluate MME- MTIRL on the robotic task of sorting onions on a processing line where the expert utilizes multiple ways of detecting and removing blemished onions. The method is able to learn the underlying reward functions to a high level of accuracy and it improves on the previous approaches.
more » « less
Full Text Available

« Prev Next »

Search for: All records