NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Toward Grounded Commonsense Reasoning

Kwon, Minae; Hu, Hengyuan; Myers, Vivek; Karamcheti, Siddharth; Dragan, Anca; Sadigh, Dorsa (May 2024, International Conference on Robotics and Automation (ICRA))

Full Text Available
Toward Grounded Commonsense Reasoning

Kwon, Minae; Hu, Hengyuan; Myers, Vivek; Karamcheti, Siddharth; Dragan, Anca; Sadigh, Dorsa (May 2024, International Conference on Robotics and Automation (ICRA))
Toward Grounded Commonsense Reasoning

Kwon, Minae; Hu, Hengyuan; Myers, Vivek; Karamcheti, Siddharth; Dragan, Anca; Sadigh, Dorsa (May 2024, International Conference on Robotics and Automation (ICRA))

Full Text Available
Bayesian Robustness: A Nonasymptotic Viewpoint

https://doi.org/10.1080/01621459.2023.2174121

Bhatia, Kush; Ma, Yi-An; Dragan, Anca D.; Bartlett, Peter L.; Jordan, Michael I. (January 2023, Journal of the American Statistical Association)

Full Text Available
Assisted Robust Reward Design

He, Jerry Zhi-Yang; Dragan, Anca D. (November 2021, Conference on Robot Learning)

Real-world robotic tasks require complex reward functions. When we define the problem the robot needs to solve, we pretend that a designer specifies this complex reward exactly, and it is set in stone from then on. In practice, however, reward design is an iterative process: the designer chooses a reward, eventually encounters an "edge-case" environment where the reward incentivizes the wrong behavior, revises the reward, and repeats. What would it mean to rethink robotics problems to formally account for this iterative nature of reward design? We propose that the robot not take the specified reward for granted, but rather have uncertainty about it, and account for the future design iterations as future evidence. We contribute an Assisted Reward Design method that speeds up the design process by anticipating and influencing this future evidence: rather than letting the designer eventually encounter failure cases and revise the reward then, the method actively exposes the designer to such environments during the development phase. We test this method in a simplified autonomous driving task and find that it more quickly improves the car's behavior in held-out environments by proposing environments that are "edge cases" for the current reward.
more » « less
Value Alignment Verification

Brown, Daniel S; Schneider, Jordan; Dragan, Anca; Niekum, Scott (July 2021, International Conference on Machine Learning)
null (Ed.)
Full Text Available
Situational Confidence Assistance for Lifelong Shared Autonomy

https://doi.org/10.1109/ICRA48506.2021.9561839

Zurek, Matthew; Bobu, Andreea; Brown, Daniel; Dragan, Anca (April 2021, International Conference on Robotics and Automation)

Shared autonomy enables robots to infer user intent and assist in accomplishing it. But when the user wants to do a new task that the robot does not know about, shared autonomy will hinder their performance by attempting to assist them with something that is not their intent. Our key idea is that the robot can detect when its repertoire of intents is insufficient to explain the user’s input, and give them back control. This then enables the robot to observe unhindered task execution, learn the new intent behind it, and add it to this repertoire. We demonstrate with both a case study and a user study that our proposed method maintains good performance when the human’s intent is in the robot’s repertoire, outperforms prior shared autonomy approaches when it isn’t, and successfully learns new skills, enabling efficient lifelong learning for confidence-based shared autonomy.
more » « less
Full Text Available
Value Alignment Verification

Brown, Daniel; Schneider, Jordan; Dragan, Anca; Niekum, Scott (January 2021, 38th International Conference on Machine Learning)

As humans interact with autonomous agents to perform increasingly complicated, potentially risky tasks, it is important to be able to efficiently evaluate an agent’s performance and correctness. In this paper we formalize and theoretically analyze the problem of efficient value alignment verification: how to efficiently test whether the behavior of another agent is aligned with a human’s values. The goal is to construct a kind of “driver’s test” that a human can give to any agent which will verify value alignment via a minimal number of queries. We study alignment verification problems with both idealized humans that have an explicit reward function as well as problems where they have implicit values. We analyze verification of exact value alignment for rational agents and propose and analyze heuristic and approximate value alignment verification tests in a wide range of gridworlds and a continuous autonomous driving domain. Finally, we prove that there exist sufficient conditions such that we can verify exact and approximate alignment across an infinite set of test environments via a constant- query-complexity alignment test.
more » « less
Full Text Available
Agnostic Learning with Unknown Utilities

https://doi.org/10.4230/LIPIcs.ITCS.2021.55

Bhatia, Kush; Bartlett, Peter L.; Dragan, Anca D.; Steinhardt, Jacob (January 2021, Leibniz international proceedings in informatics)
null (Ed.)
Full Text Available
How to Be Helpful to Multiple People at Once

https://doi.org/10.1111/cogs.12841

Gates, Vael; Griffiths, Thomas L.; Dragan, Anca D. (June 2020, Cognitive Science)

Full Text Available

« Prev Next »

Search for: All records