NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

PECAN: Personalizing Robot Behaviors through a Learned Canonical Space

https://doi.org/10.1145/3737894

Nemlekar, Heramb; Sanchez, Robert Ramirez; Losey, Dylan P (December 2025, ACM Transactions on Human-Robot Interaction)

Robots should personalize how they perform tasks to match the needs of individual human users. Today’s robots achieve this personalization by asking for the human’s feedback in the task space. For example, an autonomous car might show the human two different ways to decelerate at stoplights, and ask the human which of these motions they prefer. This current approach to personalization isindirect: Based on the behaviors the human selects (e.g., decelerating slowly), the robot tries to infer their underlying preference (e.g., defensive driving). By contrast, our article develops a learning and interface-based approach that enables humans todirectlyindicate their desired style. We do this by learning an abstract, low-dimensional, and continuous canonical space from human demonstration data. Each point in the canonical space corresponds to a different style (e.g., defensive or aggressive driving), and users can directly personalize the robot’s behavior by simply clicking on a point. Given the human’s selection, the robot then decodes this canonical style across each task in the dataset—e.g., if the human selects a defensive style, the autonomous car personalizes its behavior to drive defensively when decelerating, passing other cars, or merging onto highways. We refer to our resulting approach as PECAN:Personalizing Robot Behaviors through a LearnedCanonical Space. Our simulations and user studies suggest that humans prefer using PECAN to directly personalize robot behavior (particularly when those users become familiar with PECAN), and that users find the learned canonical space to be intuitive and consistent. See videos here:https://youtu.be/wRJpyr23PKI.
more » « less
Free, publicly-accessible full text available December 31, 2026
Stable-BC: Controlling Covariate Shift With Stable Behavior Cloning

https://doi.org/10.1109/LRA.2025.3526439

Mehta, Shaunak A; Ciftci, Yusuf Umut; Ramachandran, Balamurugan; Bansal, Somil; Losey, Dylan P (February 2025, IEEE Robotics and Automation Letters)

Free, publicly-accessible full text available February 1, 2026
Combining and Decoupling Rigid and Soft Grippers to Enhance Robotic Manipulation

https://doi.org/10.1089/soro.2024.0062

Keely, Maya; Kim, Yeunhee; Mehta, Shaunak A; Hoegerman, Joshua; Ramirez_Sanchez, Robert; Paul, Emily; Mills, Camryn; Losey, Dylan P; Bartlett, Michael D (March 2025, Soft Robotics)

For robot arms to perform everyday tasks in unstructured environments, these robots must be able to manipulate a diverse range of objects. Today’s robots often grasp objects with either soft grippers or rigid end-effectors. However, purely rigid or purely soft grippers have fundamental limitations as follows: soft grippers struggle with irregular heavy objects, whereas rigid grippers often cannot grasp small numerous items. In this article, we therefore introduce RISOs, a mechanics and controls approach for unifying traditional RIgid end-effectors with a novel class of SOft adhesives. When grasping an object, RISOs can use either the rigid end-effector (pinching the item between nondeformable fingers) and/or the soft materials (attaching and releasing items with switchable adhesives). This enhances manipulation capabilities by combining and decoupling rigid and soft mechanisms. With RISOs, robots can perform grasps along a spectrum from fully rigid, to fully soft, to rigid-soft, enabling real-time object manipulation across a 1.5 million times range in weight (from 2 mg to 2.9 kg). To develop RISOs, we first model and characterize the soft switchable adhesives. We then mount sheets of these soft adhesives on the surfaces of rigid end-effectors and develop control strategies that make it easier for robot arms and human operators to utilize RISOs. The resulting RISO grippers were able to pick up, carry, and release a larger set of objects than existing grippers, and participants also preferred using RISO. Overall, our experimental and user study results suggest that RISOs provide an exceptional gripper range in both capacity and object diversity.
more » « less
Free, publicly-accessible full text available March 10, 2026
Unified Learning from Demonstrations, Corrections, and Preferences during Physical Human–Robot Interaction

https://doi.org/10.1145/3623384

Mehta, Shaunak A; Losey, Dylan P (September 2024, ACM Transactions on Human-Robot Interaction)

Humans can leverage physical interaction to teach robot arms. This physical interaction takes multiple forms depending on the task, the user, and what the robot has learned so far. State-of-the-art approaches focus on learning from a single modality, or combine some interaction types. Some methods do so by assuming that the robot has prior information about the features of the task and the reward structure. By contrast, in this article, we introduce an algorithmic formalism that unites learning from demonstrations, corrections, and preferences. Our approach makes no assumptions about the tasks the human wants to teach the robot; instead, we learn a reward model from scratch by comparing the human’s input to nearby alternatives, i.e., trajectories close to the human’s feedback. We first derive a loss function that trains an ensemble of reward models to match the human’s demonstrations, corrections, and preferences. The type and order of feedback is up to the human teacher: We enable the robot to collect this feedback passively or actively. We then apply constrained optimization to convert our learned reward into a desired robot trajectory. Through simulations and a user study, we demonstrate that our proposed approach more accurately learns manipulation tasks from physical human interaction than existing baselines, particularly when the robot is faced with new or unexpected objectives. Videos of our user study are available at https://youtu.be/FSUJsTYvEKU
more » « less
Full Text Available
Waypoint-Based Reinforcement Learning for Robot Manipulation Tasks

https://doi.org/10.1109/IROS58592.2024.10802681

Mehta, Shaunak A; Habibian, Soheil; Losey, Dylan P (October 2024, Proceedings of the International Conference on Intelligent Robots and Systems)

Robot arms should be able to learn new tasks. One framework here is reinforcement learning, where the robot is given a reward function that encodes the task, and the robot autonomously learns actions to maximize its reward. Existing approaches to reinforcement learning often frame this problem as a Markov decision process, and learn a policy (or a hierarchy of policies) to complete the task. These policies reason over hundreds of fine-grained actions that the robot arm needs to take: e.g., moving slightly to the right or rotating the end-effector a few degrees. But the manipulation tasks that we want robots to perform can often be broken down into a small number of high-level motions: e.g., reaching an object or turning a handle. In this paper we therefore propose a waypoint-based approach for model-free reinforcement learning. Instead of learning a low-level policy, the robot now learns a trajectory of waypoints, and then interpolates between those waypoints using existing controllers. Our key novelty is framing this waypoint-based setting as a sequence of multi-armed bandits: each bandit problem corresponds to one waypoint along the robot’s motion. We theoretically show that an ideal solution to this reformulation has lower regret bounds than standard frameworks. We also introduce an approximate posterior sampling solution that builds the robot’s motion one waypoint at a time. Results across benchmark simulations and two real-world experiments suggest that this proposed approach learns new tasks more quickly than state-of-the-art baselines. See our website here: https://collab.me.vt.edu/rl-waypoints/
more » « less
Full Text Available
Kiri-Spoon: A Soft Shape-Changing Utensil for Robot-Assisted Feeding

https://doi.org/10.1109/IROS58592.2024.10801346

Keely, Maya N; Nemlekar, Heramb; Losey, Dylan P (October 2024, IEEE)

Full Text Available
Aligning Learning with Communication in Shared Autonomy

https://doi.org/10.1109/IROS58592.2024.10801897

Hoegerman, Joshua; Sagheb, Shahabedin; Christie, Benjamin A; Losey, Dylan P (October 2024, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS))

Assistive robot arms can help humans by partially automating their desired tasks. Consider an adult with motor impairments controlling an assistive robot arm to eat dinner. The robot can reduce the number of human inputs — and how precise those inputs need to be — by recognizing what the human wants (e.g., a fork) and assisting for that task (e.g., moving towards the fork). Prior research has largely focused on learning the human’s task and providing meaningful assistance. But as the robot learns and assists, we also need to ensure that the human understands the robot’s intent (e.g., does the human know the robot is reaching for a fork?). In this paper, we study the effects of communicating learned assistance from the robot back to the human operator. We do not focus on the specific interfaces used for communication. Instead, we develop experimental and theoretical models of a) how communication changes the way humans interact with assistive robot arms, and b) how robots can harness these changes to better align with the human’s intent. We first conduct online and in-person user studies where participants operate robots that provide partial assistance, and we measure how the human’s inputs change with and without communication. With communication, we find that humans are more likely to intervene when the robot incorrectly predicts their intent, and more likely to release control when the robot correctly understands their task. We then use these findings to modify an established robot learning algorithm so that the robot can correctly interpret the human’s inputs when communication is present. Our results from a second in-person user study suggest that this combination of communication and learning outperforms assistive systems that isolate either learning or communication. See videos here: https://youtu.be/BET9yuVTVU4
more » « less
Full Text Available
LIMIT: Learning Interfaces to Maximize Information Transfer

https://doi.org/10.1145/3675758

Christie, Benjamin A; Losey, Dylan P (August 2024, ACM Transactions on Human-Robot Interaction)

Robots can use auditory, visual, or haptic interfaces to convey information to human users. The way these interfaces select signals is typically pre-defined by the designer: for instance, a haptic wristband might vibrate when the robot is moving and squeeze when the robot stops. But different people interpret the same signals in different ways, so that what makes sense to one person might be confusing or unintuitive to another. In this paper we introduce a unified algorithmic formalism for learningco-adaptiveinterfaces fromscratch. Our method does not need to know the human’s task (i.e., what the human is using these signals for). Instead, our insight is that interpretable interfaces should select signals that maximizecorrelationbetween the human’s actions and the information the interface is trying to convey. Applying this insight we develop LIMIT: Learning Interfaces to Maximize Information Transfer. LIMIT optimizes a tractable, real-time proxy of information gain in continuous spaces. The first time a person works with our system the signals may appear random; but over repeated interactions the interface learns a one-to-one mapping between displayed signals and human responses. Our resulting approach is both personalized to the current user and not tied to any specific interface modality. We compare LIMIT to state-of-the-art baselines across controlled simulations, an online survey, and an in-person user study with auditory, visual, and haptic interfaces. Overall, our results suggest that LIMIT learns interfaces that enable users to complete the task more quickly and efficiently, and users subjectively prefer LIMIT to the alternatives. See videos here:https://youtu.be/IvQ3TM1_2fA.
more » « less
Full Text Available
StROL: Stabilized and Robust Online Learning From Humans

https://doi.org/10.1109/LRA.2024.3354626

Mehta, Shaunak A; Meng, Forrest; Bajcsy, Andrea; Losey, Dylan P (March 2024, IEEE Robotics and Automation Letters)

Full Text Available
RISO: Combining Rigid Grippers with Soft Switchable Adhesives

https://doi.org/10.1109/RoboSoft55895.2023.10122030

Mehta, Shaunak A.; Kim, Yeunhee; Hoegerman, Joshua; Bartlett, Michael D.; Losey, Dylan P. (January 2023, IEEE International Conference on Soft Robotics (RoboSoft))

Full Text Available

« Prev Next »

Search for: All records