NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Stable-BC: Controlling Covariate Shift With Stable Behavior Cloning

https://doi.org/10.1109/LRA.2025.3526439

Mehta, Shaunak A; Ciftci, Yusuf Umut; Ramachandran, Balamurugan; Bansal, Somil; Losey, Dylan P (February 2025, IEEE Robotics and Automation Letters)

Free, publicly-accessible full text available February 1, 2026
Combining and Decoupling Rigid and Soft Grippers to Enhance Robotic Manipulation

https://doi.org/10.1089/soro.2024.0062

Keely, Maya; Kim, Yeunhee; Mehta, Shaunak A; Hoegerman, Joshua; Ramirez_Sanchez, Robert; Paul, Emily; Mills, Camryn; Losey, Dylan P; Bartlett, Michael D (March 2025, Soft Robotics)

For robot arms to perform everyday tasks in unstructured environments, these robots must be able to manipulate a diverse range of objects. Today’s robots often grasp objects with either soft grippers or rigid end-effectors. However, purely rigid or purely soft grippers have fundamental limitations as follows: soft grippers struggle with irregular heavy objects, whereas rigid grippers often cannot grasp small numerous items. In this article, we therefore introduce RISOs, a mechanics and controls approach for unifying traditional RIgid end-effectors with a novel class of SOft adhesives. When grasping an object, RISOs can use either the rigid end-effector (pinching the item between nondeformable fingers) and/or the soft materials (attaching and releasing items with switchable adhesives). This enhances manipulation capabilities by combining and decoupling rigid and soft mechanisms. With RISOs, robots can perform grasps along a spectrum from fully rigid, to fully soft, to rigid-soft, enabling real-time object manipulation across a 1.5 million times range in weight (from 2 mg to 2.9 kg). To develop RISOs, we first model and characterize the soft switchable adhesives. We then mount sheets of these soft adhesives on the surfaces of rigid end-effectors and develop control strategies that make it easier for robot arms and human operators to utilize RISOs. The resulting RISO grippers were able to pick up, carry, and release a larger set of objects than existing grippers, and participants also preferred using RISO. Overall, our experimental and user study results suggest that RISOs provide an exceptional gripper range in both capacity and object diversity.
more » « less
Free, publicly-accessible full text available March 10, 2026
Unified Learning from Demonstrations, Corrections, and Preferences during Physical Human–Robot Interaction

https://doi.org/10.1145/3623384

Mehta, Shaunak A; Losey, Dylan P (September 2024, ACM Transactions on Human-Robot Interaction)

Humans can leverage physical interaction to teach robot arms. This physical interaction takes multiple forms depending on the task, the user, and what the robot has learned so far. State-of-the-art approaches focus on learning from a single modality, or combine some interaction types. Some methods do so by assuming that the robot has prior information about the features of the task and the reward structure. By contrast, in this article, we introduce an algorithmic formalism that unites learning from demonstrations, corrections, and preferences. Our approach makes no assumptions about the tasks the human wants to teach the robot; instead, we learn a reward model from scratch by comparing the human’s input to nearby alternatives, i.e., trajectories close to the human’s feedback. We first derive a loss function that trains an ensemble of reward models to match the human’s demonstrations, corrections, and preferences. The type and order of feedback is up to the human teacher: We enable the robot to collect this feedback passively or actively. We then apply constrained optimization to convert our learned reward into a desired robot trajectory. Through simulations and a user study, we demonstrate that our proposed approach more accurately learns manipulation tasks from physical human interaction than existing baselines, particularly when the robot is faced with new or unexpected objectives. Videos of our user study are available at https://youtu.be/FSUJsTYvEKU
more » « less
Full Text Available
Waypoint-Based Reinforcement Learning for Robot Manipulation Tasks

https://doi.org/10.1109/IROS58592.2024.10802681

Mehta, Shaunak A; Habibian, Soheil; Losey, Dylan P (October 2024, Proceedings of the International Conference on Intelligent Robots and Systems)

Robot arms should be able to learn new tasks. One framework here is reinforcement learning, where the robot is given a reward function that encodes the task, and the robot autonomously learns actions to maximize its reward. Existing approaches to reinforcement learning often frame this problem as a Markov decision process, and learn a policy (or a hierarchy of policies) to complete the task. These policies reason over hundreds of fine-grained actions that the robot arm needs to take: e.g., moving slightly to the right or rotating the end-effector a few degrees. But the manipulation tasks that we want robots to perform can often be broken down into a small number of high-level motions: e.g., reaching an object or turning a handle. In this paper we therefore propose a waypoint-based approach for model-free reinforcement learning. Instead of learning a low-level policy, the robot now learns a trajectory of waypoints, and then interpolates between those waypoints using existing controllers. Our key novelty is framing this waypoint-based setting as a sequence of multi-armed bandits: each bandit problem corresponds to one waypoint along the robot’s motion. We theoretically show that an ideal solution to this reformulation has lower regret bounds than standard frameworks. We also introduce an approximate posterior sampling solution that builds the robot’s motion one waypoint at a time. Results across benchmark simulations and two real-world experiments suggest that this proposed approach learns new tasks more quickly than state-of-the-art baselines. See our website here: https://collab.me.vt.edu/rl-waypoints/
more » « less
Full Text Available
StROL: Stabilized and Robust Online Learning From Humans

https://doi.org/10.1109/LRA.2024.3354626

Mehta, Shaunak A; Meng, Forrest; Bajcsy, Andrea; Losey, Dylan P (March 2024, IEEE Robotics and Automation Letters)

Full Text Available
RISO: Combining Rigid Grippers with Soft Switchable Adhesives

https://doi.org/10.1109/RoboSoft55895.2023.10122030

Mehta, Shaunak A.; Kim, Yeunhee; Hoegerman, Joshua; Bartlett, Michael D.; Losey, Dylan P. (January 2023, IEEE International Conference on Soft Robotics (RoboSoft))

Full Text Available

Search for: All records