NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Curricula for teaching end-users to kinesthetically program collaborative robots

https://doi.org/10.1371/journal.pone.0294786

Ajaykumar, Gopika; Hager, Gregory D; Huang, Chien-Ming (December 2023, PLOS ONE)
Adebisi, John (Ed.)
Non-expert users can now program robots using various end-user robot programming methods, which have widened the use of robots and lowered barriers preventing robot use by laypeople. Kinesthetic teaching is a common form of end-user robot programming, allowing users to forgo writing code by physically guiding the robot to demonstrate behaviors. Although it can be more accessible than writing code, kinesthetic teaching is difficult in practice because of users’ unfamiliarity with kinematics or limitations of robots and programming interfaces. Developing good kinesthetic demonstrations requires physical and cognitive skills, such as the ability to plan effective grasps for different task objects and constraints, to overcome programming difficulties. How to help users learn these skills remains a largely unexplored question, with users conventionally learning through self-guided practice. Our study compares how self-guided practice compares with curriculum-based training in building users’ programming proficiency. While we found no significant differences between study participants who learned through practice compared to participants who learned through our curriculum, our study reveals insights into factors contributing to end-user robot programmers’ confidence and success during programming and how learning interventions may contribute to such factors. Our work paves the way for further research on how to best structure training interventions for end-user robot programmers.
more » « less
Full Text Available
Learn Proportional Derivative Controllable Latent Space from Pixels

https://doi.org/10.1109/CASE49997.2022.9926535

Wang, Weiyao; Kobilarov, Marin; Hager, Gregory D. (August 2022, 2022 IEEE 18th International Conference on Automation Science and Engineering (CASE))

Full Text Available
Dynamical Scene Representation and Control with Keypoint-Conditioned Neural Radiance Field

https://doi.org/10.1109/CASE49997.2022.9926555

Wang, Weiyao; Morgan, Andrew S.; Dollar, Aaron M.; Hager, Gregory D. (August 2022, 2022 IEEE 18th International Conference on Automation Science and Engineering (CASE))

Full Text Available
"Good Robot! Now Watch This!": Repurposing Reinforcement Learning for Task-to-Task Transfer

Hundt, Andrew; Murali, Aditya; Hubli, Priyanka; Liu, Ran; Gopalan, Nakul; Gombolay, Matthew; Hager, Gregory D. (October 2021, Proceedings of Machine Learning Research)
Aleksandra Faust, David Hsu (Ed.)
Modern Reinforcement Learning (RL) algorithms are not sample efficient to train on multi-step tasks in complex domains, impeding their wider deployment in the real world. We address this problem by leveraging the insight that RL models trained to complete one set of tasks can be repurposed to complete related tasks when given just a handful of demonstrations. Based upon this insight, we propose See-SPOT-Run (SSR), a new computational approach to robot learning that enables a robot to complete a variety of real robot tasks in novel problem domains without task-specific training. SSR uses pretrained RL models to create vectors that represent model, task, and action relevance in demonstration and test scenes. SSR then compares these vectors via our Cycle Consistency Distance (CCD) metric to determine the next action to take. SSR completes 58% more task steps and 20% more trials than a baseline few-shot learning method that requires task-specific training. SSR also achieves a four order of magnitude improvement in compute efficiency and a 20% to three order of magnitude improvement in sample efficiency compared to the baseline and to training RL models from scratch. To our knowledge, we are the first to address multi-step tasks from demonstration on a real robot without task-specific training, where both the visual input and action space output are high dimensional. Code is available in the supplement.
more » « less
Full Text Available
Robust Policy Search for an Agile Ground Vehicle Under Perception Uncertainty

https://doi.org/10.1109/IROS51168.2021.9636552

Sefati, Shahriar; Mishra, Subhransu; Sheckells, Matthew; Katyal, Kapil D.; Bai, Jin; Hager, Gregory D.; Kobilarov, Marin (September 2021, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS))

Full Text Available
“Good Robot!”: Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer

https://doi.org/10.1109/LRA.2020.3015448

Hundt, Andrew; Killeen, Benjamin; Greene, Nicholas; Wu, Hongtao; Kwon, Heeyeon; Paxton, Chris; Hager, Gregory D. (October 2020, IEEE Robotics and Automation Letters)
null (Ed.)
Full Text Available
DASZL: Dynamic Action Signatures for Zero-shot Learning

Kim, Tae Soo; Jones, Jonathan; Peven, Michael; Xiao, Zihao; Bai, Jin; Zhang, Yi; Qiu, Weichao; Yuille, Alan; Hager, Gregory D (February 2021, Proceedings of the AAAI Conference on Artificial Intelligence)

There are many realistic applications of activity recognition where the set of potential activity descriptions is combinatorially large. This makes end-to-end supervised training of a recognition system impractical as no training set is practically able to encompass the entire label set. In this paper, we present an approach to fine-grained recognition that models activities as compositions of dynamic action signatures. This compositional approach allows us to reframe fine-grained recognition as zero-shot activity recognition, where a detector is composed “on the fly” from simple first-principles state machines supported by deep-learned components. We evaluate our method on the Olympic Sports and UCF101 datasets, where our model establishes a new state of the art under multiple experimental paradigms. We also extend this method to form a unique framework for zero-shot joint segmentation and classification of activities in video and demonstrate the first results in zero-shot decoding of complex action sequences on a widely-used surgical dataset. Lastly, we show that we can use off-the-shelf object detectors to recognize activities in completely de-novo settings with no additional training.
more » « less
Full Text Available
The CoSTAR Block Stacking Dataset: Learning with Workspace Constraints

Hundt, Andrew; Jain, Varun; Lin, Chia-Hung; Paxton, Chris; Hager, Gregory D. (November 2019, IROS 2019)

A robot can now grasp an object more effectively than ever before, but once it has the object what happens next? We show that a mild relaxation of the task and workspace constraints implicit in existing object grasping datasets can cause neural network based grasping algorithms to fail on even a simple block stacking task when executed under more realistic circumstances. To address this, we introduce the JHU CoSTAR Block Stacking Dataset (BSD), where a robot interacts with 5.1 cm colored blocks to complete an order-fulfillment style block stacking task. It contains dynamic scenes and real time-series data in a less constrained environment than comparable datasets. There are nearly 12,000 stacking attempts and over 2 million frames of real data. We discuss the ways in which this dataset provides a valuable resource for a broad range of other topics of investigation. We find that hand-designed neural networks that work on prior datasets do not generalize to this task. Thus, to establish a baseline for this dataset, we demonstrate an automated search of neural network based models using a novel multiple-input HyperTree MetaModel, and find a final model which makes reasonable 3D pose predictions for grasping and stacking on our dataset. The CoSTAR BSD, code, and instructions are available at sites.google.com/site/costardataset
more » « less
Full Text Available
Autonomously Navigating a Surgical Tool Inside the Eye by Learning from Demonstration

Kim, Ji Woong; He, Changyan; Urias, Muller; Gehlbach, Peter; Hager, Gregory D.; Iordachita, Iulian; Kobilarov, Marin (May 2019, ICRA 2019)

A fundamental challenge in retinal surgery is safely navigating a surgical tool to a desired goal position on the retinal surface while avoiding damage to surrounding tissues, a procedure that typically requires tens-of-microns accuracy. In practice, the surgeon relies on depth-estimation skills to localize the tool-tip with respect to the retina and perform the tool-navigation task, which can be prone to human error. To alleviate such uncertainty, prior work has introduced ways to assist the surgeon by estimating the tool-tip distance to the retina and providing haptic or auditory feedback. However, automating the tool-navigation task itself remains unsolved and largely un-explored. Such a capability, if reliably automated, could serve as a building block to streamline complex procedures and reduce the chance for tissue damage. Towards this end, we propose to automate the tool-navigation task by mimicking the perception-action feedback loop of an expert surgeon. Specifically, a deep network is trained to imitate expert trajectories toward various locations on the retina based on recorded visual servoing to a given goal specified by the user. The proposed autonomous navigation system is evaluated in simulation and in real-life experiments using a silicone eye phantom. We show that the network can reliably navigate a surgical tool to various desired locations within 137 µm accuracy in phantom experiments and 94 µm in simulation, and generalizes well to unseen situations such as in the presence of auxiliary surgical tools, variable eye backgrounds, and brightness conditions.
more » « less
Full Text Available
Evaluating Methods for End-User Creation of Robot Task Plans

Paxton, Chris; Jonathan, Felix; Hundt, Andrew; Bilge, Mutlu; Hager, Gregory D. (October 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems)

How can we enable users to create effective, perception-driven task plans for collaborative robots? We conducted a 35-person user study with the Behavior Tree-based CoSTAR system to determine which strategies for end user creation of generalizable robot task plans are most usable and effective. CoSTAR allows domain experts to author complex, perceptually grounded task plans for collaborative robots. As a part of CoSTAR's wide range of capabilities, it allows users to specify SmartMoves: abstract goals such as "pick up component A from the right side of the table." Users were asked to perform pick-and-place assembly tasks with either SmartMoves or one of three simpler baseline versions of CoSTAR. Overall, participants found CoSTAR to be highly usable, with an average System Usability Scale score of 73.4 out of 100. SmartMove also helped users perform tasks faster and more effectively; all SmartMove users completed the first two tasks, while not all users completed the tasks using the other strategies. SmartMove users showed better performance for incorporating perception across all three tasks.
more » « less
Full Text Available

« Prev Next »

Search for: All records