NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Natural Language Can Help Bridge the Sim2Real Gap

Yu, Albert; Foote, Adeline; Mooney, Raymond; Martín-Martín, Roberto (June 2024, Robotics, Science and Systems (RSS))

The main challenge in learning image-conditioned robotic policies is acquiring a visual representation conducive to low-level control. Due to the high dimensionality of the image space, learning a good visual representation requires a considerable amount of visual data. However, when learning in the real world, data is expensive. Sim2Real is a promising paradigm for overcoming data scarcity in the real-world target domain by using a simulator to collect large amounts of cheap data closely related to the target task. However, it is difficult to transfer an image-conditioned policy from sim to real when the domains are very visually dissimilar. To bridge the sim2real visual gap, we propose using natural language descriptions of images as a unifying signal across domains that captures the underlying task-relevant semantics. Our key insight is that if two image observations from different domains are labeled with similar language, the policy should predict similar action distributions for both images. We demonstrate that training the image encoder to predict the language description or the distance between descriptions of a sim or real image serves as a useful, data-efficient pretraining step that helps learn a domain-invariant image representation. We can then use this image encoder as the backbone of an IL policy trained simultaneously on a large amount of simulated and a handful of real demonstrations. Our approach outperforms widely used prior sim2real methods and strong vision-language pretraining baselines like CLIP and R3M by 25 to 40 percent. See additional videos and materials at our project website.
more » « less
Full Text Available
Using Both Demonstrations and Language Instructions to Efficiently Learn Robotic Tasks

Yu, Albert; Mooney, Raymond (May 2023, International Conference on Learning Representations (ICLR))

Demonstrations and natural language instructions are two common ways to specify and teach robots novel tasks. However, for many complex tasks, a demonstration or language instruction alone contains ambiguities, preventing tasks from being specified clearly. In such cases, a combination of both a demonstration and an instruction more concisely and effectively conveys the task to the robot than either modality alone. To instantiate this problem setting, we train a single multi-task policy on a few hundred challenging robotic pick-and-place tasks and propose DeL-TaCo (Joint Demo-Language Task Conditioning), a method for conditioning a robotic policy on task embeddings comprised of two components: a visual demonstration and a language instruction. By allowing these two modalities to mutually disambiguate and clarify each other during novel task specification, DeL-TaCo (1) substantially decreases the teacher effort needed to specify a new task and (2) achieves better generalization performance on novel objects and instructions over previous task-conditioning methods. To our knowledge, this is the first work to show that simultaneously conditioning a multi-task robotic manipulation policy on both demonstration and language embeddings improves sample efficiency and generalization over conditioning on either modality alone. See additional materials at https://sites.google.com/view/del-taco-learning
more » « less
Full Text Available
A Ranking Game for Imitation Learning

Sikchi, H.; Saran, A.; Goo, W.; Niekum, S. (January 2023, Transactions on machine learning research)

Full Text Available
Understanding Acoustic Patterns of Human Teachers Demonstrating Manipulation Tasks to Robots

https://doi.org/10.1109/IROS47612.2022.9981053

Saran, A.; Desai, K.; Chang, M.L.; Lioutikov, R.; Thomaz, A.; Niekum, S. (October 2022, Proceedings of the International Conference on Intelligent Robots and Systems)

Full Text Available
Universal Off-Policy Evaluation

Chandak, Y; Niekum, S; Castro da Silva, B; Learned-Miller, E; Brunskill, E; Thomas, P (December 2021, Neural Information Processing Systems)
null (Ed.)
Full Text Available
Adversarial Intrinsic Motivation for Reinforcement Learning

Durugkar, I; Tec, M; Niekum, S; Stone, P (December 2021, Neural Information Processing Systems)
null (Ed.)
Full Text Available
SOPE: Spectrum of Off-Policy Estimators

Yuan, C; Chandak, Y; Giguere, S; Thomas, P; Niekum, S (December 2021, Neural Information Processing Systems)
null (Ed.)
Full Text Available
Distributional Depth-Based Estimation of Object Articulation Models

Jain, A; Giguere, S; Lioutikov, R; Niekum, S (November 2021, Conference on Robot Learning)
null (Ed.)
Full Text Available
You Only Evaluate Once: A Simple Baseline Algorithm for Offline RL

Goo, W; Niekum, S (November 2021, Conference on Robot Learning)
null (Ed.)
Full Text Available
SCAPE: Learning Stiffness Control from Augmented Position Control Experiences

Kim, M; Niekum, S; Deshpande, A (November 2021, Conference on Robot Learning)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records