Search for: All records

Creators/Authors contains: "Darrell, Trevor"

« Prev Next »

Total Resources

5

Resource Type
Conference Paper

5

Conference Proceeding

0

Dataset

0

Journal Article

0

Workshop Report

0

Availability
Full Text / Resource Available

5

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Guiding Pretraining in Reinforcement Learning with Large Language Models

Du, Yuqing ; Watkins, Olivia ; Wang, Zihan ; Colas, Cédric ; Darrell, Trevor ; Abbeel, Pieter ; Gupta, Abhishek ; Andreas, Jacob ( January 2023 , International Conference on Machine Learning)

Reinforcement learning algorithms typically struggle in the absence of a dense, well-shaped reward function. Intrinsically motivated exploration methods address this limitation by rewarding agents for visiting novel states or transitions, but these methods offer limited benefits in large environments where most discovered novelty is irrelevant for downstream tasks. We describe a method that uses background knowledge from text corpora to shape exploration. This method, called ELLM (Exploring with LLMs) rewards an agent for achieving goals suggested by a language model prompted with a description of the agent’s current state. By leveraging large-scale language model pretraining, ELLM guides agents toward human-meaningful and plausibly useful behaviors without requiring a human in the loop. We evaluate ELLM in the Crafter game environment and the Housekeep robotic simulator, showing that ELLM-trained agents have better coverage of common-sense behaviors during pretraining and usually match or improve performance on a range of downstream tasks.
more » « less
Full Text Available
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension

https://doi.org/10.18653/v1/2022.acl-long.357

Subramanian, Sanjay ; Merrill, William ; Darrell, Trevor ; Gardner, Matt ; Singh, Sameer ; Rohrbach, Anna ( January 2022 , Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))

Training a referring expression comprehension (ReC) model for a new visual domain requires collecting referring expressions, and potentially corresponding bounding boxes, for images in the domain. While large-scale pre-trained models are useful for image classification across domains, it remains unclear if they can be applied in a zero-shot manner to more complex tasks like ReC. We present ReCLIP, a simple but strong zero-shot baseline that repurposes CLIP, a state-of-the-art large-scale model, for ReC. Motivated by the close connection between ReC and CLIP’s contrastive pre-training objective, the first component of ReCLIP is a region-scoring method that isolates object proposals via cropping and blurring, and passes them to CLIP. However, through controlled experiments on a synthetic dataset, we find that CLIP is largely incapable of performing spatial reasoning off-the-shelf. We reduce the gap between zero-shot baselines from prior work and supervised models by as much as 29% on RefCOCOg, and on RefGTA (video game imagery), ReCLIP’s relative improvement over supervised ReC models trained on real images is 8%.
more » « less
Full Text Available
Deep Mixture of Experts via Shallow Embedding

Wang, Xin ; Yu, Fisher ; Dunlap, Lisa ; Ma, Yi-An ; Wang, Ruth ; Mirhoseini, Azalia ; Darrell, Trevor ; Gonzalez, Joseph E. ( July 2019 , Uncertainty in artificial intelligence)

Larger networks generally have greater representational power at the cost of increased computational complexity. Sparsifying such networks has been an active area of research but has been generally limited to static regularization or dynamic approaches using reinforcement learning. We explore a mixture of experts (MoE) approach to deep dynamic routing, which activates certain experts in the network on a per-example basis. Our novel DeepMoE architecture increases the representational power of standard convolutional networks by adaptively sparsifying and recalibrating channel-wise features in each convolutional layer. We employ a multi-headed sparse gating network to determine the selection and scaling of channels for each input, leveraging exponential combinations of experts within a single convolutional network. Our proposed architecture is evaluated on four benchmark datasets and tasks, and we show that Deep-MoEs are able to achieve higher accuracy with lower computation than standard convolutional networks.
more » « less
Full Text Available
CyCADA: Cycle-Consistent Adversarial Domain Adaptation

Hoffman, Judy ; Tzeng, Eric ; Park, Taesung ; Zhu, Jun-Yan ; Isola, Phillip ; Saenko, Kate ; Efros, Alexei ; Darrell, Trevor ( January 2018 , Proceedings of the 35th International Conference on Machine Learning)

Domain adaptation is critical for success in new, unseen environments. Adversarial adaptation models have shown tremendous progress towards adapting to new environments by focusing either on discovering domain invariant representations or by mapping between unpaired image domains. While feature space methods are difficult to interpret and sometimes fail to capture pixel-level and low-level domain shifts, image space methods sometimes fail to incorporate high level semantic knowledge relevant for the end task. We propose a model which adapts between domains using both generative image space alignment and latent representation space alignment. Our approach, Cycle-Consistent Adversarial Domain Adaptation (CyCADA), guides transfer between domains according to a specific discriminatively trained task and avoids divergence by enforcing consistency of the relevant semantics before and after adaptation. We evaluate our method on a variety of visual recognition and prediction settings, including digit classification and semantic segmentation of road scenes, advancing state-of-the-art performance for unsupervised adaptation from synthetic to real world driving domains.
more » « less
Full Text Available
Toward Multimodal Image-to-Image Translation

Zhu, Jun-Yan ; Zhang, Richard ; Pathak, Deepak ; Darrell, Trevor ; Efros, Alexei ; Wang, Oliver ; Shechtman, Eli ( January 2017 , Advances in neural information processing systems)

Full Text Available