NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

ON SPEEDING UP LANGUAGE MODEL EVALUATION

Zhou, Jin Peng; Belardi, Christian K; Wu, Ruihan; Zhang, Travis; Gomes, Carla P; Sun, Wen; Weinberger, Kilian Q (June 2025, International Conference on Learning Representations)

Developing prompt-based methods with Large Language Models (LLMs) requires making numerous decisions, which give rise to a combinatorial search problem over hyper-parameters. This exhaustive evaluation can be time-consuming and costly. In this paper, we propose an adaptive approach to explore this space. We are exploiting the fact that often only few samples are needed to identify clearly superior or inferior settings, and that many evaluation tests are highly correlated. We lean on multi-armed bandits to sequentially identify the next (method, validation sample)-pair to evaluate and utilize low-rank matrix factorization to fill in missing evaluations. We carefully assess the efficacy of our approach on several competitive benchmark problems and show that it can identify the top-performing method using only 5-15% of the typical resources—resulting in 85-95% LLM cost savings. Our code is available at https://github.com/kilian-group/banditeval.
more » « less
Free, publicly-accessible full text available June 11, 2026
Unsupervised Domain Adaptation for Self-Driving from Past Traversal Features

https://doi.org/10.1109/ICCVW60793.2023.00436

Zhang, Travis; Luo, Katie; Phoo, Cheng Perng; You, Yurong; Chao, Wei-Lun; Hariharan, Bharath; Campbell, Mark; Weinberger, Kilian Q. (October 2023, IEEE/CVF International Conference on Computer Vision Workshops)
Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual Patterns

https://doi.org/10.1109/ICRA46639.2022.9811574

Buddareddygari, Prasanth; Zhang, Travis; Yang, Yezhou; Ren, Yi (May 2022, 2022 International Conference on Robotics and Automation (ICRA))

Recent studies demonstrated the vulnerability of control policies learned through deep reinforcement learning against adversarial attacks, raising concerns about the application of such models to risk-sensitive tasks such as autonomous driving. Threat models for these demonstrations are limited to (1) targeted attacks through real-time manipulation of the agent's observation, and (2) untargeted attacks through manipulation of the physical environment. The former assumes full access to the agent's states/observations at all times, while the latter has no control over attack outcomes. This paper investigates the feasibility of targeted attacks through visually learned patterns placed on physical objects in the environment, a threat model that combines the practicality and effectiveness of the existing ones. Through analysis, we demonstrate that a pre-trained policy can be hijacked within a time window, e.g., performing an unintended self-parking, when an adversarial object is present. To enable the attack, we adopt an assumption that the dynamics of both the environment and the agent can be learned by the attacker. Lastly, we empirically show the effectiveness of the proposed attack on different driving scenarios, perform a location robustness test, and study the tradeoff between the attack strength and its effectiveness Code is available at https://github.com/ASU-APG/ Targeted-Physical-Adversarial-Attacks-on-AD
more » « less
Full Text Available
Unsupervised Adaptation from Repeated Traversals for Autonomous Driving

You, Yurong; Phoo, Cheng Perng; Luo, Katie; Zhang, Travis; Chao, Wei-Lun; Hariharan, Bharath; Campbell, Mark; Weinberger, Kilian Q. (January 2022, Advances in neural information processing systems)

For a self-driving car to operate reliably, its perceptual system must generalize to the end-user's environment---ideally without additional annotation efforts. One potential solution is to leverage unlabeled data (eg, unlabeled LiDAR point clouds) collected from the end-users' environments (ie target domain) to adapt the system to the difference between training and testing environments. While extensive research has been done on such an unsupervised domain adaptation problem, one fundamental problem lingers: there is no reliable signal in the target domain to supervise the adaptation process. To overcome this issue we observe that it is easy to collect unsupervised data from multiple traversals of repeated routes. While different from conventional unsupervised domain adaptation, this assumption is extremely realistic since many drivers share the same roads. We show that this simple additional assumption is sufficient to obtain a potent signal that allows us to perform iterative self-training of 3D object detectors on the target domain. Concretely, we generate pseudo-labels with the out-of-domain detector but reduce false positives by removing detections of supposedly mobile objects that are persistent across traversals. Further, we reduce false negatives by encouraging predictions in regions that are not persistent. We experiment with our approach on two large-scale driving datasets and show remarkable improvement in 3D object detection of cars, pedestrians, and cyclists, bringing us a step closer to generalizable autonomous driving.
more » « less
Full Text Available
Unsupervised Adaptation from Repeated Traversals for Autonomous Driving

You, Yurong; Phoo, Cheng Perng; Luo, Katie; Zhang, Travis; Chao, Wei-Lun; Hariharan, Bharath; Campbell; Campbell, Mark; Weinberger, Kilian Q. (January 2022, Conference on Neural Information Processing Systems)

Full Text Available

Search for: All records