NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Semantic and Feature Guided Uncertainty Quantification of Visual Localization for Autonomous Vehicles

Wu, Qiyuan; Campbell, Mark (May 2025, IEEE International Conference on Robotics and Automation (ICRA))

Free, publicly-accessible full text available May 20, 2026
Mixed Signals: A Diverse Point Cloud Dataset for Heterogeneous LiDAR V2X Collaboration

Luo, Katie Z; Dao, Minh-Quan; Liu, Zhenzhen; Campbell, Mark; Chao, Wei-Lun; Weinberger, Kilian Q; Malis, Ezio; Fremont, Vincent; Hariharan, Bharath; Shan, Mao; et al (October 2025, IEEE)

Vehicle-to-everything (V2X) collaborative perception has emerged as a promising solution to address the limitations of single-vehicle perception systems. However, existing V2X datasets are limited in scope, diversity, and quality. To address these gaps, we present Mixed Signals, a comprehensive V2X dataset featuring 45.1k point clouds and 240.6k bounding boxes collected from three connected autonomous vehicles (CAVs) equipped with two different configurations of LiDAR sensors, plus a roadside unit with dual LiDARs. Our dataset provides point clouds and bounding box annotations across 10 classes, ensuring reliable data for perception training. We provide detailed statistical analysis on the quality of our dataset and extensively benchmark existing V2X methods on it. Mixed Signals is ready-to-use, with precise alignment and consistent annotations across time and viewpoints. We hope our work advances research in the emerging, impactful field of V2X perception. Dataset details at https://mixedsignalsdataset.cs.cornell.edu/.
more » « less
Free, publicly-accessible full text available October 23, 2026
Learning 3D Perception from Others' Predictions

Yoo, Jinsu; Feng, Zhenyang; Pan, Tai-Yu; Sun, Yihong; Phoo, Cheng Perng; Chen, Xiangyu; Campbell, Mark; Weinberger, Kilian Q; Hariharan, Bharath; Chao, Wei-Lun (April 2025, International Conference on Learning Representations)

Accurate 3D object detection in real-world environments requires a huge amount of annotated data with high quality. Acquiring such data is tedious and expensive, and often needs repeated effort when a new sensor is adopted or when the detector is deployed in a new environment. We investigate a new scenario to construct 3D object detectors: learning from the predictions of a nearby unit that is equipped with an accurate detector. For example, when a self-driving car enters a new area, it may learn from other traffic participants whose detectors have been optimized for that area. This setting is label-efficient, sensor-agnostic, and communication-efficient: nearby units only need to share the predictions with the ego agent (e.g., car). Naively using the received predictions as ground-truths to train the detector for the ego car, however, leads to inferior performance. We systematically study the problem and identify viewpoint mismatches and mislocalization (due to synchronization and GPS errors) as the main causes, which unavoidably result in false positives, false negatives, and inaccurate pseudo labels. We propose a distance-based curriculum, first learning from closer units with similar viewpoints and subsequently improving the quality of other units' predictions via self-training. We further demonstrate that an effective pseudo label refinement module can be trained with a handful of annotated data, largely reducing the data quantity necessary to train an object detector. We validate our approach on the recently released real-world collaborative driving dataset, using reference cars' predictions as pseudo labels for the ego car. Extensive experiments including several scenarios (e.g., different sensors, detectors, and domains) demonstrate the effectiveness of our approach toward label-efficient learning of 3D perception from other units' predictions.
more » « less
Free, publicly-accessible full text available April 28, 2026
SWIFT: Strategic Weather-informed Image-based Forecasting for Trajectories

https://doi.org/10.1109/IROS58592.2024.10802567

Xia, Youya; Nino, Jose; Han, Yutao; Campbell, Mark (October 2024, IEEE)

Full Text Available
Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene

https://doi.org/10.1109/CVPR52734.2025.01123

Pan, Tai-Yu; Jeon, Sooyoung; Fan, Mengdi; Yoo, Jinsu; Feng, Zhenyang; Campbell, Mark; Weinberger, Kilian Q; Hariharan, Bharath; Chao, Wei-Lun (June 2025, IEEE)

Self-driving cars relying solely on ego-centric perception face limitations in sensing, often failing to detect occluded, faraway objects. Collaborative autonomous driving (CAV) seems like a promising direction, but collecting data for development is non-trivial. It requires placing multiple sensor-equipped agents in a real-world driving scene, simultaneously! As such, existing datasets are limited in locations and agents. We introduce a novel surrogate to the rescue, which is to generate realistic perception from different viewpoints in a driving scene, conditioned on a real-world sample—the ego-car’s sensory data. This surrogate has huge potential: it could potentially turn any ego-car dataset into a collaborative driving one to scale up the development of CAV. We present the very first solution, using a combination of simulated collaborative data and real ego-car data. Our method Transfer Your Perspective (TYP) learns a conditioned diffusion model whose output samples are not only realistic but also consistent in both semantics and layouts with the given ego-car data. Empirical results demonstrate TYP’s effectiveness in aiding in a CAV setting. In particular, TYP enables us to (pre-)train collaborative perception algorithms like early and late fusion with little or no real-world collaborative data, greatly facilitating downstream CAV applications.
more » « less
Free, publicly-accessible full text available June 10, 2026
DiffuBox: Refining 3D Object Detection with Point Diffusion

Chen, Xiangyu; Liu, Zhenzhen; Luo, Katie Z; Datta, Siddhartha; Polavaram, Adhitya; Wang, Yan; You, Yurong; Li, Boyi; Pavone, Marco; Chao, Wei-Lun; et al (December 2024, Advances in Neural Information Processing Systems 37 (NeurIPS 2024))

Ensuring robust 3D object detection and localization is crucial for many applications in robotics and autonomous driving. Recent models, however, face difficulties in maintaining high performance when applied to domains with differing sensor setups or geographic locations, often resulting in poor localization accuracy due to domain shift. To overcome this challenge, we introduce a novel diffusion-based box refinement approach. This method employs a domain-agnostic diffusion model, conditioned on the LiDAR points surrounding a coarse bounding box, to simultaneously refine the box’s location, size, and orientation. We evaluate this approach under various domain adaptation settings, and our results reveal significant improvements across different datasets, object classes and detectors. Our PyTorch implementation is available at https://github.com/cxy1997/DiffuBox.
more » « less
Full Text Available
Better Monocular 3D Detectors with LiDAR from the Past

https://doi.org/10.1109/ICRA57147.2024.10610444

You, Yurong; Phoo, Cheng Perng; Andres_Diaz-Ruiz, Carlos; Luo, Katie Z; Chao, Wei-Lun; Campbell, Mark; Hariharan, Bharath; Weinberger, Kilian Q (May 2024, IEEE)

Full Text Available
Pre-Training LiDAR-Based 3D Object Detectors Through Colorization

Pan, Tai-Yu; Ma, Chenyang; Chen, Tianle; Phoo, Cheng Perng; Luo, Katie Z; You, Yurong; Campbell, Mark; Weinberger, Kilian Q; Hariharan, Bharath; Chao, Wei-Lun (May 2024, International Conference on Learning Representations)

Full Text Available
Probabilistic Uncertainty Quantification of Prediction Models with Application to Visual Localization

Chen, Junan; Monica, Josephine; Chao, Wei-Lun; Campbell, Mark (July 2023, Proceedings IEEE International Conference on Robotics and Automation)

The uncertainty quantification of prediction mod- els (e.g., neural networks) is crucial for their adoption in many robotics applications. This is arguably as important as making accurate predictions, especially for safety-critical applications such as self-driving cars. This paper proposes our approach to uncertainty quantification in the context of visual localization for autonomous driving, where we predict locations from images. Our proposed framework estimates probabilistic uncertainty by creating a sensor error model that maps an inter- nal output of the prediction model to the uncertainty. The sensor error model is created using multiple image databases of visual localization, each with ground-truth location. We demonstrate the accuracy of our uncertainty prediction framework using the Ithaca365 dataset, which includes variations in lighting, weather (sunny, snowy, night), and alignment errors between databases. We analyze both the predicted uncertainty and its incorporation into a Kalman-based localization filter. Our results show that prediction error variations increase with poor weather and lighting condition, leading to greater uncertainty and outliers, which can be predicted by our proposed uncertainty model. Additionally, our probabilistic error model enables the filter to remove ad hoc sensor gating, as the uncertainty automatically adjusts the model to the input data.
more » « less
Full Text Available
Probabilistic Uncertainty Quantification of Prediction Models with Application to Visual Localization

https://doi.org/10.1109/ICRA48891.2023.10160298

Chen, Junan; Monica, Josephine; Chao, Wei-Lun; Campbell, Mark (May 2023, IEEE International Conference on Robotics and Automation (ICRA))

Full Text Available

« Prev Next »

Search for: All records