Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene

Pan, Tai-Yu; Jeon, Sooyoung; Fan, Mengdi; Yoo, Jinsu; Feng, Zhenyang; Campbell, Mark; Weinberger, Kilian Q; Hariharan, Bharath; Chao, Wei-Lun

doi:10.1109/CVPR52734.2025.01123

Citation Details

Self-driving cars that rely solely on ego-centric perception are limited in what they can sense and often fail to detect occluded or faraway objects. Collaborative autonomous driving (CAV) seems like a promising direction, but collecting data for development is non-trivial: it requires placing multiple sensor-equipped agents in a real-world driving scene simultaneously. As such, existing datasets are limited in locations and agents. We introduce a novel surrogate: generating realistic perception from different viewpoints in a driving scene, conditioned on a real-world sample, namely the ego-car’s sensory data. This surrogate could turn any ego-car dataset into a collaborative driving one, scaling up the development of CAV. We present the very first solution, using a combination of simulated collaborative data and real ego-car data. Our method, Transfer Your Perspective (TYP), learns a conditioned diffusion model whose output samples are not only realistic but also consistent in both semantics and layouts with the given ego-car data. Empirical results demonstrate TYP’s effectiveness in the CAV setting. In particular, TYP enables us to (pre-)train collaborative perception algorithms like early and late fusion with little or no real-world collaborative data, greatly facilitating downstream CAV applications.

Award ID(s): 2107077

PAR ID: 10639178

Author(s) / Creator(s): Pan, Tai-Yu; Jeon, Sooyoung; Fan, Mengdi; Yoo, Jinsu; Feng, Zhenyang; Campbell, Mark; Weinberger, Kilian Q; Hariharan, Bharath; Chao, Wei-Lun

Publisher / Repository: IEEE

Date Published: 2025-06-10

Page Range / eLocation ID: 12027 to 12036

Format(s): Medium: X

Location: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript
Conference Paper:
https://doi.org/10.1109/CVPR52734.2025.01123
