NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

One-Step Diffusion with Distribution Matching Distillation

https://doi.org/10.1109/CVPR52733.2024.00632

Yin, Tianwei; Gharbi, Michaël; Zhang, Richard; Shechtman, Eli; Durand, Frédo; Freeman, William T; Park, Taesung (June 2024, IEEE)

Full Text Available
Learning Task-Specific Strategies for Accelerated MRI

https://doi.org/10.1109/TCI.2024.3410521

Wu, Zihui; Yin, Tianwei; Sun, Yu; Frost, Robert; van_der_Kouwe, Andre; Dalca, Adrian V; Bouman, Katherine L (January 2024, IEEE Transactions on Computational Imaging)

Full Text Available
Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision

Tewari, Ayush; Yin, Tianwei; Cazenavette, George; Rezchikov, Semon; Tenenbaum, Joshua B; Durand, Frédo; Freeman, William T; Sitzmann, Vincent (December 2023, Neural Information Processing Systems)

Denoising diffusion models have emerged as a powerful class of generative models capable of capturing the distributions of complex, real-world signals. However, current approaches can only model distributions for which training samples are directly accessible, which is not the case in many real-world tasks. In inverse graphics, for instance, we seek to sample from a distribution over 3D scenes consistent with an image but do not have access to ground-truth 3D scenes, only 2D images. We present a new class of denoising diffusion probabilistic models that learn to sample from distributions of signals that are never observed directly, but instead are only measured through a known differentiable forward model that generates partial observations of the unknown signal. To accomplish this, we directly integrate the forward model into the denoising process. At test time, our approach enables us to sample from the distribution over underlying signals consistent with some partial observation. We demonstrate the efficacy of our approach on three challenging computer vision tasks. For instance, in inverse graphics, we demonstrate that our model enables us to directly sample from the distribution 3D scenes consistent with a single 2D input image.
more » « less
Full Text Available
Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision

Tewari, Ayush; Yin, Tianwei; Cazenavette, George; Rezchikov, Semon; Tenenbaum, Joshua B; Durand, Frédo; Freeman, William T; Sitzmann, Vincent (December 2023, Neural Information Processing Systems)

Denoising diffusion models have emerged as a powerful class of generative models capable of capturing the distributions of complex, real-world signals. However, current approaches can only model distributions for which training samples are directly accessible, which is not the case in many real-world tasks. In inverse graphics, for instance, we seek to sample from a distribution over 3D scenes consistent with an image but do not have access to ground-truth 3D scenes, only 2D images. We present a new class of denoising diffusion probabilistic models that learn to sample from distributions of signals that are never observed directly, but instead are only measured through a known differentiable forward model that generates partial observations of the unknown signal. To accomplish this, we directly integrate the forward model into the denoising process. At test time, our approach enables us to sample from the distribution over underlying signals consistent with some partial observation. We demonstrate the efficacy of our approach on three challenging computer vision tasks. For instance, in inverse graphics, we demonstrate that our model enables us to directly sample from the distribution 3D scenes consistent with a single 2D input image.
more » « less
Full Text Available
Global Tracking Transformers

https://doi.org/10.1109/cvpr52688.2022.00857

Zhou, Xingyi; Yin, Tianwei; Koltun, Vladlen; Krahenbuhl, Philipp (June 2022, CVPR)

Full Text Available
Multimodal Virtual Point 3D Detection

Yin, Tianwei; Zhou, Xingyi; Krähenbühl, Philipp (December 2021, Advances in neural information processing systems)

Full Text Available
Center-Based 3D Object Detection and Tracking

Yin, Tianwei; Zhou, Xingyi; Krähenbühl, Philipp (June 2021, IEEE Conference on Computer Vision and Pattern Recognition)
null (Ed.)
Three-dimensional objects are commonly represented as 3D boxes in a point-cloud. This representation mimics the well-studied image-based 2D bounding-box detection but comes with additional challenges. Objects in a 3D world do not follow any particular orientation, and box-based detectors have difficulties enumerating all orientations or fitting an axis-aligned bounding box to rotated objects. In this paper, we instead propose to represent, detect, and track 3D objects as points. Our framework, CenterPoint, first detects centers of objects using a keypoint detector and regresses to other attributes, including 3D size, 3D orientation, and velocity. In a second stage, it refines these estimates using additional point features on the object. In CenterPoint, 3D object tracking simplifies to greedy closest-point matching. The resulting detection and tracking algorithm is simple, efficient, and effective. CenterPoint achieved state-of-the-art performance on the nuScenes benchmark for both 3D detection and tracking, with 65.5 NDS and 63.8 AMOTA for a single model. On the Waymo Open Dataset, CenterPoint outperforms all previous single model methods by a large margin and ranks first among all Lidar-only submissions.
more » « less
Full Text Available
End-to-End Sequential Sampling and Reconstruction for MR Imaging

Yin, Tianwei; Wu, Zihui; Sun, He; Dalca, Adrian V; Yue, Yisong; Bouman, Katherine L (December 2021, Proceedings of the Machine Learning for Health Conference)

Accelerated MRI shortens acquisition time by subsampling in the measurement k-space. Recovering a high-fidelity anatomical image from subsampled measurements requires close cooperation between two components: (1) a sampler that chooses the subsampling pattern and (2) a reconstructor that recovers images from incomplete measurements. In this paper, we leverage the sequential nature of MRI measurements, and propose a fully differentiable framework that jointly learns a sequential sampling policy simultaneously with a reconstruction strategy. This co-designed framework is able to adapt during acquisition in order to capture the most informative measurements for a particular target (see the figure above). Experimental results on the fastMRI knee dataset demonstrate that the proposed approach successfully utilizes intermediate information during the sampling process to boost reconstruction performance. In particular, our proposed method outperforms the current state-of-the-art learned k-space sampling baseline on over 96% of test samples. We also investigate the individual and collective benefits of the sequential sampling and co-design strategies.
more » « less
Full Text Available

Search for: All records