NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Lossy Image Compression with Conditional Diffusion Models

Yang, Ruihan; Mandt, Stephan (December 2023, Advances in neural information processing systems)

Full Text Available
Diffusion Probabilistic Modeling for Video Generation

https://doi.org/10.3390/e25101469

Yang, Ruihan; Srivastava, Prakhar; Mandt, Stephan (October 2023, Entropy)

Denoising diffusion probabilistic models are a promising new class of generative models that mark a milestone in high-quality image generation. This paper showcases their ability to sequentially generate video, surpassing prior methods in perceptual and probabilistic forecasting metrics. We propose an autoregressive, end-to-end optimized video diffusion model inspired by recent advances in neural video compression. The model successively generates future frames by correcting a deterministic next-frame prediction using a stochastic residual generated by an inverse diffusion process. We compare this approach against six baselines on four datasets involving natural and simulation-based videos. We find significant improvements in terms of perceptual quality and probabilistic frame forecasting ability for all datasets.
more » « less
Full Text Available
Insights From Generative Modeling for Neural Video Compression

https://doi.org/10.1109/TPAMI.2023.3260684

Yang, Ruihan; Yang, Yibo; Marino, Joseph; Mandt, Stephan (August 2023, IEEE Transactions on Pattern Analysis and Machine Intelligence)

Full Text Available
SC2 Benchmark: Supervised Compression for Split Computing

Matsubara, Yoshimoto; Yang, Ruihan; Levorato, Marco; Mandt, Stephan (January 2023, Transactions on machine learning research)

Full Text Available
Hierarchical Autoregressive Modeling for Neural Video Compression

Yang, Ruihan; Yang, Yibo; Marino, Joseph; Mandt, Stephan (January 2021, International Conference on Learning Representations)
null (Ed.)
Recent work by Marino et al. (2020) showed improved performance in sequential density estimation by combining masked autoregressive flows with hierarchical latent variable models. We draw a connection between such autoregressive generative models and the task of lossy video compression. Specifically, we view recent neural video compression methods (Lu et al., 2019; Yang et al., 2020b; Agustsson et al., 2020) as instances of a generalized stochastic temporal autoregressive transform, and propose avenues for enhancement based on this insight. Comprehensive evaluations on large-scale video data show improved rate-distortion performance over both state-of-the-art neural and conventional video compression methods.
more » « less
Full Text Available

Search for: All records