Title: Diffusion Probabilistic Modeling for Video Generation
Denoising diffusion probabilistic models are a promising new class of generative models that mark a milestone in high-quality image generation. This paper showcases their ability to sequentially generate video, surpassing prior methods in perceptual and probabilistic forecasting metrics. We propose an autoregressive, end-to-end optimized video diffusion model inspired by recent advances in neural video compression. The model successively generates future frames by correcting a deterministic next-frame prediction using a stochastic residual generated by an inverse diffusion process. We compare this approach against six baselines on four datasets involving natural and simulation-based videos. We find significant improvements in terms of perceptual quality and probabilistic frame forecasting ability for all datasets.
Award ID(s): 2047418
PAR ID: 10550282
Author(s) / Creator(s): ; ;
Publisher / Repository: MDPI
Date Published:
Journal Name: Entropy
Volume: 25
Issue: 10
ISSN: 1099-4300
Page Range / eLocation ID: 1469
Format(s): Medium: X
Sponsoring Org: National Science Foundation
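The generation loop described in the abstract above lends itself to a short sketch. Below is a minimal, illustrative PyTorch-style rendering of autoregressive generation in which a deterministic next-frame prediction is corrected by a diffusion-sampled residual; the names predictor and denoiser, the two-frame context window, and the linear noise schedule are all assumptions for illustration, not the paper's actual architecture.

import torch

def generate_video(predictor, denoiser, context, num_frames, num_steps=50):
    # Autoregressively generate frames: a deterministic next-frame guess
    # is corrected by a stochastic residual drawn via reverse diffusion.
    frames = list(context)                          # seed with observed frames
    betas = torch.linspace(1e-4, 0.02, num_steps)   # illustrative noise schedule
    alphas = 1.0 - betas
    alpha_bars = torch.cumprod(alphas, dim=0)
    for _ in range(num_frames):
        mu = predictor(torch.stack(frames[-2:]))    # deterministic prediction
        r = torch.randn_like(mu)                    # residual starts as pure noise
        for t in reversed(range(num_steps)):        # reverse diffusion (DDPM-style)
            eps = denoiser(r, mu, t)                # noise estimate, conditioned on mu
            r = (r - betas[t] / (1 - alpha_bars[t]).sqrt() * eps) / alphas[t].sqrt()
            if t > 0:
                r = r + betas[t].sqrt() * torch.randn_like(r)
        frames.append(mu + r)                       # corrected next frame
    return torch.stack(frames[len(context):])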
More Like this
  1. Kehtarnavaz, Nasser; Shirvaikar, Mukul V (Ed.)
    Recent diffusion-based generative models employ methods such as one-shot fine-tuning of an image diffusion model for video generation. However, this leads to long generation times and suboptimal efficiency. Zero-shot text-to-video models eliminate fine-tuning entirely and can generate novel videos from a text prompt alone. While zero-shot generation greatly reduces generation time, many models rely on inefficient cross-frame attention processors, hindering the diffusion model's use for real-time video generation. We address this issue by introducing more efficient attention processors to a video diffusion model. Specifically, we use attention processors (i.e., xFormers, FlashAttention, and HyperAttention) that are highly optimized for efficiency and hardware parallelization. We then apply these processors to a video generator and test with both older diffusion models such as Stable Diffusion 1.5 and newer, high-quality models such as Stable Diffusion XL. Our results show that efficient attention processors alone reduce generation time by around 25% with no change in video quality. Combined with higher-quality models, efficient attention processors in zero-shot generation yield a substantial efficiency and quality increase, greatly expanding the video diffusion model's applicability to real-time video generation.
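    As a concrete illustration of this kind of processor swap, the Hugging Face diffusers library exposes it in a few lines. The sketch below is a minimal example under stated assumptions (the model ID, the prompt, and the availability of xFormers or PyTorch 2.x scaled-dot-product attention all depend on your environment); it is not the paper's exact setup.

    import torch
    from diffusers import StableDiffusionPipeline
    from diffusers.models.attention_processor import AttnProcessor2_0

    # Load Stable Diffusion 1.5, one of the older models mentioned above.
    pipe = StableDiffusionPipeline.from_pretrained(
        "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
    ).to("cuda")

    # Option 1: xFormers memory-efficient attention (needs `pip install xformers`).
    pipe.enable_xformers_memory_efficient_attention()

    # Option 2: PyTorch 2.x scaled-dot-product attention, which dispatches to
    # FlashAttention kernels on supported GPUs.
    pipe.unet.set_attn_processor(AttnProcessor2_0())

    image = pipe("a coral reef, underwater photography").images[0]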
  2. We present Accel, a novel semantic video segmentation system that achieves high accuracy at low inference cost by combining the predictions of two network branches: (1) a reference branch that extracts high-detail features on a reference keyframe, and warps these features forward using frame-to-frame optical flow estimates, and (2) an update branch that computes features of adjustable quality on the current frame, performing a temporal update at each video frame. The modularity of the update branch, where feature subnetworks of varying layer depth can be inserted (e.g., ResNet-18 to ResNet-101), enables operation over a new, state-of-the-art accuracy-throughput trade-off spectrum. Over this curve, Accel models achieve both higher accuracy and faster inference times than the closest comparable single-frame segmentation networks. In general, Accel significantly outperforms previous work on efficient semantic video segmentation, correcting warping-related error that compounds on datasets with complex dynamics. Accel is end-to-end trainable and highly modular: the reference network, the optical flow network, and the update network can each be selected independently, depending on application requirements, and then jointly fine-tuned. The result is a robust, general system for fast, high-accuracy semantic segmentation on video.
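    The warping step at the heart of the reference branch can be sketched compactly. Below is a minimal, illustrative implementation of flow-based feature warping using torch.nn.functional.grid_sample; the fusion with the update branch and all architectural choices are omitted, so this should be read as a sketch of the technique rather than Accel's actual code.

    import torch
    import torch.nn.functional as F

    def warp_features(feat, flow):
        # feat: (N, C, H, W) features from the reference keyframe.
        # flow: (N, 2, H, W) frame-to-frame optical flow in pixels.
        n, _, h, w = feat.shape
        ys, xs = torch.meshgrid(
            torch.arange(h, dtype=feat.dtype),
            torch.arange(w, dtype=feat.dtype),
            indexing="ij",
        )
        grid = torch.stack((xs, ys), dim=0).unsqueeze(0) + flow  # sample positions
        gx = 2.0 * grid[:, 0] / (w - 1) - 1.0     # normalize x to [-1, 1]
        gy = 2.0 * grid[:, 1] / (h - 1) - 1.0     # normalize y to [-1, 1]
        grid = torch.stack((gx, gy), dim=-1)      # (N, H, W, 2) for grid_sample
        return F.grid_sample(feat, grid, align_corners=True)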
  3. Abstract Volatility forecasting is important in financial econometrics and is mainly based on the application of various GARCH-type models. However, it is difficult to choose a specific GARCH model that works uniformly well across datasets, and the traditional methods are unstable when dealing with highly volatile or short datasets. The newly proposed normalizing and variance stabilizing (NoVaS) method is a more robust and accurate prediction technique that can help with such datasets. This model-free method was originally developed by taking advantage of an inverse transformation based on the framework of the ARCH model. In this study, we conduct extensive empirical and simulation analyses to investigate whether it provides higher-quality long-term volatility forecasting than standard GARCH models. We find this advantage to be more prominent with short and volatile data. Next, we propose a variant of the NoVaS method that possesses a more complete form and generally outperforms the current state-of-the-art NoVaS method. The uniformly superior performance of NoVaS-type methods encourages their wide application in volatility forecasting. Our analyses also highlight the flexibility of the NoVaS idea, which allows the exploration of other model structures to improve existing models or solve specific prediction problems.
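    The core transformation is simple enough to sketch. The snippet below implements only the simple equal-weight variant of the NoVaS transform (studentizing each return by a trailing window of squared returns, including the current one); the weight optimization and the inversion step used for actual forecasting are omitted, and the window length p is an illustrative choice.

    import numpy as np

    def novas_transform(x, p=10):
        # Map returns x_t to W_t = x_t / sqrt(mean(x_{t-p}^2, ..., x_t^2)),
        # the simple equal-weight NoVaS studentization.
        x = np.asarray(x, dtype=float)
        w = np.empty(len(x) - p)
        for t in range(p, len(x)):
            denom = np.mean(x[t - p : t + 1] ** 2)   # includes x_t itself
            w[t - p] = x[t] / np.sqrt(denom)
        return w   # ideally close to i.i.d. after a good choice of p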
  4. This paper addresses the challenge of precisely swapping objects in videos, particularly those involved in hand-object interactions (HOI), using a single user-provided reference object image. While diffusion models have advanced video editing, they struggle with the complexities of HOI, often failing to generate realistic edits when object swaps involve changes in shape or functionality. To overcome this, the authors propose HOI-Swap, a novel diffusion-based video editing framework trained in a self-supervised manner. The framework operates in two stages: (1) single-frame object swapping with HOI awareness, where the model learns to adjust interaction patterns (e.g., hand grasp) based on object property changes; and (2) sequence-wide extension, where motion alignment is achieved by warping a sequence from the edited frame using sampled motion points and conditioning generation on the warped sequence. Extensive qualitative and quantitative evaluations demonstrate that HOI-Swap significantly outperforms prior methods, producing high-quality, realistic HOI video edits. 
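    The two-stage control flow can be summarized in a short sketch. Every component below (swap_single_frame, track_points, warp_sequence, video_diffusion_edit) is a hypothetical stand-in for one of the paper's trained models, passed in as a parameter; only the staging is illustrated.

    def hoi_swap(video, reference_object, swap_single_frame, track_points,
                 warp_sequence, video_diffusion_edit, anchor_idx=0):
        # Stage 1: HOI-aware single-frame swap; the model adjusts the hand
        # grasp to the new object's shape and function.
        edited_frame = swap_single_frame(video[anchor_idx], reference_object)

        # Stage 2: sample motion points from the source video, warp the edited
        # frame along those tracks, and condition generation on the warped
        # sequence to keep the edit aligned with the original motion.
        tracks = track_points(video)
        warped = warp_sequence(edited_frame, tracks)
        return video_diffusion_edit(video, warped)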
  5. Abstract Advances in visual perceptual tasks have been mainly driven by the amount, and types, of annotations of large-scale datasets. Researchers have focused on fully-supervised settings to train models using offline epoch-based schemes. Despite the evident advancements, limitations and cost of manually annotated datasets have hindered further development for event perceptual tasks, such as detection and localization of objects and events in videos. The problem is more apparent in zoological applications due to the scarcity of annotations and length of videos: most videos are at most ten minutes long. Inspired by cognitive theories, we present a self-supervised perceptual prediction framework to tackle the problem of temporal event segmentation by building a stable representation of event-related objects. The approach is simple but effective. We rely on LSTM predictions of high-level features computed by a standard deep learning backbone. For spatial segmentation, the stable representation of the object is used by an attention mechanism to filter the input features before the prediction step. The self-learned attention maps effectively localize the object as a side effect of perceptual prediction. We demonstrate our approach on long videos from continuous wildlife video monitoring, spanning multiple days at 25 FPS. We aim to facilitate automated ethogramming by detecting and localizing events without the need for labels. Our approach is trained in an online manner on streaming input and requires only a single pass through the video, with no separate training set. Given the lack of long and realistic datasets that include real-world challenges, we introduce a new wildlife video dataset, nest monitoring of the Kagu (a flightless bird from New Caledonia), to benchmark our approach. Our dataset features a video from 10 days (over 23 million frames) of continuous monitoring of the Kagu in its natural habitat. We annotate every frame with bounding boxes and event labels. Additionally, each frame is annotated with time-of-day and illumination conditions. We will make the dataset, which is the first of its kind, and the code available to the research community. We find that the approach significantly outperforms other self-supervised baselines, both traditional (e.g., optical flow, background subtraction) and NN-based (e.g., PA-DPC, DINO, iBOT), and performs on par with supervised boundary detection approaches (i.e., PC). At a recall rate of 80%, our best performing model detects one false positive activity every 50 min of training. On average, we at least double the performance of self-supervised approaches for spatial segmentation. Additionally, we show that our approach is robust to various environmental conditions (e.g., moving shadows). We also benchmark the framework on other datasets (i.e., Kinetics-GEBD, TAPOS) from different domains to demonstrate its generalizability. The data and code are available on our project page: https://aix.eng.usf.edu/research_automated_ethogramming.html
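    The prediction-error signal that drives the temporal segmentation can be sketched in a few lines of PyTorch. The backbone choice, feature dimension, and boundary rule below are illustrative stand-ins, and the paper's attention mechanism for spatial segmentation is omitted.

    import torch
    import torch.nn as nn

    class PerceptualPredictor(nn.Module):
        def __init__(self, feat_dim=512, hidden=256):
            super().__init__()
            self.lstm = nn.LSTMCell(feat_dim, hidden)
            self.head = nn.Linear(hidden, feat_dim)

        def forward(self, feats):
            # feats: (T, feat_dim) per-frame features from a frozen backbone.
            h = feats.new_zeros(1, self.lstm.hidden_size)
            c = feats.new_zeros(1, self.lstm.hidden_size)
            errors = []
            for t in range(len(feats) - 1):
                h, c = self.lstm(feats[t : t + 1], (h, c))
                pred = self.head(h)                       # predicted next features
                errors.append((pred - feats[t + 1]).pow(2).mean())
            # Spikes in prediction error mark candidate event boundaries; in the
            # online setting the same error serves as the training loss, so no
            # labels are required.
            return torch.stack(errors)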