NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Computationally-Efficient Neural Image Compression with Shallow Decoders

Yang, Yibo; Mandt, Stephan (October 2023, Proceedings)

Full Text Available
Computationally-Efficient Neural Image Compression with Shallow Decoders

Yang, Yibo; Mandt, Stephan (October 2023, International Conference on Computer Vision)

Full Text Available
Estimating the Rate-Distortion Function by Wasserstein Gradient Descent

Yang, Yibo; Eckstein, Stephan; Nutz, Marcel; Mandt, Stephan (December 2023, Advances in neural information processing systems)

Full Text Available
Estimating the Rate-Distortion Function by Wasserstein Gradient Descent 37th Conference on Neural Information Processing Systems (NeurIPS)

Yang, Yibo; Eckstein, Stephan; Nutz, Marcel; Mandt, Stephan (November 2023, 37th Conference on Neural Information Processing Systems (NeurIPS))
A. Oh; T. Naumann; A. Globerson; K. Saenko; M. Hardt; S. Levine (Ed.)
In the theory of lossy compression, the rate-distortion (R-D) function R(D) describes how much a data source can be compressed (in bit-rate) at any given level of fidelity (distortion). Obtaining R(D) for a given data source establishes the fundamental performance limit for all compression algorithms. We propose a new method to estimate R(D) from the perspective of optimal transport. Unlike the classic Blahut--Arimoto algorithm which fixes the support of the reproduction distribution in advance, our Wasserstein gradient descent algorithm learns the support of the optimal reproduction distribution by moving particles. We prove its local convergence and analyze the sample complexity of our R-D estimator based on a connection to entropic optimal transport. Experimentally, we obtain comparable or tighter bounds than state-of-the-art neural network methods on low-rate sources while requiring considerably less tuning and computation effort. We also highlight a connection to maximum-likelihood deconvolution and introduce a new class of sources that can be used as test cases with known solutions to the R-D problem.
more » « less
Full Text Available
Insights From Generative Modeling for Neural Video Compression

https://doi.org/10.1109/TPAMI.2023.3260684

Yang, Ruihan; Yang, Yibo; Marino, Joseph; Mandt, Stephan (August 2023, IEEE Transactions on Pattern Analysis and Machine Intelligence)

Full Text Available
Learning to simulate high energy particle collisions from unlabeled data

https://doi.org/10.1038/s41598-022-10966-7

Howard, Jessica N.; Mandt, Stephan; Whiteson, Daniel; Yang, Yibo (December 2022, Scientific Reports)

Abstract In many scientific fields which rely on statistical inference, simulations are often used to map from theoretical models to experimental data, allowing scientists to test model predictions against experimental results. Experimental data is often reconstructed from indirect measurements causing the aggregate transformation from theoretical models to experimental data to be poorly-described analytically. Instead, numerical simulations are used at great computational cost. We introduce Optimal-Transport-based Unfolding and Simulation (OTUS), a fast simulator based on unsupervised machine-learning that is capable of predicting experimental data from theoretical models. Without the aid of current simulation information, OTUS trains a probabilistic autoencoder to transform directly between theoretical models and experimental data. Identifying the probabilistic autoencoder’s latent space with the space of theoretical models causes the decoder network to become a fast, predictive simulator with the potential to replace current, computationally-costly simulators. Here, we provide proof-of-principle results on two particle physics examples, Z -boson and top-quark decays, but stress that OTUS can be widely applied to other fields.
more » « less
Full Text Available
Ultrafast Nanoimaging of Electronic Coherence of Monolayer WSe ₂

https://doi.org/10.1021/acs.nanolett.2c04536

Luo, Wenjin; Whetten, Benjamin G.; Kravtsov, Vasily; Singh, Ashutosh; Yang, Yibo; Huang, Di; Cheng, Xinbin; Jiang, Tao; Belyanin, Alexey; Raschke, Markus B. (March 2023, Nano Letters)

Full Text Available
Towards Empirical Sandwich Bounds on the Rate-Distortion Function

Yang, Yibo; Mandt, Stephan (January 2022, International Conference on Learning Representations)

Full Text Available
Hierarchical Autoregressive Modeling for Neural Video Compression

Yang, Ruihan; Yang, Yibo; Marino, Joseph; Mandt, Stephan (January 2021, International Conference on Learning Representations)
null (Ed.)
Recent work by Marino et al. (2020) showed improved performance in sequential density estimation by combining masked autoregressive flows with hierarchical latent variable models. We draw a connection between such autoregressive generative models and the task of lossy video compression. Specifically, we view recent neural video compression methods (Lu et al., 2019; Yang et al., 2020b; Agustsson et al., 2020) as instances of a generalized stochastic temporal autoregressive transform, and propose avenues for enhancement based on this insight. Comprehensive evaluations on large-scale video data show improved rate-distortion performance over both state-of-the-art neural and conventional video compression methods.
more » « less
Full Text Available
Improving Inference for Neural Image Compression

Yang, Yibo; Bamler, Robert; Mandt, Stephan (October 2020, Advances in neural information processing systems)
null (Ed.)
We consider the problem of lossy image compression with deep latent variable models. State-of-the-art methods [Ballé et al., 2018, Minnen et al., 2018, Lee et al., 2019] build on hierarchical variational autoencoders (VAEs) and learn inference networks to predict a compressible latent representation of each data point. Drawing on the variational inference perspective on compression [Alemi et al., 2018], we identify three approximation gaps which limit performance in the conventional approach: an amortization gap, a discretization gap, and a marginalization gap. We propose remedies for each of these three limitations based on ideas related to iterative inference, stochastic annealing for discrete optimization, and bits-back coding, resulting in the first application of bits-back coding to lossy compression. In our experiments, which include extensive baseline comparisons and ablation studies, we achieve new state-of-the-art performance on lossy image compression using an established VAE architecture, by changing only the inference method.
more » « less
Full Text Available

« Prev Next »

Search for: All records