Rate-Regularization and Generalization in Variational Autoencoders

Bozkurt, A; Esmaeili, B.; Tristan, J.-B.; Brooks, D.; Dy, J.; van de Meent, J.-W.

Citation Details

Variational autoencoders (VAEs) optimize an objective that comprises a reconstruction loss (the distortion) and a KL term (the rate). The rate is an upper bound on the mutual information, which is often interpreted as a regularizer that controls the degree of compression. We here examine whether inclusion of the rate term also improves generalization. We perform rate-distortion analyses in which we control the strength of the rate term, the network capacity, and the difficulty of the generalization problem. Lowering the strength of the rate term paradoxically improves generalization in most settings, and reducing the mutual information typically leads to underfitting. Moreover, we show that generalization performance continues to improve even after the mutual information saturates, indicating that the gap on the bound (i.e. the KL divergence relative to the inference marginal) affects generalization. This suggests that the standard spherical Gaussian prior is not an inductive bias that typically improves generalization, prompting further work to understand what choices of priors improve generalization in VAEs. more »

Award ID(s):: 1901117

PAR ID:: 10280434

Author(s) / Creator(s):: Bozkurt, A; Esmaeili, B.; Tristan, J.-B.; Brooks, D.; Dy, J.; van de Meent, J.-W.

Editor(s):: Banerjee, A.; Fukumizu, K.

Date Published:: 2021-04-01

Journal Name:: Proceedings of The 24th International Conference on Artificial Intelligence and Statistics

Volume:: 130

Page Range / eLocation ID:: 3880--3888

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this