An Adaptive Empirical Bayesian Method for Sparse Deep Learning

Wei Deng, Xiao Zhang

Citation Details

We propose a novel adaptive empirical Bayesian (AEB) method for sparse deep learning, where the sparsity is ensured via a class of self-adaptive spike-and-slab priors. The proposed method works by alternatively sampling from an adaptive hierarchical posterior distribution using stochastic gradient Markov Chain Monte Carlo (MCMC) and smoothly optimizing the hyperparameters using stochastic approximation (SA). We further prove the convergence of the proposed method to the asymptotically correct distribution under mild conditions. Empirical applications of the proposed method lead to the state-of-the-art performance on MNIST and Fashion MNIST with shallow convolutional neural networks (CNN) and the state-of-the-art compression performance on CIFAR10 with Residual Networks. The proposed method also improves resistance to adversarial attacks. more »

Award ID(s):: 1736364 1555072 1821233

PAR ID:: 10188426

Author(s) / Creator(s):: Wei Deng, Xiao Zhang

Date Published:: 2019-12-01

Journal Name:: Advances in neural information processing systems

Volume:: 32

Issue:: 1

ISSN:: 1049-5258

Page Range / eLocation ID:: 8794

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript
Conference Paper:
The DOI is not currently available.

More Like this