NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

HMC: Reducing the number of rejections by not using leapfrog and some results on the acceptance rate

https://doi.org/10.1016/j.jcp.2021.110333

Calvo, M.P.; Sanz-Alonso, D.; Sanz-Serna, J.M. (July 2021, Journal of Computational Physics)
null (Ed.)
Full Text Available
Bayesian Update with Importance Sampling: Required Sample Size

https://doi.org/10.3390/e23010022

Sanz-Alonso, Daniel; Wang, Zijian (January 2021, Entropy)
null (Ed.)
Importance sampling is used to approximate Bayes’ rule in many computational approaches to Bayesian inverse problems, data assimilation and machine learning. This paper reviews and further investigates the required sample size for importance sampling in terms of the χ2-divergence between target and proposal. We illustrate through examples the roles that dimension, noise-level and other model parameters play in approximating the Bayesian update with importance sampling. Our examples also facilitate a new direct comparison of standard and optimal proposals for particle filtering.
more » « less
Full Text Available
Iterative ensemble Kalman methods: A unified perspective with some new variants

https://doi.org/10.3934/fods.2021011

Chada, Neil K.; Chen, Yuming; Sanz-Alonso, Daniel (January 2021, Foundations of Data Science)
null (Ed.)
Full Text Available
Data-driven forward discretizations for Bayesian inversion

https://doi.org/10.1088/1361-6420/abb2fa

Bigoni, D; Chen, Y; Trillos, N Garcia; Marzouk, Y; Sanz-Alonso, D (October 2020, Inverse Problems)
null (Ed.)
Full Text Available
On the consistency of graph-based Bayesian semi-supervised learning and the scalability of sampling algorithms

Garcia Trillos, N; Kaplan, Z; Samakhoana, T; Sanz-Alonso, D (March 2020, Journal of machine learning research)

This paper considers a Bayesian approach to graph-based semi-supervised learning. We show that if the graph parameters are suitably scaled, the graph-posteriors converge to a continuum limit as the size of the unlabeled data set grows. This consistency result has profound algorithmic implications: we prove that when consistency holds, carefully designed Markov chain Monte Carlo algorithms have a uniform spectral gap, independent of the number of unlabeled inputs. Numerical experiments illustrate and complement the theory.
more » « less
Full Text Available
Kernel Methods for Bayesian Elliptic Inverse Problems on Manifolds

https://doi.org/10.1137/19M1295222

Harlim, John; Sanz-Alonso, Daniel; Yang, Ruiyi (January 2020, SIAM/ASA Journal on Uncertainty Quantification)
null (Ed.)
Full Text Available
Local Regularization of Noisy Point Clouds: Improved Global Geometric Estimates and Data Analysis

Garcia Trillos, N; Sanz-Alonso, D; Yang, R (August 2019, Journal of machine learning research)

Several data analysis techniques employ similarity relationships between data points to uncover the intrinsic dimension and geometric structure of the underlying data-generating mechanism. In this paper we work under the model assumption that the data is made of random perturbations of feature vectors lying on a low-dimensional manifold. We study two questions: how to define the similarity relationships over noisy data points, and what is the resulting impact of the choice of similarity in the extraction of global geometric information from the underlying manifold. We provide concrete mathematical evidence that using a local regularization of the noisy data to define the similarity improves the ap- proximation of the hidden Euclidean distance between unperturbed points. Furthermore, graph-based objects constructed with the locally regularized similarity function satisfy bet- ter error bounds in their recovery of global geometric ones. Our theory is supported by numerical experiments that demonstrate that the gain in geometric understanding facili- tated by local regularization translates into a gain in classification accuracy in simulated and real data.
more » « less
Full Text Available
Variational Characterizations of Local Entropy and Heat Regularization in Deep Learning

https://doi.org/10.3390/e21050511

García Trillos, Nicolas; Kaplan, Zachary; Sanz-Alonso, Daniel (May 2019, Entropy)

The aim of this paper is to provide new theoretical and computational understanding on two loss regularizations employed in deep learning, known as local entropy and heat regularization. For both regularized losses, we introduce variational characterizations that naturally suggest a two-step scheme for their optimization, based on the iterative shift of a probability density and the calculation of a best Gaussian approximation in Kullback–Leibler divergence. Disregarding approximation error in these two steps, the variational characterizations allow us to show a simple monotonicity result for training error along optimization iterates. The two-step optimization schemes for local entropy and heat regularized loss differ only over which argument of the Kullback–Leibler divergence is used to find the best Gaussian approximation. Local entropy corresponds to minimizing over the second argument, and the solution is given by moment matching. This allows replacing traditional backpropagation calculation of gradients by sampling algorithms, opening an avenue for gradient-free, parallelizable training of neural networks. However, our presentation also acknowledges the potential increase in computational cost of naive optimization of regularized costs, thus giving a less optimistic view than existing works of the gains facilitated by loss regularization.
more » « less
Full Text Available

Search for: All records