NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Gain with no Pain: Efficiency of Kernel-PCA by Nyström Sampling

Sterge, N; Sriperumbudur, B. K.; Rosasco, L.; and Rudi, A. (August 2020, Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics)

In this paper, we analyze a Nyström based approach to efficient large scale kernel principal component analysis (PCA). The latter is a natural nonlinear extension of classical PCA based on considering a nonlinear feature map or the corresponding kernel. Like other kernel approaches, kernel PCA enjoys good mathematical and statistical properties but, numerically, it scales poorly with the sample size. Our analysis shows that Nyström sampling greatly improves computational efficiency without incurring any loss of statistical accuracy. While similar effects have been observed in supervised learning, this is the first such result for PCA. Our theoretical findings are based on a combination of analytic and concentration of measure techniques. Our study is more broadly motivated by the question of understanding the interplay between statistical and computational requirements for learning.
more » « less
Full Text Available
Gaussian Sketching yields a J-L Lemma in RKHS

Kpotufe, Samory; Sriperumbudur, Bharath K (June 2020, Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, PMLR)
Chiappa, Silvia; Calandra, Roberto (Ed.)
Full Text Available
On kernel derivative approximation with random Fourier features.

Szabo, Zoltan; Sriperumbudur, Bharath K. (April 2019, The 22nd International Conference on Artificial Intelligence and Statistics)

Random Fourier features (RFF) represent one of the most popular and wide-spread techniques in machine learning to scale up kernel algorithms. Despite the numerous successful applications of RFFs, unfortunately, quite little is understood theoretically on their optimality and limitations of their performance. Only recently, precise statisticalcomputational trade-offs have been established for RFFs in the approximation of kernel values, kernel ridge regression, kernel PCA and SVM classification. Our goal is to spark the investigation of optimality of RFFbased approximations in tasks involving not only function values but derivatives, which naturally lead to optimization problems with kernel derivatives. Particularly, in this paper, we focus on the approximation quality of RFFs for kernel derivatives and prove that the existing finite-sample guarantees can be improved exponentially in terms of the domain where they hold, using recent tools from unbounded empirical process theory. Our result implies that the same approximation guarantee is attainable for kernel derivatives using RFF as achieved for kernel values.
more » « less
Full Text Available
Convergence Analysis of Deterministic Kernel-Based Quadrature Rules in Misspecified Settings

https://doi.org/10.1007/s10208-018-09407-7

Kanagawa, Motonobu; Sriperumbudur, Bharath K.; Fukumizu, Kenji (January 2019, Foundations of Computational Mathematics)

This paper presents convergence analysis of kernel-based quadrature rules in misspecified settings, focusing on deterministic quadrature in Sobolev spaces. In particular, we deal with misspecified settings where a test integrand is less smooth than a Sobolev RKHS based on which a quadrature rule is constructed. We provide convergence guarantees based on two different assumptions on a quadrature rule: one on quadrature weights, and the other on design points. More precisely, we show that convergence rates can be derived (i) if the sum of absolute weights remains constant (or does not increase quickly), or (ii) if the minimum distance between design points does not decrease very quickly. As a consequence of the latter result, we derive a rate of convergence for Bayesian quadrature in misspecified settings. We reveal a condition on design points to make Bayesian quadrature robust to misspecification, and show that, under this condition, it may adaptively achieve the optimal rate of convergence in the Sobolev space of a lesser order (i.e., of the unknown smoothness of a test integrand), under a slightly stronger regularity condition on the integrand.
more » « less
Full Text Available
Characteristic and Universal Tensor Product Kernels

Szabo, Z; Sriperumbudur, B. (January 2018, Journal of machine learning research)

Maximum mean discrepancy (MMD), also called energy distance or N-distance in statistics and Hilbert-Schmidt independence criterion (HSIC), specifically distance covariance in statistics, are among the most popular and successful approaches to quantify the difference and independence of random variables, respectively. Thanks to their kernel-based foundations, MMD and HSIC are applicable on a wide variety of domains. Despite their tremendous success, quite little is known about when HSIC characterizes independence and when MMD with tensor product kernel can discriminate probability distributions. In this paper, we answer these questions by studying various notions of the characteristic property of the tensor product kernel.
more » « less
Full Text Available
Optimal prediction for additive function-on-function regression

https://doi.org/10.1214/18-EJS1505

Reimherr, Matthew; Sriperumbudur, Bharath; Taoufik, Bahaeddine (January 2018, Electronic Journal of Statistics)

Full Text Available

Search for: All records