Search for: All records

Creators/Authors contains: "Wainwright, M. J."

« Prev Next »

Total Resources

7

Resource Type
Conference Paper

5

Conference Proceeding

0

Dataset

0

Journal Article

2

Workshop Report

0

Availability
Full Text / Resource Available

7

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

ROOT-SGD: Sharp Nonasymptotics and Asymptotic Efficiency in a Single Algorithm

Li, C.J. ; Mou, W. ; Wainwright, M. J. ; Jordan, M. I. ( July 2022 , Conference on Computational Learning Theory)

We study the problem of solving strongly convex and smooth unconstrained optimization problems using stochastic first-order algorithms. We devise a novel algorithm, referred to as \emph{Recursive One-Over-T SGD} (\ROOTSGD), based on an easily implementable, recursive averaging of past stochastic gradients. We prove that it simultaneously achieves state-of-the-art performance in both a finite-sample, nonasymptotic sense and an asymptotic sense. On the nonasymptotic side, we prove risk bounds on the last iterate of \ROOTSGD with leading-order terms that match the optimal statistical risk with a unity pre-factor, along with a higher-order term that scales at the sharp rate of O(n−3/2) under the Lipschitz condition on the Hessian matrix. On the asymptotic side, we show that when a mild, one-point Hessian continuity condition is imposed, the rescaled last iterate of (multi-epoch) \ROOTSGD converges asymptotically to a Gaussian limit with the Cram\'{e}r-Rao optimal asymptotic covariance, for a broad range of step-size choices.
more » « less
Full Text Available
ROOT-SGD: Sharp Nonasymptotics and Asymptotic Efficiency in a Single Algorithm

Li, C.J. ; Mou, W. ; Wainwright, M. J. ; Jordan, M. I. ( July 2022 , Conference on Computational Learning Theory)
Loh, P ; Raginsky, M. (Ed.)
We study the problem of solving strongly convex and smooth unconstrained optimization problems using stochastic first-order algorithms. We devise a novel algorithm, referred to as \emph{Recursive One-Over-T SGD} (\ROOTSGD), based on an easily implementable, recursive averaging of past stochastic gradients. We prove that it simultaneously achieves state-of-the-art performance in both a finite-sample, nonasymptotic sense and an asymptotic sense. On the nonasymptotic side, we prove risk bounds on the last iterate of \ROOTSGD with leading-order terms that match the optimal statistical risk with a unity pre-factor, along with a higher-order term that scales at the sharp rate of $O(n^{-3/2})$ under the Lipschitz condition on the Hessian matrix. On the asymptotic side, we show that when a mild, one-point Hessian continuity condition is imposed, the rescaled last iterate of (multi-epoch) \ROOTSGD converges asymptotically to a Gaussian limit with the Cram\'{e}r-Rao optimal asymptotic covariance, for a broad range of step-size choices.
more » « less
Full Text Available
Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning

Zanette, A. ; Wainwright, M. J. ( January 2022 , International Conference on Machine Learning)

Full Text Available
A new similarity measure for covariate shift with applications to nonparametric regression

Pathak, R. ; Ma, C. ; Wainwright, M. J. ( January 2022 , International Conference on Machine Learning)

Full Text Available
Is Temporal Difference Learning Optimal? {A}n Instance-Dependent Analysis

Khamaru, K ; Pananjady, A. ; Ruan, F. ; Wainwright, M. J. ; Jordan, M. I. ( January 2021 , SIAM journal on mathematics of data science)
null (Ed.)
Full Text Available
Fed{S}plit: {A}n algorithmic framework for fast federated optimization

Pathak, R. ; Wainwright, M. J. ( January 2020 , Advances in neural information processing systems)
null (Ed.)
Full Text Available
Fast mixing of Metropolized Hamiltonian Monte Carlo: Benefits of multi-step gradients

Chen, Y. ; Dwivedi, R. ; Wainwright, M. J. ; Yu, B. ( January 2020 , Journal of machine learning research)

Full Text Available