Stochastic Newton Proximal Extragradient Method

Jiang, Ruichen; Dereziński, Michał; Mokhtari, Aryan

Citation Details

This content will become publicly available on December 10, 2025

Stochastic Newton Proximal Extragradient Method

Stochastic second-order methods are known to achieve fast local convergence in strongly convex optimization by relying on noisy Hessian estimates to precondition the gradient. Yet, most of these methods achieve superlinear convergence only when the stochastic Hessian noise diminishes, requiring an increase in the per-iteration cost as time progresses. Recent work in \cite{na2022hessian} addressed this issue via a Hessian averaging scheme that achieves a superlinear convergence rate without increasing the per-iteration cost. However, the considered method exhibits a slow global convergence rate, requiring up to ~O(κ^2) iterations to reach the superlinear rate of ~O((1/t)^{t/2}), where κ is the problem's condition number. In this paper, we propose a novel stochastic Newton proximal extragradient method that significantly improves these bounds, achieving a faster global linear rate and reaching the same fast superlinear rate in ~O(κ) iterations. We achieve this by developing a novel extension of the Hybrid Proximal Extragradient (HPE) framework, which simultaneously achieves fast global and local convergence rates for strongly convex functions with access to a noisy Hessian oracle. more »

Award ID(s):: 2338655

PAR ID:: 10582041

Author(s) / Creator(s):: Jiang, Ruichen; Dereziński, Michał; Mokhtari, Aryan

Publisher / Repository:: Advances in Neural Information Processing Systems (NeurIPS 2024)

Date Published:: 2024-12-10

Volume:: 37

Page Range / eLocation ID:: 90818--90852

Format(s):: Medium: X

Location:: Vancouver, Canada

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on December 10, 2025
Conference Paper:
The DOI is not currently available.

More Like this