Big-Step-Little-Step: Gradient Methods for Objectives with Multiple Scales

Kelner, Jonathan; Marsden, Annie; Sharan, Vatsal; Sidford, Aaron; Valiant, Gregory; Yuan, Honglin

We provide new gradient-based methods for efficiently solving a broad class of ill-conditioned optimization problems. We consider the problem of minimizing a function f : R^d → R that is implicitly decomposable as the sum of m unknown, non-interacting, smooth, strongly convex functions, and we provide a method that solves this problem with a number of gradient evaluations that scales (up to logarithmic factors) as the product of the square roots of the condition numbers of the components. This complexity bound (which we prove is nearly optimal) can improve almost exponentially on that of accelerated gradient methods, whose complexity grows as the square root of the condition number of f. Additionally, we provide efficient methods for solving stochastic, quadratic variants of this multiscale optimization problem. Rather than learn the decomposition of f (which would be prohibitively expensive), our methods apply a clean recursive “Big-Step-Little-Step” interleaving of standard methods. The resulting algorithms use Õ(dm) space, are numerically stable, and open the door to a more fine-grained understanding of the complexity of convex optimization beyond condition number.
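The interleaving idea can be illustrated on a toy problem. The sketch below is not the paper's algorithm: it is a minimal, non-recursive, unaccelerated two-scale version on a diagonal quadratic. The curvature values, both step sizes, and the number of little steps per round are hand-picked assumptions, and the names `big_step_little_step`, `plain_gd`, and `CURVATURES` are hypothetical. It only shows why one large step followed by a few small ones can beat gradient descent that uses the single globally safe step size under the same gradient budget.

```python
# Toy illustration only: f(x) = 0.5 * sum(h_i * x_i^2), with curvatures h_i
# that fall into two well-separated scales (assumed values, not from the paper).
CURVATURES = [1.0, 2.0, 1000.0, 2000.0]


def grad(x):
    """Gradient of the toy quadratic; the methods below only use this oracle."""
    return [h * xi for h, xi in zip(CURVATURES, x)]


def gd_step(x, step_size):
    """One plain gradient descent step."""
    return [xi - step_size * gi for xi, gi in zip(x, grad(x))]


def norm(x):
    """Euclidean norm, used as distance to the minimizer at the origin."""
    return sum(xi * xi for xi in x) ** 0.5


def big_step_little_step(x, rounds, big, little, n_little):
    """Each round: one big step, then n_little little steps (assumed schedule)."""
    for _ in range(rounds):
        # Big step: fast progress on the flat scale, but overshoots the steep scale.
        x = gd_step(x, big)
        # Little steps: damp the error that the big step amplified on the steep scale.
        for _ in range(n_little):
            x = gd_step(x, little)
    return x


def plain_gd(x, steps, step_size):
    """Baseline: gradient descent with one fixed step size."""
    for _ in range(steps):
        x = gd_step(x, step_size)
    return x


x0 = [1.0, 1.0, 1.0, 1.0]
rounds, n_little = 20, 10
budget = rounds * (1 + n_little)  # same number of gradient evaluations for both

# Step sizes are 1 / (largest curvature of each scale), assumed known here.
x_bsls = big_step_little_step(x0, rounds, big=1 / 2.0, little=1 / 2000.0,
                              n_little=n_little)
x_gd = plain_gd(x0, budget, step_size=1 / 2000.0)

print("interleaved relative error:", norm(x_bsls) / norm(x0))
print("plain GD relative error:   ", norm(x_gd) / norm(x0))
```

On this instance the interleaved schedule shrinks the error by several orders of magnitude, while plain gradient descent with the globally safe step size barely moves the two flat coordinates. Ten little steps per round suffice here because each one halves the error on the curvature-1000 coordinate, which offsets the roughly 500-fold overshoot caused by the big step.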

Award ID(s): 1813049, 1704417

PAR ID: 10354704

Date Published: 2022-01-01

Journal Name: Conference on Learning Theory (COLT)

Format(s): Medium: X

Sponsoring Org: National Science Foundation
