Conformal symplectic and relativistic optimization

França, Guilherme; Sulam, Jeremias; Robinson, Daniel P; Vidal, René

doi:10.1088/1742-5468/abcaee

Citation Details

Conformal symplectic and relativistic optimization

Abstract Arguably, the two most popular accelerated or momentum-based optimization methods in machine learning are Nesterov’s accelerated gradient and Polyaks’s heavy ball, both corresponding to different discretizations of a particular second order differential equation with friction. Such connections with continuous-time dynamical systems have been instrumental in demystifying acceleration phenomena in optimization. Here we study structure-preserving discretizations for a certain class of dissipative (conformal) Hamiltonian systems, allowing us to analyse the symplectic structure of both Nesterov and heavy ball, besides providing several new insights into these methods. Moreover, we propose a new algorithm based on a dissipative relativistic system that normalizes the momentum and may result in more stable/faster optimization. Importantly, such a method generalizes both Nesterov and heavy ball, each being recovered as distinct limiting cases, and has potential advantages at no additional cost. more »

Award ID(s):: 2031985

PAR ID:: 10428817

Author(s) / Creator(s):: França, Guilherme; Sulam, Jeremias; Robinson, Daniel P; Vidal, René

Date Published:: 2020-12-01

Journal Name:: Journal of Statistical Mechanics: Theory and Experiment

Volume:: 2020

Issue:: 12

ISSN:: 1742-5468

Page Range / eLocation ID:: 124008

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1088/1742-5468/abcaee

More Like this