Fast Margin Maximization via Dual Acceleration

Ji, Ziwei; Srebro, Nathan; Telgarsky, Matus

Citation Details

We present and analyze a momentum-based gradient method for training linear classifiers with an exponentially-tailed loss (eg, the exponential or logistic loss), which maximizes the classification margin on separable data at a rate of O (1/t^ 2). This contrasts with a rate of O (1/log (t)) for standard gradient descent, and O (1/t) for normalized gradient descent. The momentum-based method is derived via the convex dual of the maximum-margin problem, and specifically by applying Nesterov acceleration to this dual, which manages to result in a simple and intuitive method in the primal. This dual view can also be used to derive a stochastic variant, which performs adaptive non-uniform sampling via the dual variables. more »

Award ID(s):: 1934843

NSF-PAR ID:: 10287259

Author(s) / Creator(s):: Ji, Ziwei; Srebro, Nathan; Telgarsky, Matus

Date Published:: 2021-07-01

Journal Name:: Proceedings of Machine Learning Research

ISSN:: 2640-3498

Page Range / eLocation ID:: 4860-4869

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this