Mechanic: A Learning Rate Tuner

Cutkosky, Ashok; Defazio, Aaron; Mehta, Harsh

Citation Details

We introduce a technique for tuning the learning rate scale factor of any base optimization algorithm and schedule automatically, which we call Mechanic. Our method provides a practical realization of recent theoretical reductions for accomplishing a similar goal in online convex optimization. We rigorously evaluate Mechanic on a range of large scale deep learning tasks with varying batch sizes, schedules, and base optimization algorithms. These experiments demonstrate that depending on the problem, Mechanic either comes very close to, matches or even improves upon manual tuning of learning rates. more »

Award ID(s):: 2211718 2022446

PAR ID:: 10524750

Author(s) / Creator(s):: Cutkosky, Ashok; Defazio, Aaron; Mehta, Harsh

Publisher / Repository:: Advances in neural information processing systems (NeurIPS)

Date Published:: 2023-12-10

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this