Online Learning with Optimism and Delay

Flaspohler, Genevieve E; Orabona, Francesco; Cohen, Judah; Mouatadid, Soukayna; Oprescu, Miruna; Orenstein, Paulo; Mackey, Lester


Inspired by the demands of real-time climate and weather forecasting, we develop optimistic online learning algorithms that require no parameter tuning and have optimal regret guarantees under delayed feedback. Our algorithms (DORM, DORM+, and AdaHedgeD) arise from a novel reduction of delayed online learning to optimistic online learning that reveals how optimistic hints can mitigate the regret penalty caused by delay. We pair this delay-as-optimism perspective with a new analysis of optimistic learning that exposes its robustness to hinting errors and a new meta-algorithm for learning effective hinting strategies in the presence of delay. We conclude by benchmarking our algorithms on four subseasonal climate forecasting tasks, demonstrating low regret relative to state-of-the-art forecasting models.

Award ID(s): 1908111; 2022446; 1925930

NSF-PAR ID: 10310892

Author(s) / Creator(s): Flaspohler, Genevieve E; Orabona, Francesco; Cohen, Judah; Mouatadid, Soukayna; Oprescu, Miruna; Orenstein, Paulo; Mackey, Lester

Editor(s): Meila, Marina; Zhang, Tong

Date Published: 2021-07-01

Journal Name: Proceedings of Machine Learning Research

Volume: 139

ISSN: 2640-3498

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper: The DOI is not currently available.
