Power of Hints for Online Learning with Movement Costs

Bhaskara, Aditya; Cutkosky, Ashok; Kumar, Ravi; Purohit, Manish

Citation Details

We consider the online linear optimization problem with movement costs, a variant of online learning in which the learner must not only respond to cost vectors c_t with points x_t in order to maintain low regret, but is also penalized for movement by an additional cost. Classically, simple algorithms that obtain the optimal sqrt(T) regret already are very stable and do not incur a significant movement cost. However, recent work has shown that when the learning algorithm is provided with weak "hint" vectors that have a positive correlation with the costs, the regret can be significantly improved to log(T). In this work, we study the stability of such algorithms, and provide matching upper and lower bounds showing that incorporating movement costs results in intricate tradeoffs logarithmic and sqrt(T) regret. more »

Award ID(s):: 2047288

PAR ID:: 10337202

Author(s) / Creator(s):: Bhaskara, Aditya; Cutkosky, Ashok; Kumar, Ravi; Purohit, Manish

Date Published:: 2021-01-01

Journal Name:: Proceedings of The 24th International Conference on Artificial Intelligence and Statistics

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this