Logarithmic Regret from Sublinear Hints

Bhaskara, Aditya; Cutkosky, Ashok; Kumar, Ravi; Purohit, Manish

Citation Details

We consider the online linear optimization problem, where at every step the algorithm plays a point x_t in the unit ball, and suffers loss for some cost vector c_t that is then revealed to the algorithm. Recent work showed that if an algorithm receives a "hint" h_t that has non-trivial correlation with c_t before it plays x_t, then it can achieve a logarithmic regret guarantee, improving on the classical sqrt(T) bound. In this work, we study the question of whether an algorithm really requires a hint at every time step. Somewhat surprisingly, we show that an algorithm can obtain logarithmic regret with just O(sqrt(T)) hints under a natural query model. We give two applications of our result, to the well-studied setting of optimistic regret bounds and to the problem of online learning with abstention. more »

Award ID(s):: 2047288

PAR ID:: 10337201

Author(s) / Creator(s):: Bhaskara, Aditya; Cutkosky, Ashok; Kumar, Ravi; Purohit, Manish

Date Published:: 2021-12-01

Journal Name:: Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this