Online Prediction in Sub-linear Space

Peng, Binghui; Zhang, Fred


We provide the first sub-linear space and sub-linear regret algorithm for online learning with expert advice (against an oblivious adversary), addressing an open question raised recently by Srinivas, Woodruff, Xu and Zhou (STOC 2022). We also demonstrate a separation between oblivious and (strong) adaptive adversaries by proving a linear memory lower bound for any sub-linear regret algorithm against an adaptive adversary. Our algorithm is based on a novel pool selection procedure that bypasses the traditional wisdom of leader selection for online learning, and a generic reduction that transforms any algorithm with weakly sub-linear regret o(T) into an algorithm with regret T^{1-α}, which may be of independent interest. Our lower bound utilizes the connection between no-regret learning and equilibrium computation in zero-sum games, leading to a proof of a strong lower bound against an adaptive adversary.

Award ID(s): 2311648

PAR ID: 10491118

Author(s) / Creator(s): Peng, Binghui; Zhang, Fred

Publisher / Repository: SIAM

Date Published: 2023-01-01

Journal Name: Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA)

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Conference Proceeding:
The DOI is not currently available.
