NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The complexity of non-stationary reinforcement learning

Peng, Binghui; Papadimitriou, Christos (March 2024, PMLR)

Full Text Available
The complexity of non-stationary reinforcement learning

Peng, Binghui; Papadimitriou, Christos H (February 2024, International Conference on Algorithmic Learning Theory)

Full Text Available
On limitations of the transformer architecture

Peng, Binghui; Narayanan, Srini; Papadimitriou, Christos (February 2024, Collegium Beatus Rhenanus)

Full Text Available
Memory-Query Tradeoffs for Randomized Convex Optimization

https://doi.org/10.1109/FOCS57990.2023.00086

Chen, Xi; Peng, Binghui (November 2023, Proceedings of the 64th IEEE Symposium on Foundations of Computer Science)
Complexity of Equilibria in First-Price Auctions under General Tie-Breaking Rules

https://doi.org/10.1145/3564246.3585195

Chen, Xi; Peng, Binghui (June 2023, ACM)

Full Text Available
Public goods games in directed networks

https://doi.org/10.1016/j.geb.2023.02.002

Papadimitriou, Christos; Peng, Binghui (May 2023, Games and Economic Behavior)

Full Text Available
Online Prediction in Sub-linear Space

Peng, Binghui; Zhang, Fred (January 2023, Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA))

We provide the first sub-linear space and sub-linear regret algorithm for online learning with expert advice (against an oblivious adversary), addressing an open question raised recently by Srinivas, Woodruff, Xu and Zhou (STOC 2022). We also demonstrate a separation between oblivious and (strong) adaptive adversaries by proving a linear memory lower bound of any sub-linear regret algorithm against an adaptive adversary. Our algorithm is based on a novel pool selection procedure that bypasses the traditional wisdom of leader selection for online learning, and a generic reduction that transforms any weakly sub-linear regret o(T) algorithm to T1-α regret algorithm, which may be of independent interest. Our lower bound utilizes the connection of no-regret learning and equilibrium computation in zero-sum games, leading to a proof of a strong lower bound against an adaptive adversary.
more » « less
Memory Bounds for Continual Learning

https://doi.org/10.1109/FOCS54457.2022.00056

Chen, Xi; Papadimitriou, Christos; Peng, Binghui (October 2022, In Proceedings of the 63rd IEEE Symposium on Foundations of Computer Science (FOCS))

Full Text Available
On the complexity of dynamic submodular maximization

https://doi.org/10.1145/3519935.3519951

Chen, Xi; Peng, Binghui (January 2022, Proceedings of the 54th ACM Symposium on Theory of Computing (STOC 22'))

Full Text Available
Computational Hardness of the Hylland-Zeckhauser Scheme

https://doi.org/10.1137/1.9781611977073.90

Chen, Thomas; Chen, Xi; Peng, Binghui; Yannakakis, Mihalis (January 2022, Annual ACM-SIAM Symposium on Discrete Algorithms)

Full Text Available

« Prev Next »

Search for: All records