NSF PAR Search | NSF Public Access Repository

The Price of Implicit Bias in Adversarially Robust Generalization

Tsilivis, Nikolaos; Frank, Natalie; Srebro, Nathan; Kempe, Julia (April 2025, The International Conference on Learning Representations (ICLR 2025))

We study the implicit bias of optimization in robust empirical risk minimization (robust ERM) and its connection with robust generalization. In classification settings under adversarial perturbations with linear models, we study what type of regularization should ideally be applied for a given perturbation set to improve (robust) generalization. We then show that the implicit bias of optimization in robust ERM can significantly affect the robustness of the model and identify two ways this can happen: through the optimization algorithm or through the architecture. We verify our predictions in simulations with synthetic data and experimentally study the importance of implicit bias in robust ERM with deep neural networks.
Free, publicly-accessible full text available April 24, 2026
When is Agnostic Reinforcement Learning Statistically Tractable?

Jia, Zeyu; Li, Gene; Rakhlin, Alexander; Sekhari, Ayush; Srebro, Nathan (March 2025, Advances in Neural Information Processing Systems 36)
Oh, A; Naumann, T; Globerson, A; Saenko, K; Hardt, M; Levine, S (Ed.)
We study the problem of agnostic PAC reinforcement learning (RL): given a policy class Pi, how many rounds of interaction with an unknown MDP (with a potentially large state and action space) are required to learn an epsilon-suboptimal policy with respect to Pi? Towards that end, we introduce a new complexity measure, called the spanning capacity, that depends solely on the set Pi and is independent of the MDP dynamics. With a generative model, we show that the spanning capacity characterizes PAC learnability for every policy class Pi. However, for online RL, the situation is more subtle. We show there exists a policy class Pi with a bounded spanning capacity that requires a superpolynomial number of samples to learn. This reveals a surprising separation for agnostic learnability between generative access and online access models (as well as between deterministic/stochastic MDPs under online access). On the positive side, we identify an additional sunflower structure which in conjunction with bounded spanning capacity enables statistically efficient online RL via a new algorithm called POPLER, which takes inspiration from classical importance sampling methods as well as recent developments for reachable-state identification and policy evaluation in reward-free exploration.
Free, publicly-accessible full text available March 30, 2026
Depth Separation in Norm-Bounded Infinite-Width Neural Networks

Parkinson, Suzanna; Ongie, Greg; Willett, Rebecca; Shamir, Ohad; Srebro, Nathan (June 2024, 37th Annual Conference on Learning Theory)

Full Text Available
Metalearning with Very Few Samples Per Task

Aliakbarpour, Maryam; Bairaktari, Konstantina; Brown, Gavin; Smith, Adam; Srebro, Nathan; Ullman, Jonathan (June 2024, Proceedings of Machine Learning Research)

Full Text Available
Metalearning with Very Few Samples Per Task

Aliakbarpour, Maryam; Bairaktari, Konstantina; Brown, Gavin; Smith, Adam; Srebro, Nathan; Ullman, Jonathan (June 2024, Conference on Learning Theory)
The Double-Edged Sword of Implicit Bias: Generalization vs. Robustness in ReLU Networks

Frei, Spencer; Vardi, Gal; Bartlett, Peter L.; Srebro, Nathan (December 2023, Advances in Neural Information Processing Systems)

Full Text Available
Implicit Bias in Leaky ReLU Networks Trained on High-Dimensional Data

Frei, Spencer; Vardi, Gal; Bartlett, Peter L.; Srebro, Nathan; Hu, Wei (May 2023, Proceedings of ICLR 2023)

Full Text Available
A Non-Asymptotic Moreau Envelope Theory for High-Dimensional Generalized Linear Models

Zhou, Lijia; Koehler, Frederic; Sur, Pragya; Sutherland, Danica J.; Srebro, Nathan (November 2022, Advances in Neural Information Processing Systems)

Full Text Available
Implicit Bias of the Step Size in Linear Diagonal Neural Networks

Nacson, Mor Shpigel; Ravichandran, Kavya; Srebro, Nathan; Soudry, Daniel (January 2022, International Conference on Machine Learning)

Full Text Available
How catastrophic can catastrophic forgetting be in linear regression?

Evron, Itay; Moroshko, Edward; Ward, Rachel; Srebro, Nathan; Soudry, Daniel (January 2022, Conference on Learning Theory)

Full Text Available
