NSF PAR Search | NSF Public Access Repository


Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning

Narang, Adhyyan; Wagenmaker, Andrew; Ratliff, Lillian; Jamieson, Kevin (December 2024, Neural Information Processing Systems)

Full Text Available
Multiplayer performative prediction: Learning in decision-dependent games

Narang, Adhyyan; Faulkner, Evan; Drusvyatskiy, Dmitriy; Fazel, Maryam; Ratliff, Lillian J (December 2023, Journal of Machine Learning Research)

Full Text Available
Multiplayer Performative Prediction: Learning in Decision-Dependent Games

Narang, Adhyyan; Faulkner, Evan; Drusvyatskiy, Dmitriy; Fazel, Maryam; Ratliff, Lillian J. (July 2023, Journal of Machine Learning Research)

Full Text Available
Multiplayer Performative Prediction: Learning in Decision-Dependent Games

Narang, Adhyyan; Faulkner, Evan; Drusvyatskiy, Dmitriy; Fazel, Maryam; Ratliff, Lillian J. (January 2023, Journal of Machine Learning Research)

Learning problems commonly exhibit an interesting feedback mechanism wherein the population data reacts to competing decision makers’ actions. This paper formulates a new game-theoretic framework for this phenomenon, called multi-player performative prediction. We focus on two distinct solution concepts, namely (i) performatively stable equilibria and (ii) Nash equilibria of the game. The latter equilibria are arguably more informative, but are generally computationally difficult to find since they are solutions of nonmonotone games. We show that under mild assumptions, the performatively stable equilibria can be found efficiently by a variety of algorithms, including repeated retraining and the repeated (stochastic) gradient method. We then establish transparent sufficient conditions for strong monotonicity of the game and use them to develop algorithms for finding Nash equilibria. We investigate derivative-free methods and adaptive gradient algorithms wherein each player alternates between learning a parametric description of their distribution and taking gradient steps on the empirical risk. Synthetic and semi-synthetic numerical experiments illustrate the results.
Full Text Available
Towards Sample-efficient Overparameterized Meta-learning

Sun, Yue; Narang, Adhyyan; Gulluk, Ibrahim; Oymak, Samet; Fazel, Maryam (December 2021, Advances in Neural Information Processing Systems)

An overarching goal in machine learning is to build a generalizable model with few samples. To this end, overparameterization has been the subject of immense interest to explain the generalization ability of deep nets even when the size of the dataset is smaller than that of the model. While the prior literature focuses on the classical supervised setting, this paper aims to demystify overparameterization for meta-learning. Here we have a sequence of linear-regression tasks and we ask: (1) Given earlier tasks, what is the optimal linear representation of features for a new downstream task? and (2) How many samples do we need to build this representation? This work shows that, surprisingly, overparameterization arises as a natural answer to these fundamental meta-learning questions. Specifically, for (1), we first show that learning the optimal representation coincides with the problem of designing a task-aware regularization to promote inductive bias. We leverage this inductive bias to explain how the downstream task actually benefits from overparameterization, in contrast to prior works on few-shot learning. For (2), we develop a theory to explain how feature covariance can implicitly help reduce the sample complexity well below the degrees of freedom and lead to small estimation error. We then integrate these findings to obtain an overall performance guarantee for our meta-learning algorithm. Numerical experiments on real and synthetic data verify our insights on overparameterized meta-learning.
Full Text Available
Global Convergence to Local Minmax Equilibrium in Classes of Nonconvex Zero-Sum Games

Fiez, Tanner; Ratliff, Lillian J.; Mazumdar, Eric; Faulkner, Evan; Narang, Adhyyan (December 2021, Proceedings of the Conference on Neural Information Processing Systems)

Full Text Available
Towards Sample-efficient Overparameterized Meta-learning

Sun, Yue; Narang, Adhyyan; Gulluk, Halil Ibrahim; Oymak, Samet; Fazel, Maryam (January 2021, 35th Conference on Neural Information Processing Systems)

Full Text Available
Global Convergence to Local Minmax Equilibrium in Classes of Nonconvex Zero-Sum Games

Fiez, Tanner; Ratliff, Lillian J.; Mazumdar, Eric; Faulkner, Evan; Narang, Adhyyan (January 2021, Advances in Neural Information Processing Systems)

Full Text Available
Classification vs regression in overparameterized regimes: Does the loss function matter?

Muthukumar, Vidya; Narang, Adhyyan; Subramanian, Vignesh; Belkin, Mikhail; Hsu, Daniel; Sahai, Anant (January 2021, Journal of Machine Learning Research)

Full Text Available
