NSF PAR Search | NSF Public Access Repository


Faster margin maximization rates for generic and adversarially robust optimization methods

https://doi.org/10.1007/s10107-025-02283-4

Wang, Guanghui; Hu, Zihao; Gentile, Claudio; Muthukumar, Vidya; Abernethy, Jacob (October 2025, Mathematical Programming)

Abstract: First-order optimization methods tend to inherently favor certain solutions over others when minimizing an underdetermined training objective that has multiple global optima. This phenomenon, known as implicit bias, plays a critical role in understanding the generalization capabilities of optimization algorithms. Recent research has revealed that in separable binary classification tasks gradient-descent-based methods exhibit an implicit bias for the $\ell_2$-maximal margin classifier. Similarly, generic optimization methods, such as mirror descent and steepest descent, have been shown to converge to maximal margin classifiers defined by alternative geometries. While gradient-descent-based algorithms provably achieve fast implicit bias rates, corresponding rates in the literature for generic optimization methods are relatively slow. To address this limitation, we present a series of state-of-the-art implicit bias rates for mirror descent and steepest descent algorithms. Our primary technique involves transforming a generic optimization algorithm into an online optimization dynamic that solves a regularized bilinear game, providing a unified framework for analyzing the implicit bias of various optimization methods. Our accelerated rates are derived by leveraging the regret bounds of online learning algorithms within this game framework. We then show the flexibility of this framework by analyzing the implicit bias in adversarial training, and again obtain significantly improved convergence rates.
Task shift: From classification to regression in overparameterized linear models

LaBonte, Tyler; Lai, Kuo-Wei; Muthukumar, Vidya (June 2025, International Conference on Artificial Intelligence and Statistics)

Free, publicly-accessible full text available June 17, 2026
Estimating stationary mass, frequency by frequency

Nakul, Milind; Muthukumar, Vidya; Pananjady, Ashwin (June 2025, Conference on Learning Theory)

Free, publicly-accessible full text available June 1, 2026
On the unreasonable effectiveness of last-layer retraining

Hill, John C; LaBonte, Tyler; Zhang, Xinchen; Muthukumar, Vidya (March 2025, ICLR Workshop on Spurious Correlations and Shortcut Learning)

Free, publicly-accessible full text available March 5, 2026
The group robustness is in the details: Revisiting finetuning under spurious correlations

LaBonte, Tyler; Hill, John C; Zhang, Xinchen; Muthukumar, Vidya; Kumar, Abhishek (January 2025, Neural Information Processing Systems)

Full Text Available
Precise asymptotics of reweighted least-squares algorithms for linear diagonal networks

Kaushik, Chiraag; Romberg, Justin; Muthukumar, Vidya (January 2025, Neural Information Processing Systems)

Full Text Available
Just Wing It: Near-optimal estimation of missing mass in a Markovian sequence

Pananjady, Ashwin; Muthukumar, Vidya; Thangaraj, Andrew (October 2024, Journal of Machine Learning Research)

Full Text Available
New Equivalences between Interpolation and SVMs: Kernels and Structured Features

https://doi.org/10.1137/23M1568764

Kaushik, Chiraag; McRae, Andrew D; Davenport, Mark; Muthukumar, Vidya (September 2024, SIAM Journal on Mathematics of Data Science)

Full Text Available
Sharp Analysis of Out-of-Distribution Error for “Importance-Weighted” Estimators in the Overparameterized Regime

https://doi.org/10.1109/ISIT57864.2024.10619252

Lai, Kuo-Wei; Muthukumar, Vidya (July 2024, IEEE)

Full Text Available
Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance

Kaushik, Chiraag; Liu, Ran; Lin, Chi-Heng; Khera, Amrit; Jin, Matthew Y; Ma, Wenrui; Muthukumar, Vidya; Dyer, Eva L (July 2024, International Conference on Machine Learning)

Full Text Available

