NSF PAR Search | NSF Public Access Repository


Min-Max Optimization under Delays

Adibi, Arman; Mitra, Aritra; Hassani, Hamed (June 2024, American Control Conference (ACC))

Full Text Available
Stochastic Approximation with Delayed Updates: Finite-Time Rates under Markovian Sampling

Adibi, Arman; DalFabbro, Nicolo; Schenato, Luca; Kulkarni, Sanjeev; Poor, Vincent; Pappas, George; Hassani, Hamed; Mitra, Aritra (May 2024, PMLR)

Full Text Available
Temporal Difference Learning with Compressed Updates: Error-Feedback meets Reinforcement Learning

Mitra, Aritra; Pappas, George; Hassani, Hamed (April 2024, Transactions on Machine Learning Research)

Full Text Available
T-Cal: An Optimal Test for the Calibration of Predictive Models

Lee, Donghwan; Huang, Xinmeng; Hassani, Hamed; Dobriban, Edgar (November 2023, Journal of machine learning)

Full Text Available
Demystifying Disagreement-on-the-Line in High Dimensions

Donghwan Lee, Behrad Moniri (July 2023, ICML 2023)

Evaluating the performance of machine learning models under distribution shifts is challenging, especially when we only have unlabeled data from the shifted (target) domain, along with labeled data from the original (source) domain. Recent work suggests that the notion of disagreement, the degree to which two models trained with different randomness differ on the same input, is a key to tackling this problem. Experimentally, disagreement and prediction error have been shown to be strongly connected, which has been used to estimate model performance. Experiments have led to the discovery of the disagreement-on-the-line phenomenon, whereby the classification error under the target domain is often a linear function of the classification error under the source domain; and whenever this property holds, disagreement under the source and target domain follow the same linear relation. In this work, we develop a theoretical foundation for analyzing disagreement in high-dimensional random features regression; and study under what conditions the disagreement-on-the-line phenomenon occurs in our setting. Experiments on CIFAR-10-C, Tiny ImageNet-C, and Camelyon17 are consistent with our theory and support the universality of the theoretical findings.
Full Text Available
Collaborative Learning of Discrete Distributions under Heterogeneity and Communication Constraints

Xinmeng Huang, Donghwan Lee (December 2022, Advances in neural information processing systems)

In modern machine learning, users often have to collaborate to learn distributions that generate the data. Communication can be a significant bottleneck. Prior work has studied homogeneous users—i.e., whose data follow the same discrete distribution—and has provided optimal communication-efficient methods. However, these methods rely heavily on homogeneity, and are less applicable in the common case when users’ discrete distributions are heterogeneous. Here we consider a natural and tractable model of heterogeneity, where users’ discrete distributions only vary sparsely, on a small number of entries. We propose a novel two-stage method named SHIFT: First, the users collaborate by communicating with the server to learn a central distribution; relying on methods from robust statistics. Then, the learned central distribution is fine-tuned to estimate the individual distributions of users. We show that our method is minimax optimal in our model of heterogeneity and under communication constraints. Further, we provide experimental results using both synthetic data and n-gram frequency estimation in the text domain, which corroborate its efficiency.
Full Text Available
