NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Decoding Speculative Decoding

https://doi.org/10.18653/v1/2025.naacl-long.328

Yan, Minghao; Agarwal, Saurabh; Venkataraman, Shivaram (April 2025, Association for Computational Linguistics)

Free, publicly-accessible full text available April 29, 2026
Bagpipe: Accelerating Deep Recommendation Model Training

https://doi.org/10.1145/3600006.3613142

Agarwal, Saurabh; Yan, Chengpo; Zhang, Ziyi; Venkataraman, Shivaram (October 2023, ACM)
Adaptive Gradient Communication via Critical Learning Regime Identification

Agarwal, Saurabh; Wang, Hongyi; Lee, Kangwook; Venkataraman, Shivaram; Papailiopoulos, Dimitris (January 2021, Proceedings of Machine Learning and Systems)
Smola, A.; Dimakis, A.; Stoica, I. (Ed.)
Distributed model training suffers from communication bottlenecks due to frequent model updates transmitted across compute nodes. To alleviate these bottlenecks, practitioners use gradient compression techniques like sparsification, quantization, low rank updates etc. The techniques usually require choosing a static compression ratio, often requiring users to balance the trade-off between model accuracy and per-iteration speedup. In this work, we show that such performance degradation due to choosing a high compression ratio is not fundamental and that an adaptive compression strategy can reduce communication while maintaining final test accuracy.Inspired by recent findings on critical learning regimes, in which small gradient errors can have irrecoverable impact on model performance, we propose ACCORDION a simple yet effective adaptive compression algorithm. While ACCORDION maintains a high enough compression rate on average, it avoids detrimental impact by not compressing gradients too much whenever in critical learning regimes, detected by a simple gradient-norm based criterion. Our extensive experimental study over a number of machine learning tasks in distributed environments indicates that ACCORDION, maintains similar model accuracy to uncompressed training, yet achieves up to 5.5×better compression and up to 4.1×end-to-end speedup over static approaches. We show that ACCORDION also works for adjusting the batch size, another popular strategy for alleviating communication bottlenecks. Our code is available at https://github.com/uw-mad-dash/Accordion
more » « less
Full Text Available
Attack of the Tails: Yes, You Really Can Backdoor Federated Learning

Wang, Hongyi; Sreenivasan, Kartik; Rajput, Shashank; Vishwakarma, Harit; Agarwal, Saurabh; Sohn, Jy-yong; Lee, Kangwook; Papailiopoulos, Dimitris (January 2020, Advances in neural information processing systems)

Due to its decentralized nature, Federated Learning (FL) lends itself to adversarial attacks in the form of backdoors during training. The goal of a backdoor is to corrupt the performance of the trained model on specific sub-tasks (e.g., by classifying green cars as frogs). A range of FL backdoor attacks have been introduced in the literature, but also methods to defend against them, and it is currently an open question whether FL systems can be tailored to be robust against backdoors. In this work, we provide evidence to the contrary. We first establish that, in the general case, robustness to backdoors implies model robustness to adversarial examples, a major open problem in itself. Furthermore, detecting the presence of a backdoor in a FL model is unlikely assuming first order oracles or polynomial time. We couple our theoretical results with a new family of backdoor attacks, which we refer to as edge-case backdoors. An edge-case backdoor forces a model to misclassify on seemingly easy inputs that are however unlikely to be part of the training, or test data, i.e., they live on the tail of the input distribution. We explain how these edge-case backdoors can lead to unsavory failures and may have serious repercussions on fairness, and exhibit that with careful tuning at the side of the adversary, one can insert them across a range of machine learning tasks (e.g., image classification, OCR, text prediction, sentiment analysis).
more » « less
Full Text Available

Search for: All records