NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Ravan: Multi-Head Low-Rank Adaptation for Federated Fine-Tuning

Raje, Arian; Askin, Baris; Jhunjhunwala, Divyansh; Joshi, Gauri (December 2025, The Thirty-ninth Annual Conference on Neural Information Processing Systems)

Free, publicly-accessible full text available December 5, 2026
Federated Communication-Efficient Multi-Objective Optimization

Askin, Baris; Sharma, Pranay; Joshi, Gauri; Joe-Wong, Carlee (May 2025, Proceedings of the International Conference on Artificial Intelligence and Statistics)

Free, publicly-accessible full text available May 8, 2026
Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models

https://doi.org/10.18653/v1/2024.emnlp-main.717

Cho, Yae Jee; Liu, Luyang; Xu, Zheng; Fahrezi, Aldi; Joshi, Gauri (November 2024, Association for Computational Linguistics)

Full Text Available
FedAST: Federated Asynchronous Simultaneous Training

Askin, Baris; Sharma, Pranay; Joe-Wong, Carlee; Joshi, Gauri (July 2024, Uncertainty in artificial intelligence)

Federated Learning (FL) enables edge devices or clients to collaboratively train machine learning (ML) models without sharing their private data. Much of the existing work in FL focuses on efficiently learning a model for a single task. In this paper, we study simultaneous training of multiple FL models using a common set of clients. The few existing simultaneous training methods employ synchronous aggregation of client updates, which can cause significant delays because large models and/or slow clients can bottleneck the aggregation. On the other hand, a naive asynchronous aggregation is adversely affected by stale client updates. We propose FedAST, a buffered asynchronous federated simultaneous training algorithm that overcomes bottlenecks from slow models and adaptively allocates client resources across heterogeneous tasks. We provide theoretical convergence guarantees of FedAST for smooth non-convex objective functions. Extensive experiments over multiple real-world datasets demonstrate that our proposed method outperforms existing simultaneous FL approaches, achieving up to 46.0% reduction in time to train multiple tasks to completion.
more » « less
Full Text Available
Erasure Coded Neural Network Inference via Fisher Averaging

https://doi.org/10.1109/ISIT57864.2024.10619514

Jhunjhunwala, Divyansh; Jali, Neharika; Joshi, Gauri; Wang, Shiqiang (July 2024, IEEE)

Full Text Available
FedFisher: Leveraging Fisher Information for One-Shot Federated Learning

Jhunjhunwala, Divyansh; Wang, Shiqiang; Joshi, Gauri (May 2024, Proceedings of The 27th International Conference on Artificial Intelligence and Statistics)

Standard federated learning (FL) algorithms typically require multiple rounds of communication between the server and the clients, which has several drawbacks, including requiring constant network connectivity, repeated investment of computational resources, and susceptibility to privacy attacks. One-Shot FL is a new paradigm that aims to address this challenge by enabling the server to train a global model in a single round of communication. In this work, we present FedFisher, a novel algorithm for one-shot FL that makes use of Fisher information matrices computed on local client models, motivated by a Bayesian perspective of FL. First, we theoretically analyze FedFisher for two-layer over-parameterized ReLU neural networks and show that the error of our one-shot FedFisher global model becomes vanishingly small as the width of the neural networks and amount of local training at clients increases. Next, we propose practical variants of FedFisher using the diagonal Fisher and K-FAC approximation for the full Fisher and highlight their communication and compute efficiency for FL. Finally, we conduct extensive experiments on various datasets, which show that these variants of FedFisher consistently improve over competing baselines.
more » « less
Full Text Available
On Improved Distributed Random Reshuffling over Networks

https://doi.org/10.1109/ICASSP48485.2024.10447202

Sharma, Pranay; Li, Jiarui; Joshi, Gauri (April 2024, IEEE)

Full Text Available
Maximizing Global Model Appeal in Federated Learning

Cho, Yae Jee; Jhunjhunwala, Divyansh; Li, Tian; Smith, Virginia; Joshi, Gauri (March 2024, Transactions on machine learning research)
Bellet, Aurelien (Ed.)
Federated learning (FL) aims to collaboratively train a global model using local data from a network of clients. To warrant collaborative training, each federated client may expect the resulting global model to satisfy some individual requirement, such as achieving a certain loss threshold on their local data. However, in real FL scenarios, the global model may not satisfy the requirements of all clients in the network due to the data heterogeneity across clients. In this work, we explore the problem of global model appeal in FL, which we define as the total number of clients that find that the global model satisfies their individual requirements. We discover that global models trained using traditional FL approaches can result in a significant number of clients unsatisfied with the model based on their local requirements. As a consequence, we show that global model appeal can directly impact how clients participate in training and how the model performs on new clients at inference time. Our work proposes MaxFL, which maximizes the number of clients that find the global model appealing. MaxFL achieves a 22-40% and 18-50% improvement in the test accuracy of training clients and (unseen) test clients respectively, compared to a wide range of FL approaches that tackle data heterogeneity, aim to incentivize clients, and learn personalized/fair models.
more » « less
Full Text Available
Federated Minimax Optimization with Client Heterogeneity

Sharma, Pranay; Panda, Rohan; Joshi, Gauri (December 2023, Transactions on machine learning research)

Minimax optimization has seen a surge in interest with the advent of modern applications such as GANs, and it is inherently more challenging than simple minimization. The difficulty is exacerbated by the training data residing at multiple edge devices or clients, especially when these clients can have heterogeneous datasets and heterogeneous local computation capabilities. We propose a general federated minimax optimization framework that subsumes such settings and several existing methods like Local SGDA. We show that naive aggregation of model updates made by clients running unequal number of local steps can result in optimizing a mismatched objective function – a phenomenon previously observed in standard federated minimization. To fix this problem, we propose normalizing the client updates by the number of local steps. We analyze the convergence of the proposed algorithm for classes of nonconvex-concave and nonconvex-nonconcave functions and characterize the impact of heterogeneous client data, partial client participation, and heterogeneous local computations. For all the function classes considered, we significantly improve the existing computation and communication complexity results. Experimental results support our theoretical claims.
more » « less
Full Text Available
Correlation Aware Sparsified Mean Estimation Using Random Projection

Jiang, Shuli; Sharma, Pranay; Joshi, Gauri (December 2023, Advances in neural information processing systems)

We study the problem of communication-efficient distributed vector mean estimation, which is a commonly used subroutine in distributed optimization and Federated Learning (FL). Rand-k sparsification is a commonly used technique to reduce communication cost, where each client sends of its coordinates to the server. However, Rand-k is agnostic to any correlations, that might exist between clients in practical scenarios. The recently proposed Rand-k-Spatial estimator leverages the cross-client correlation information at the server to improve Rand-k's performance. Yet, the performance of Rand-k-Spatial is suboptimal, and improving mean estimation is key to faster convergence in distributed optimization. We propose the Rand-Proj-Spatial estimator with a more flexible encoding-decoding procedure, which generalizes the encoding of Rand- by projecting the client vectors to a random k-dimensional subspace. We utilize Subsampled Randomized Hadamard Transform (SRHT) as the projection matrix and show that Rand-Proj-Spatial with SRHT outperforms Rand-k-Spatial, using the correlation information more efficiently. Furthermore, we propose an approach to incorporate varying degrees of correlation and suggest a practical variant of Rand-Proj-Spatial when the correlation information is not available to the server. Finally, experiments on real-world distributed optimization tasks showcase the superior performance of Rand-Proj-Spatial compared to Rand-k-Spatial and other more sophisticated sparsification techniques.
more » « less
Full Text Available

« Prev Next »

Search for: All records