NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Shift Happens: Mixture of Experts based Continual Adaptation in Federated Learning

https://doi.org/10.1145/3721462.3770784

Bhope, Rahul Atul; Jayaram, K R; Venkateswaran, Praveen; Venkatasubramanian, Nalini (December 2025, ACM MIDDLEWARE '25: Proceedings of the 26th International Middleware Conference)

Not Federated Learning (FL) enables collaborative model training across decentralized clients without sharing raw data, yet faces significant challenges in real-world settings where client data distributions evolve dynamically over time. This paper tackles the critical problem of covariate and label shifts in streaming FL environments, where non-stationary data distributions degrade model performance and necessitate a middleware layer that adapts FL to distributional shifts. We introduce ShiftEx, a shift-aware mixture of experts framework that dynamically creates and trains specialized global models in response to detected distribution shifts using Maximum Mean Discrepancy for covariate shifts. The framework employs a latent memory mechanism for expert reuse and implements facility location-based optimization to jointly minimize covariate mismatch, expert creation costs, and label imbalance. Through theoretical analysis and comprehensive experiments on benchmark datasets, we demonstrate 5.5-12.9 percentage point accuracy improvements and 22-95 % faster adaptation compared to state-of-the-art FL baselines across diverse shift scenarios. The proposed approach offers a scalable, privacy-preserving middleware solution for FL systems operating in non-stationary, real-world conditions while minimizing communication and computational overhead.
more » « less
Full Text Available
OptiSeq: Ordering Examples On-The-Fly for In-Context Learning

https://doi.org/10.18653/v1/2025.findings-emnlp.1353

Bhope, Rahul Atul; Venkateswaran, Praveen; Jayaram, K R; Isahagian, Vatche; Muthusamy, Vinod; Venkatasubramanian, Nalini (November 2025, Association for Computational Linguistics Findings of the Association for Computational Linguistics: EMNLP 2025)

Developers using LLMs and LLM-based agents in their applications have provided plenty of anecdotal evidence that in-contextlearning (ICL) is fragile. In this paper, we show that in addition to the quantity and quality of examples, the order in which the incontext examples are listed in the prompt affects the output of the LLM and, consequently, their performance. While prior work has explored improving ICL through datasetdependent techniques, we introduce OptiSeq, a purely inference-time, dataset-free optimization method that efficiently determines the best example order. OptiSeq leverages log probabilities of LLM-generated outputs to systematically prune the search space of possible orderings and recommend the best order(s) by distinguishing orderings that yield high levels of accuracy and those that underperform. Extensive empirical evaluation on multiple LLMs, datasets, and prompts demonstrates that OptiSeq improves accuracy by 5.5 - 10.5 percentage points across multiple tasks.
more » « less
Full Text Available
FLIPS: Federated Learning using Intelligent Participant Selection

https://doi.org/10.1145/3590140.3629123

Bhope, Rahul Atul; Jayaram, K. R.; Venkatasubramanian, Nalini; Verma, Ashish; Thomas, Gegi (November 2023, ACM)

Search for: All records