NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Federated Learning under Distributed Concept Drift

Jothimurugesan, Ellango; Hsieh, Kevin; Wang, Jianyu; Joshi, Gauri; Gibbons Phillip B. (April 2023, Artificial Intelligence and Statistics Conference (AISTATS))

Federated Learning (FL) under distributed concept drift is a largely unexplored area. Although concept drift is itself a well-studied phenomenon, it poses particular challenges for FL, because drifts arise staggered in time and space (across clients). Our work is the first to explicitly study data heterogeneity in both dimensions. We first demonstrate that prior solutions to drift adaptation, with their single global model, are ill-suited to staggered drifts, necessitating multiple-model solutions. We identify the problem of drift adaptation as a time-varying clustering problem, and we propose two new clustering algorithms for reacting to drifts based on local drift detection and hierarchical clustering. Empirical evaluation shows that our solutions achieve significantly higher accuracy than existing baselines, and are comparable to an idealized algorithm with oracle knowledge of the ground-truth clustering of clients to concepts at each time step.
more » « less
Full Text Available
Federated Learning under Distributed Concept Drift

Jothimurugesan, Ellango; Hsieh, Kevin; Wang, Jianyu; Joshi, Gauri; Gibbons, Phillip B. (January 2023, Proceedings of Machine Learning Research)

Federated Learning (FL) under distributed concept drift is a largely unexplored area. Although concept drift is itself a well-studied phenomenon, it poses particular challenges for FL, because drifts arise staggered in time and space (across clients). Our work is the first to explicitly study data heterogeneity in both dimensions. We first demonstrate that prior solutions to drift adaptation, with their single global model, are ill-suited to staggered drifts, necessitating multiple-model solutions. We identify the problem of drift adaptation as a time-varying clustering problem, and we propose two new clustering algorithms for reacting to drifts based on local drift detection and hierarchical clustering. Empirical evaluation shows that our solutions achieve significantly higher accuracy than existing baselines, and are comparable to an idealized algorithm with oracle knowledge of the ground-truth clustering of clients to concepts at each time step.
more » « less
Full Text Available
DriftSurf: Stable-State / Reactive-State Learning under Concept Drift

Tahmasbi, Ashraf; Jothimurugesan, Ellango; Tirthapura, Srikanta; Gibbons, Phillip (July 2021, Proceedings of the 38th International Conference on Machine Learning, ICML'21)

Full Text Available
DriftSurf: Stable-State / Reactive-State Learning under Concept Drift

Tahmasbi, Ashraf; Jothimurugesan, Ellango; Tirthapura, Srikanta; Gibbons, Phillip B. (July 2021, Proceedings of the 38th International Conference on Machine Learning, {ICML} 2021, 18-24 July 2021, Virtual Even)

When learning from streaming data, a change in the data distribution, also known as concept drift, can render a previously-learned model inaccurate and require training a new model. We present an adaptive learning algorithm that extends previous drift-detection-based methods by incorporating drift detection into a broader stable-state/reactive-state process. The advantage of our approach is that we can use aggressive drift detection in the stable state to achieve a high detection rate, but mitigate the false positive rate of standalone drift detection via a reactive state that reacts quickly to true drifts while eliminating most false positives. The algorithm is generic in its base learner and can be applied across a variety of supervised learning problems. Our theoretical analysis shows that the risk of the algorithm is (i) statistically better than standalone drift detection and (ii) competitive to an algorithm with oracle knowledge of when (abrupt) drifts occur. Experiments on synthetic and real datasets with concept drifts confirm our theoretical analysis.
more » « less
Full Text Available
DriftSurf: A Risk-competitive Learning Algorithm under Concept Drift

Tahmasbi, Ashraf; Jothimurugesan, Ellango; Tirthapura, Srikanta; Gibbons, Phillip B. (January 2020, ArXivorg)

When learning from streaming data, a change in the data distribution, also known as concept drift, can render a previously-learned model inaccurate and require training a new model. We present an adaptive learning algorithm that extends previous drift-detection-based methods by incorporating drift detection into a broader stable-state/reactive-state process. The advantage of our approach is that we can use aggressive drift detection in the stable state to achieve a high detection rate, but mitigate the false positive rate of standalone drift detection via a reactive state that reacts quickly to true drifts while eliminating most false positives. The algorithm is generic in its base learner and can be applied across a variety of supervised learning problems. Our theoretical analysis shows that the risk of the algorithm is competitive to an algorithm with oracle knowledge of when (abrupt) drifts occur. Experiments on synthetic and real datasets with concept drifts confirm our theoretical analysis.
more » « less
Full Text Available
Variance-Reduced Stochastic Gradient Descent on Streaming Data

Jothimurugesan, Ellango; Tahmasbi, Ashraf; Gibbons, Phillip B; Tirthapura, Srikanta (December 2018, 32nd Conference on Neural Information Processing Systems (NeurIPS 2018))

We present an algorithm STRSAGA for efficiently maintaining a machine learning model over data points that arrive over time, quickly updating the model as new training data is observed. We present a competitive analysis comparing the suboptimality of the model maintained by STRSAGA with that of an offline algorithm that is given the entire data beforehand, and analyze the risk-competitiveness of STRSAGA under different arrival patterns. Our theoretical and experimental results show that the risk of STRSAGA is comparable to that of offline algorithms on a variety of input arrival patterns, and its experimental performance is significantly better than prior algorithms suited for streaming data, such as SGD and SSVRG.
more » « less
Full Text Available
Variance-Reduced Stochastic Gradient Descent on Streaming Data

Jothimurugesan, Ellango; Tahmasbi, Ashraf; Gibbons, Phillip; Tirthapura, Srikanta (October 2018, Advances in neural information processing systems)

We present an algorithm STRSAGA for efficiently maintaining a machine learning model over data points that arrive over time, quickly updating the model as new training data is observed. We present a competitive analysis comparing the sub-optimality of the model maintained by STRSAGA with that of an offline algorithm that is given the entire data beforehand, and analyze the risk-competitiveness of STRSAGA under different arrival patterns. Our theoretical and experimental results show that the risk of STRSAGA is comparable to that of offline algorithms on a variety of input arrival patterns, and its experimental performance is significantly better than prior algorithms suited for streaming data, such as SGD and SSVRG.
more » « less
Full Text Available

Search for: All records