NSF PAR Search | NSF Public Access Repository

Streaming Quantiles Algorithms with Small Space and Update Time

https://doi.org/10.3390/s22249612

Ivkin, Nikita; Liberty, Edo; Lang, Kevin; Karnin, Zohar; Braverman, Vladimir (December 2022, Sensors)

Approximating quantiles and distributions over streaming data has been studied for roughly two decades now. Recently, Karnin, Lang, and Liberty proposed the first asymptotically optimal algorithm for doing so. This manuscript complements their theoretical result by providing practical variants of their algorithm with improved constants. For a given sketch size, our techniques provably reduce the upper bound on the sketch error by a factor of two. These improvements are verified experimentally. Our modified quantile sketch also improves latency by reducing the worst-case update time from O(1/ε) down to O(log(1/ε)).
Full Text Available
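
For readers unfamiliar with compactor-based quantile sketches, the following is a minimal illustrative sketch in Python, not the algorithm from the paper above. It assumes a uniform buffer capacity k at every level (the sketch of Karnin, Lang, and Liberty uses level-dependent capacities, and the improvements described in the abstract are not reproduced here). The class name ToyCompactorSketch and all parameter names are hypothetical.

```python
import random


class ToyCompactorSketch:
    """Simplified compactor hierarchy; an item stored at level h has weight 2**h."""

    def __init__(self, k=128, seed=0):
        if k < 2:
            raise ValueError("k must be at least 2")
        self.k = k                      # assumed uniform capacity per level
        self.rng = random.Random(seed)
        self.levels = [[]]
        self.n = 0                      # number of stream items seen

    def update(self, x):
        self.levels[0].append(x)
        self.n += 1
        h = 0
        # Compact every level that has reached capacity, bottom to top.
        while len(self.levels[h]) >= self.k:
            self._compact(h)
            h += 1

    def _compact(self, h):
        if h + 1 == len(self.levels):
            self.levels.append([])
        buf = sorted(self.levels[h])
        # With an odd count, leave one item behind so total weight is preserved.
        keep = [buf.pop()] if len(buf) % 2 else []
        offset = self.rng.randrange(2)  # keep odd- or even-indexed items at random
        self.levels[h + 1].extend(buf[offset::2])
        self.levels[h] = keep

    def rank(self, x):
        """Estimated number of stream items <= x."""
        return sum((1 << h) * sum(1 for y in items if y <= x)
                   for h, items in enumerate(self.levels))

    def quantile(self, q):
        """Smallest stored item whose estimated rank reaches q * n."""
        weighted = sorted((y, 1 << h)
                          for h, items in enumerate(self.levels) for y in items)
        acc = 0
        for y, w in weighted:
            acc += w
            if acc >= q * self.n:
                return y
        return weighted[-1][0]


# Usage: stream a shuffled range and query the median.
data = list(range(10_000))
random.Random(1).shuffle(data)
sketch = ToyCompactorSketch(k=128, seed=0)
for value in data:
    sketch.update(value)
median_estimate = sketch.quantile(0.5)
stored = sum(len(level) for level in sketch.levels)
```

Each compaction halves the number of stored items while doubling their weight, so the total weight always equals the stream length; the random choice of offset is what keeps the rank error unbiased in this toy version.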
Flow-Level Loss Detection with Δ-Sketches

https://doi.org/10.1145/3563647.3563653

Landau Feibish, Shir; Liu, Zaoxing; Ivkin, Nikita; Chen, Xiaoqi; Braverman, Vladimir; Rexford, Jennifer (October 2022, Proceedings of ACM SIGCOMM Symposium on SDN Research (SOSR '22))

Packet drops caused by congestion are a fundamental problem in network operation. Yet, it is difficult to detect where drops are happening, let alone which flows are most affected. Detecting the small-timescale drops caused by short bursts of traffic is even more challenging, and traditional monitoring techniques can easily miss them. To uncover packet drops as they occur inside a switch, the analysis must be real-time, fine-grained, and efficient. However, modern switches have distributed packet-processing pipelines that see either the arriving or departing traffic, but not the packet drops. Plus, they do not have enough memory to store per-flow state. Our MIDST system addresses these challenges through a distributed compact data structure with lightweight coordination between ingress and egress pipelines. MIDST identifies the flows experiencing loss, as well as the bursty flows responsible, across different burst durations. Our evaluation with real-world traces and TCP connections shows that MIDST uses little memory (e.g., 320KB) while providing high accuracy (95% to 98%) under varying loss rates and burst durations. We evaluate a low-rate DDoS attack and demonstrate the potential use of our measurement results for attack detection and mitigation.
Full Text Available
Sketch and Scale: Geo-distributed tSNE and UMAP

Wei, Viska; Ivkin, Nikita; Braverman, Vladimir; Szalay, Alexander (January 2020, IEEE International Conference on Big Data)
Full Text Available
QPipe: quantiles sketch fully in the data plane

https://doi.org/10.1145/3359989.3365433

Ivkin, Nikita; Yu, Zhuolong; Braverman, Vladimir; Jin, Xin (December 2019, International Conference on emerging Networking EXperiments and Technologies (CoNEXT))

Full Text Available
FetchSGD: Communication-Efficient Federated Learning with Sketching

Rothchild, Daniel; Panda, Ashwinee; Ullah, Enayat; Ivkin, Nikita; Stoica, Ion; Braverman, Vladimir; Gonzalez, Joseph; Arora, Raman (July 2020, Proceedings of Machine Learning Research)

Full Text Available
FetchSGD: Communication-Efficient Federated Learning with Sketching.

Rothchild, Daniel; Panda, Ashwinee; Ullah, Enayat; Ivkin, Nikita; Stoica, Ion; Braverman, Vladimir; Gonzalez, Joseph; Arora, Raman (July 2020, Proceedings of Machine Learning Research)
Existing approaches to federated learning suffer from a communication bottleneck as well as convergence issues due to sparse client participation. In this paper we introduce a novel algorithm, called FetchSGD, to overcome these challenges. FetchSGD compresses model updates using a Count Sketch, and then takes advantage of the merge-ability of sketches to combine model updates from many workers. A key insight in the design of FetchSGD is that, because the Count Sketch is linear, momentum and error accumulation can both be carried out within the sketch. This allows the algorithm to move momentum and error accumulation from clients to the central aggregator, overcoming the challenges of sparse client participation while still achieving high compression rates and good convergence. We prove that FetchSGD has favorable convergence guarantees, and we demonstrate its empirical effectiveness by training two residual networks and a transformer model.
Full Text Available
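
The linearity property mentioned in the FetchSGD abstract can be illustrated with a toy Count Sketch in Python. This is a minimal sketch under stated assumptions, not the FetchSGD implementation: explicit random bucket and sign tables stand in for hash functions, gradients are small integer vectors so that equality is exact, and the class and method names (CountSketch, add_vector, merge, estimate) are hypothetical.

```python
import random
import statistics


class CountSketch:
    """Toy Count Sketch: a rows x cols table of signed counters."""

    def __init__(self, rows, cols, dim, seed=0):
        rng = random.Random(seed)
        self.rows, self.cols, self.dim = rows, cols, dim
        # Per-row bucket and sign assignments for each coordinate
        # (an assumption standing in for pairwise-independent hashes).
        self.bucket = [[rng.randrange(cols) for _ in range(dim)]
                       for _ in range(rows)]
        self.sign = [[rng.choice((-1, 1)) for _ in range(dim)]
                     for _ in range(rows)]
        self.table = [[0] * cols for _ in range(rows)]

    def add_vector(self, vec):
        """Add a dense vector into the sketch."""
        for r in range(self.rows):
            for i, v in enumerate(vec):
                self.table[r][self.bucket[r][i]] += self.sign[r][i] * v

    def merge(self, other):
        """Add another sketch in place; both must share the same hash tables."""
        if self.bucket != other.bucket or self.sign != other.sign:
            raise ValueError("sketches use different hash tables")
        for r in range(self.rows):
            for c in range(self.cols):
                self.table[r][c] += other.table[r][c]

    def estimate(self, i):
        """Median-of-rows estimate of coordinate i."""
        return statistics.median(
            self.sign[r][i] * self.table[r][self.bucket[r][i]]
            for r in range(self.rows))


# Two workers sketch their own (integer-valued) updates with a shared seed.
dim = 16
g1 = [0] * dim
g1[3] = 7
g2 = [0] * dim
g2[3] = 5
g2[10] = -2

s1 = CountSketch(5, 32, dim, seed=42)
s1.add_vector(g1)
s2 = CountSketch(5, 32, dim, seed=42)
s2.add_vector(g2)
s1.merge(s2)  # aggregator adds the sketches

# Sketching the summed vector directly gives the same table.
direct = CountSketch(5, 32, dim, seed=42)
direct.add_vector([a + b for a, b in zip(g1, g2)])
tables_match = s1.table == direct.table
```

Because the sketch of a sum equals the sum of the sketches, any linear operation on updates, such as accumulating momentum or error terms, can be applied to the tables themselves, which is the property the abstract relies on.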
FetchSGD: Communication-Efficient Federated Learning with Sketching

Rothchild, Daniel; Panda, Ashwinee; Ullah, Enayat; Ivkin, Nikita; Stoica, Ion; Braverman, Vladimir; Gonzalez, Joseph; Arora, Raman (July 2020, International Conference on Machine Learning)
Existing approaches to federated learning suffer from a communication bottleneck as well as convergence issues due to sparse client participation. In this paper we introduce a novel algorithm, called FetchSGD, to overcome these challenges. FetchSGD compresses model updates using a Count Sketch, and then takes advantage of the merge-ability of sketches to combine model updates from many workers. A key insight in the design of FetchSGD is that, because the Count Sketch is linear, momentum and error accumulation can both be carried out within the sketch. This allows the algorithm to move momentum and error accumulation from clients to the central aggregator, overcoming the challenges of sparse client participation while still achieving high compression rates and good convergence. We prove that FetchSGD has favorable convergence guarantees, and we demonstrate its empirical effectiveness by training two residual networks and a transformer model.
Full Text Available
Communication-efficient Distributed SGD with Sketching

Ivkin, Nikita; Rothchild, Daniel; Ullah, Enayat; Braverman, Vladimir; Stoica, Ion; Arora, Raman (December 2019, Thirty-third Conference on Neural Information Processing Systems)

Full Text Available
I Know What You Did Last Summer: Network Monitoring using Interval Queries

https://doi.org/3376928

Ivkin, Nikita; Ben Basat, Ran; Liu, Zaoxing; Einziger, Gil; Braverman, Vladimir (August 2019, Proceedings of the ACM on Measurement and Analysis of Computing Systems)

Full Text Available