NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Don’t stop me now: Embedding based scheduling for LLMs

Shahout, Rana; Malach, Eran; Liu, Chunwei; Jiang, Weifan; Yu, Minlan; Mitzenmacher, Michael (April 2025, ICLR)

Free, publicly-accessible full text available April 24, 2026
Efficient d -ary Cuckoo Hashing at High Load Factors by Bubbling Up

https://doi.org/10.1137/1.9781611978322.133

Kuszmaul, William; Mitzenmacher, Michael (January 2025, Society for Industrial and Applied Mathematics)

Full Text Available
Leveraging parameterized Chernoff bounds for simplified algorithm analyses

https://doi.org/10.1016/j.ipl.2024.106516

Dillencourt, Michael; Goodrich, Michael T; Mitzenmacher, Michael (January 2025, Information Processing Letters)

In this paper, we derive parameterized Chernoff bounds and show their applications for simplifying the analysis of some well-known probabilistic algorithms and data structures. The parameterized Chernoff bounds we provide give probability bounds that are powers of two, with a clean formulation of the relation between the constant in the exponent and the relative distance from the mean. In addition, we provide new simplified analyses with these bounds for hash tables, randomized routing, and a simplified, non-recursive adaptation of the Floyd-Rivest selection algorithm.
more » « less
Full Text Available
Parallel Peeling of Invertible Bloom Lookup Tables in a Constant Number of Rounds

https://doi.org/10.1007/978-3-031-82697-9_6

Goodrich, Michael T; Kitagawa, Ryuto; Mitzenmacher, Michael (January 2025, Springer Nature Switzerland)

Full Text Available
SkipPredict: When to Invest in Predictions for Scheduling

Shahout, Rana; Mitzenmacher, Michael (December 2024, Neurips 2024)

Full Text Available
Optimal and Approximate Adaptive Stochastic Quantization

Basat, Ran; Ben-Itzhak, Yaniv; Mitzenmacher, Michael; Vargaftik, Shay (December 2024, Neurips 2024)

Full Text Available
Beyond Throughput and Compression Ratios: Towards High End-to-end Utility of Gradient Compression

https://doi.org/10.1145/3696348.3696857

Han, Wenchen; Vargaftik, Shay; Mitzenmacher, Michael; Karp, Brad; Basat, Ran Ben (November 2024, ACM)

Full Text Available
Learning-Augmented Frequency Estimation in Sliding Windows

https://doi.org/10.1109/ICNP61940.2024.10858536

Shahout, Rana; Sabek, Ibrahim; Mitzenmacher, Michael (October 2024, IEEE)

Full Text Available
Learning-Based Heavy Hitters and Flow Frequency Estimation in Streams

https://doi.org/10.1109/ICNP61940.2024.10858542

Shahout, Rana; Mitzenmacher, Michael (October 2024, IEEE)

Full Text Available
Direct Telemetry Access

https://doi.org/10.1145/3603269.3604827

Langlet, Jonatan; Ben_Basat, Ran; Oliaro, Gabriele; Mitzenmacher, Michael; Yu, Minlan; Antichi, Gianni (September 2023, Proceedings of ACM SIGCOMM)

Fine-grained network telemetry is becoming a modern datacenter standard and is the basis of essential applications such as congestion control, load balancing, and advanced troubleshooting. As network size increases and telemetry gets more fine-grained, there is a tremendous growth in the amount of data needed to be reported from switches to collectors to enable network-wide view. As a consequence, it is progressively hard to scale data collection systems. We introduce Direct Telemetry Access (DTA), a solution optimized for aggregating and moving hundreds of millions of reports per second from switches into queryable data structures in collectors' memory. DTA is lightweight and it is able to greatly reduce overheads at collectors. DTA is built on top of RDMA, and we propose novel and expressive reporting primitives to allow easy integration with existing state-of-the-art telemetry mechanisms such as INT or Marple. We show that DTA significantly improves telemetry collection rates. For example, when used with INT, it can collect and aggregate over 400M reports per second with a single server, improving over the Atomic MultiLog by up to 16x.
more » « less

« Prev Next »

Search for: All records