NSF PAR Search | NSF Public Access Repository

In-Memory Key-Value Store Live Migration with NetMigrate

Zhu, Zeying; Zhu, Yibo; Liu, Zaoxing (February 2024, 22nd USENIX Conference on File and Storage Technologies (FAST '24))

Distributed key-value stores today require frequent key-value shard migration between nodes to react to dynamic workload changes for load balancing, data locality, and service elasticity. In this paper, we propose NetMigrate, a live migration approach for in-memory key-value stores based on programmable network data planes. NetMigrate migrates shards between nodes with zero service interruption and minimal performance impact. During migration, the switch data plane monitors the migration process in a fine-grained manner and directs client queries to the right server in real time, eliminating the overhead of pulling data between nodes. We implement a NetMigrate prototype on a testbed consisting of a programmable switch and several commodity servers running Redis and evaluate it under YCSB workloads. Our experiments demonstrate that NetMigrate improves query throughput by 6.5% to 416% and maintains low access latency during migration, compared to state-of-the-art migration approaches.
Full Text Available
Analyzing the Benefits of Optical Topology Programming for Mitigating Link-Flood DDoS Attacks

https://doi.org/10.1109/TDSC.2024.3391188

Nance-Hall, Matthew; Liu, Zaoxing; Sekar, Vyas; Durairajan, Ramakrishnan (January 2024, IEEE Transactions on Dependable and Secure Computing)

Full Text Available
Sketchovsky: Enabling Ensembles of Sketches on Programmable Switches

Namkung, Hun; Liu, Zaoxing; Kim, Daehyeok; Sekar, Vyas; Steenkiste, Peter (April 2023, The 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI '23))

Network operators need to run diverse measurement tasks on programmable switches to support management decisions (e.g., traffic engineering or anomaly detection). While prior work has shown the viability of running a single sketch instance, it largely ignores the problem of running an ensemble of sketch instances for a collection of measurement tasks. As such, existing efforts fall short of efficiently supporting a general ensemble of sketch instances. In this work, we present the design and implementation of Sketchovsky, a novel cross-sketch optimization and composition framework. We identify five new cross-sketch optimization building blocks to reduce the use of critical switch hardware resources. We design efficient heuristics to select and apply these building blocks for arbitrary ensembles. To simplify developer effort, Sketchovsky automatically generates the composed code to be input to the hardware compiler. Our evaluation shows that Sketchovsky makes ensembles of up to 18 sketch instances feasible and can reduce the use of critical hardware resources by up to 45%.
Full Text Available
Enabling Efficient and General Subpopulation Analytics in Multidimensional Data Streams

https://doi.org/10.14778/3551793.3551867

Manousis, Antonis; Cheng, Zhuo; Basat, Ran Ben; Liu, Zaoxing; Sekar, Vyas (September 2022, Proceedings of the VLDB Endowment)

Today’s large-scale services (e.g., video streaming platforms, data centers, sensor grids) need diverse real-time summary statistics across multiple subpopulations of multidimensional datasets. However, state-of-the-art frameworks do not offer general and accurate analytics in real time at reasonable costs. The root cause is the combinatorial explosion of data subpopulations and the diversity of summary statistics we need to monitor simultaneously. We present Hydra, an efficient framework for multidimensional analytics that combines a “sketch of sketches”, which avoids the overhead of monitoring exponentially many subpopulations, with universal sketching, which ensures accurate estimates for multiple statistics. We build Hydra as an Apache Spark plugin and address practical system challenges to minimize overheads at scale. Across multiple real-world and synthetic multidimensional datasets, we show that Hydra can achieve robust error bounds and is an order of magnitude more efficient in terms of operational cost and memory footprint than existing frameworks (e.g., Spark, Druid) while ensuring interactive estimation times.
Full Text Available
