Maximum Coverage in Turnstile Streams with Applications to Fingerprinting Measures

Ene, Alina; Epasto, Alessandro; Mirrokni, Vahab; Nguyen, Hoai-An; Nguyen, Huy L; Woodruff, David P; Zhong, Peilin

This content will become publicly available on July 13, 2026

In the maximum coverage problem, we are given d subsets of a universe [n], and the goal is to output k subsets whose union covers the largest possible number of distinct items. We present the first algorithm for maximum coverage in the turnstile streaming model, where updates that insert or delete an item from a subset arrive one by one. Notably, our algorithm uses only polylog n update time. We also present turnstile streaming algorithms for targeted and general fingerprinting for risk management, where the goal is to determine which features pose the greatest re-identification risk in a dataset. As part of our work, we give a result of independent interest: an algorithm to estimate the complement of the pth frequency moment of a vector for p ≥ 2. Empirical evaluation confirms the practicality of our fingerprinting algorithms, demonstrating a speedup of up to 210x over prior work.
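
To make the problem statement concrete, here is a minimal sketch of the classic offline greedy heuristic for maximum coverage, which is known to achieve a (1 - 1/e) approximation. It is not the turnstile streaming algorithm described in the abstract: it assumes all d subsets are held in memory and that no insertions or deletions occur. The function name `greedy_max_coverage` and the toy input are illustrative assumptions, not taken from the paper.

```python
from typing import Dict, List, Set, Tuple


def greedy_max_coverage(
    subsets: Dict[str, Set[int]], k: int
) -> Tuple[List[str], Set[int]]:
    """Pick up to k subsets, each time taking the one that adds the most new items.

    Offline baseline only (assumption: all subsets fit in memory).
    """
    chosen: List[str] = []
    covered: Set[int] = set()
    for _ in range(k):
        best_name = None
        best_gain = 0
        for name in sorted(subsets):  # sorted for deterministic tie-breaking
            if name in chosen:
                continue
            gain = len(subsets[name] - covered)  # items not yet covered
            if gain > best_gain:
                best_name, best_gain = name, gain
        if best_name is None:  # no remaining subset adds anything new
            break
        chosen.append(best_name)
        covered |= subsets[best_name]
    return chosen, covered


# Hypothetical toy instance: d = 4 subsets over the universe {1, ..., 6}, k = 2.
example = {
    "A": {1, 2, 3, 4},
    "B": {3, 4, 5},
    "C": {5, 6},
    "D": {1, 6},
}
chosen, covered = greedy_max_coverage(example, k=2)
print(chosen, sorted(covered))
```

On this toy instance the greedy rule first takes the largest subset and then the subset contributing the most uncovered items, so the two chosen subsets together cover the whole universe. In the turnstile setting the subsets change under insertions and deletions, which is why a direct in-memory scan like this does not apply.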

Award ID(s): 2311649

PAR ID: 10599721

Author(s) / Creator(s): Ene, Alina; Epasto, Alessandro; Mirrokni, Vahab; Nguyen, Hoai-An; Nguyen, Huy L; Woodruff, David P; Zhong, Peilin

Publisher / Repository: Proceedings of Machine Learning Research

Date Published: 2025-07-13

ISSN: 2640-3498

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Conference Paper:
The DOI is not currently available.
