NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

On the Privacy of Sublinear-Communication Jaccard Index Estimation via Min-hash

https://doi.org/10.62056/AK2I5W7SF

Liang, Mingyu; Choi, Seung Geol; Dachman-Soled, Dana; Liu, Linsheng; Yerukhimovich, Arkady (January 2025, IACR Communications in Cryptology)

The min-hash sketch is a well-known technique for low-communication approximation of the Jaccard index between two input sets. Moreover, there is a folklore belief that min-hash sketch-based protocols protect the privacy of the inputs. In this paper, we consider variants of private min-hash sketch based-protocols and investigate this folklore to quantify the privacy of the min-hash sketch. We begin our investigation by presenting a highly-efficient two-party protocol for estimating the Jaccard index while ensuring differential privacy. This protocol adds Laplacian noise to the min-hash sketch counts to provide privacy protection. Then, we aim to understand what privacy, if any, is guaranteed if the results of the min-hash are released without any additional noise, such as in the case of historical data. We begin our investigation by considering the privacy of min-hash in a centralized setting where the hash functions are chosen by the min-hash functionality and are unknown to the participants. We show that in this case the min-hash output satisfies the standard definition of differential privacy (DP) without any additional noise. We next consider a more practical distributed setting, where the hash function must be shared among all parties and is typically public. Unfortunately, we show that in this public hash function setting, the min-hash output is no longer DP. We therefore consider the notion of distributional differential privacy (DDP) introduced by Bassily et al. (FOCS 2013). We show that if the honest party's set has sufficiently high min-entropy, the min-hash output achieves DDP without requiring noise. Our findings provide guidance on how to use the min-hash sketch for private Jaccard index estimation and clarify the extent to which min-hash protocols protect input privacy, refining the common belief in their privacy guarantees.
more » « less
Full Text Available
On the Privacy of Sublinear-Communication Jaccard Index Estimation via Min-hash

Liang, Mingyu; Choi, Seung Geol; Dachman-Soled, Dana; Liu, Linsheng; Yerukhimovich, Arkady (January 2025, IACR communications in cryptology)

Full Text Available
Revisiting the Security of Approximate FHE with Noise-Flooding Countermeasures

https://doi.org/10.1007/978-3-031-91832-2_4

Bergamaschi, Flavio; Costache, Anamaria; Dachman-Soled, Dana; Kippen, Hunter; LaBuff, Lucas; Tang, Rui (January 2025, Springer Nature Switzerland)

Full Text Available
A Canonical Data Transformation for Achieving Inter- and Within-Group Fairness

https://doi.org/10.1109/TIFS.2024.3416040

Lazri, Zachary McBride; Brugere, Ivan; Tian, Xin; Dachman-Soled, Dana; Polychroniadou, Antigoni; Dervovic, Danial; Wu, Min (June 2024, IEEE Transactions on Information Forensics and Security)

Increases in the deployment of machine learning algorithms for applications that deal with sensitive data have brought attention to the issue of fairness in machine learning. Many works have been devoted to applications that require different demographic groups to be treated fairly. However, algorithms that aim to satisfy inter-group fairness (also called group fairness) may inadvertently treat individuals within the same demographic group unfairly. To address this issue, this article introduces a formal definition of within-group fairness that maintains fairness among individuals from within the same group. A pre-processing framework is proposed to meet both inter- and within-group fairness criteria with little compromise in performance. The framework maps the feature vectors of members from different groups to an inter-group fair canonical domain before feeding them into a scoring function. The mapping is constructed to preserve the relative relationship between the scores obtained from the unprocessed feature vectors of individuals from the same demographic group, guaranteeing within-group fairness. This framework has been applied to the Adult, COMPAS risk assessment, and Law School datasets, and its performance is demonstrated and compared with two regularization-based methods in achieving inter-group and within-group fairness.
more » « less
Full Text Available
Breaking RSA Generically Is Equivalent to Factoring, with Preprocessing

https://doi.org/10.4230/LIPIcs.ITC.2024.8

Dachman-Soled, Dana; Loss, Julian; O'Neill, Adam (January 2024, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Aggarwal, Divesh (Ed.)
We investigate the relationship between the classical RSA and factoring problems when preprocessing is considered. In such a model, adversaries can use an unbounded amount of precomputation to produce an "advice" string to then use during the online phase, when a problem instance becomes known. Previous work (e.g., [Bernstein, Lange ASIACRYPT '13]) has shown that preprocessing attacks significantly improve the runtime of the best-known factoring algorithms. Due to these improvements, we ask whether the relationship between factoring and RSA fundamentally changes when preprocessing is allowed. Specifically, we investigate whether there is a superpolynomial gap between the runtime of the best attack on RSA with preprocessing and on factoring with preprocessing. Our main result rules this out with respect to algorithms that perform generic computation on the RSA instance x^e od N yet arbitrary computation on the modulus N, namely a careful adaptation of the well-known generic ring model of Aggarwal and Maurer (Eurocrypt 2009) to the preprocessing setting. In particular, in this setting we show the existence of a factoring algorithm with polynomially related parameters, for any setting of RSA parameters. Our main technical contribution is a set of new information-theoretic techniques that allow us to handle or eliminate cases in which the Aggarwal and Maurer result does not yield a factoring algorithm in the standard model with parameters that are polynomially related to those of the RSA algorithm. These techniques include two novel compression arguments, and a variant of the Fiat-Naor/Hellman tables construction that is tailored to the factoring setting.
more » « less
Full Text Available
Extracting Randomness from Samplable Distributions, Revisited

https://doi.org/10.1109/FOCS57990.2023.00092

Ball, Marshall; Goldin, Eli; Dachman-Soled, Dana; Mutreja, Saachi (November 2023, 2023 IEEE 64th Annual Symposium on Foundations of Computer Science (FOCS))

Full Text Available
Secure Sampling with Sublinear Communication

Choi, Seung Geol; Dachman-Soled, Dana; Gordon, S. Dov; Liu, Linsheng; Yerukhimovich, Arkady (December 2022, Lecture notes in computer science)

Full Text Available
Secure Sampling with Sublinear Communication

https://doi.org/10.1007/978-3-031-22365-5_13

Choi, Seung Geol; Dachman-Soled, Dana; Gordon, S. Dov; Liu, Linsheng; Yerukhimovich, Arkady (December 2022, Springer)

Full Text Available
(Nondeterministic) Hardness vs. Non-malleability

https://doi.org/10.1007/978-3-031-15802-5_6

Ball, Marshall; Dachman-Soled, Dana; Loss, Julian (January 2022, Lecture notes in computer science)

Full Text Available
When Frodo Flips: End-to-End Key Recovery on FrodoKEM via Rowhammer

https://doi.org/10.1145/3548606.3560673

Fahr, Michael; Kippen, Hunter; Kwong, Andrew; Dang, Thinh; Lichtinger, Jacob; Dachman-Soled, Dana; Genkin, Daniel; Nelson, Alexander; Perlner, Ray; Yerukhimovich, Arkady; et al (November 2022, Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security)

Full Text Available

« Prev Next »

Search for: All records