NSF PAR Search | NSF Public Access Repository

On the Price of Differential Privacy for Hierarchical Clustering

Deng, Chengyuan; Gao, Jie; Upadhyay, Jalaj; Wang, Chen; Zhou, Samson (April 2025, International Conference on Learning Representations 2025 (ICLR 2025))

Free, publicly-accessible full text available April 28, 2026
Low Sensitivity Hopsets

Ashvinkumar, Vikrant; Bernstein, Aaron; Deng, Chengyuan; Gao, Jie; Wein, Nicole (January 2025, The 16th Innovations in Theoretical Computer Science (ITCS 2025))

Full Text Available
Low Sensitivity Hopsets

https://doi.org/10.4230/LIPIcs.ITCS.2025.13

Ashvinkumar, Vikrant; Bernstein, Aaron; Deng, Chengyuan; Gao, Jie; Wein, Nicole (January 2025, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Meka, Raghu (Ed.)
Given a weighted graph G = (V,E,w), a (β, ε)-hopset H is an edge set such that for any s,t ∈ V, where s can reach t in G, there is a path from s to t in G ∪ H that uses at most β hops and whose length is in the range [dist_G(s,t), (1+ε)dist_G(s,t)]. We break away from the traditional question that asks for a hopset H that achieves small |H| and small diameter β and instead study the sensitivity of H, a new quality measure. The sensitivity of a vertex (or edge) given a hopset H is, informally, the number of times a single hop in G ∪ H bypasses it; a bit more formally, assuming shortest paths in G are unique, it is the number of hopset edges (s,t) ∈ H such that the vertex (or edge) is contained in the unique st-path in G having length exactly dist_G(s,t). The sensitivity associated with H is then the maximum sensitivity over all vertices (or edges). The highlights of our results are:
- A construction for (Õ(√n), 0)-hopsets on undirected graphs with O(log n) sensitivity, complemented with a lower bound showing that Õ(√n) is tight up to polylogarithmic factors for any construction with polylogarithmic sensitivity.
- A construction for (n^{o(1)}, ε)-hopsets on undirected graphs with n^{o(1)} sensitivity for any ε > 0 that is at least inverse polylogarithmic, complemented with a lower bound on the tradeoff between β, ε, and the sensitivity.
- We define a notion of sensitivity for β-shortcut sets (which are the reachability analogues of hopsets) and give a construction for Õ(√n)-shortcut sets on directed graphs with O(log n) sensitivity, complemented with a lower bound showing that β = Ω̃(n^{1/3}) for any construction with polylogarithmic sensitivity.
We believe hopset sensitivity is a natural measure in and of itself, and could potentially find use in a diverse range of contexts. More concretely, the notion of hopset sensitivity is also directly motivated by the Differentially Private All Sets Range Queries problem [Deng et al. WADS 23]. Our result for O(log n) sensitivity (Õ(√n), 0)-hopsets on undirected graphs immediately improves the current best-known upper bound on utility from Õ(n^{1/3}) to Õ(n^{1/4}) in the pure-DP setting, which is tight up to polylogarithmic factors.
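The definition of vertex sensitivity in the abstract above can be made concrete with a minimal sketch. It is not the paper's construction: the graph, the hopset, and the helper names `shortest_path` and `vertex_sensitivity` are illustrative assumptions, shortest paths are assumed unique, and path endpoints are counted as "contained in" the path.

```python
import heapq
from collections import Counter

def shortest_path(adj, s, t):
    """Dijkstra; returns the vertex list of a shortest s-t path, or None."""
    dist = {s: 0}
    parent = {s: None}
    heap = [(0, s)]
    done = set()
    while heap:
        d, u = heapq.heappop(heap)
        if u in done:
            continue
        done.add(u)
        if u == t:
            break
        for v, w in adj[u]:
            nd = d + w
            if v not in dist or nd < dist[v]:
                dist[v] = nd
                parent[v] = u
                heapq.heappush(heap, (nd, v))
    if t not in done:
        return None
    path = []
    while t is not None:
        path.append(t)
        t = parent[t]
    return path[::-1]

def vertex_sensitivity(adj, hopset):
    """For each vertex, count the hopset edges (s, t) whose shortest
    s-t path in G contains it (endpoints included, by assumption)."""
    counts = Counter()
    for s, t in hopset:
        path = shortest_path(adj, s, t)
        if path is not None:
            counts.update(path)
    return counts

# Hypothetical example: the unit-weight path graph 0-1-2-3-4.
edges = [(0, 1, 1), (1, 2, 1), (2, 3, 1), (3, 4, 1)]
adj = {v: [] for v in range(5)}
for u, v, w in edges:
    adj[u].append((v, w))
    adj[v].append((u, w))

hopset = [(0, 2), (2, 4), (0, 4)]
sens = vertex_sensitivity(adj, hopset)
max_sensitivity = max(sens.values())  # sensitivity associated with the hopset
print(dict(sens), max_sensitivity)
```

In this toy instance the middle vertex lies on all three bypassed paths, so it determines the sensitivity of the hopset.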
Full Text Available
Neuc-MDS: Non-Euclidean Multidimensional Scaling Through Bilinear Forms

Deng, Chengyuan; Gao, Jie; Lu, Kevin; Luo, Feng; Sun, Hongbin; Xin, Cheng (December 2024, NIPS '24: Proceedings of the 38th International Conference on Neural Information Processing Systems)

Full Text Available
Neuc-MDS: Non-Euclidean Multidimensional Scaling Through Bilinear Forms

Deng, Chengyuan; Gao, Jie; Lu, Kevin; Luo, Feng; Sun, Hongbin; Xin, Cheng (December 2024, The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS))

We introduce Non-Euclidean-MDS (Neuc-MDS), an extension of classical Multidimensional Scaling (MDS) that accommodates non-Euclidean and non-metric inputs. The main idea is to generalize the standard inner product to symmetric bilinear forms to utilize the negative eigenvalues of dissimilarity Gram matrices. Neuc-MDS efficiently optimizes the choice of (both positive and negative) eigenvalues of the dissimilarity Gram matrix to reduce STRESS, the sum of squared pairwise error. We provide an in-depth error analysis and proofs of the optimality in minimizing lower bounds of STRESS. We demonstrate Neuc-MDS’s ability to address limitations of classical MDS raised by prior research, and test it on various synthetic and real-world datasets in comparison with both linear and non-linear dimension reduction methods.
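The idea of replacing the inner product with a symmetric bilinear form can be written out as follows. This is a sketch under assumed notation, not the paper's exact formulation: D is taken to hold squared dissimilarities, C is the centering matrix, x_i are embedded points, and (p, q) counts the positive and negative eigenvalues that are kept. How those eigenvalues are selected is the paper's contribution and is not reproduced here.

```latex
% Assumed notation: D holds squared dissimilarities, C is the centering matrix.
\begin{aligned}
B &= -\tfrac{1}{2}\, C D C, \qquad C = I - \tfrac{1}{n}\mathbf{1}\mathbf{1}^{\top}, \qquad B = U \Lambda U^{\top},\\
\langle x, y \rangle_{p,q} &= \sum_{i=1}^{p} x_i y_i \;-\; \sum_{i=p+1}^{p+q} x_i y_i,\\
\hat{D}_{ij} &= \langle x_i - x_j,\; x_i - x_j \rangle_{p,q},\\
\mathrm{STRESS} &= \sum_{i,j} \bigl(\hat{D}_{ij} - D_{ij}\bigr)^2 .
\end{aligned}
```

With q = 0 the bilinear form reduces to the standard inner product, which corresponds to the classical MDS setting that keeps only positive eigenvalues.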
Full Text Available
The Discrepancy of Shortest Paths

Bodwin, Greg; Deng, Chengyuan; Gao, Jie; Hoppenworth, Gary; Upadhyay, Jalaj; Wang, Chen (July 2024, Proceedings of ICALP (International Colloquium on Automata, Languages and Programming))

Full Text Available
The Discrepancy of Shortest Paths

https://doi.org/10.4230/LIPIcs.ICALP.2024.27

Bodwin, Greg; Deng, Chengyuan; Gao, Jie; Hoppenworth, Gary; Upadhyay, Jalaj; Wang, Chen (January 2024, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Bringmann, Karl; Grohe, Martin; Puppis, Gabriele; Svensson, Ola (Eds.)
The hereditary discrepancy of a set system is a quantitative measure of the pseudorandom properties of the system. Roughly speaking, hereditary discrepancy measures how well one can 2-color the elements of the system so that each set contains approximately the same number of elements of each color. Hereditary discrepancy has numerous applications in computational geometry, communication complexity and derandomization. More recently, the hereditary discrepancy of the set system of shortest paths has found applications in differential privacy [Chen et al. SODA 23]. The contribution of this paper is to improve the upper and lower bounds on the hereditary discrepancy of set systems of unique shortest paths in graphs. In particular, we show that any system of unique shortest paths in an undirected weighted graph has hereditary discrepancy O(n^{1/4}), and we construct lower bound examples demonstrating that this bound is tight up to polylog n factors. Our lower bounds hold even for planar graphs and bipartite graphs, and improve a previous lower bound of Ω(n^{1/6}) obtained by applying the trace bound of Chazelle and Lvov [SoCG'00] to a classical point-line system of Erdős. As applications, we improve the lower bound on the additive error for differentially-private all pairs shortest distances from Ω(n^{1/6}) [Chen et al. SODA 23] to Ω̃(n^{1/4}), and we improve the lower bound on additive error for the differentially-private all sets range queries problem to Ω̃(n^{1/4}), which is tight up to polylog n factors [Deng et al. WADS 23].
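To illustrate the quantity being bounded, the following brute-force sketch computes discrepancy and hereditary discrepancy for a tiny set system. It only makes the definitions concrete and is exponential in the number of elements; the function names and the example (shortest paths of a 4-vertex path graph, enumerated by hand) are assumptions, not taken from the paper.

```python
from itertools import combinations, product

def discrepancy(elements, sets):
    """Min over 2-colorings of the max color imbalance over all sets."""
    best = None
    for signs in product((1, -1), repeat=len(elements)):
        color = dict(zip(elements, signs))
        worst = max((abs(sum(color[x] for x in s)) for s in sets), default=0)
        if best is None or worst < best:
            best = worst
    return best

def hereditary_discrepancy(elements, sets):
    """Max discrepancy over all restrictions of the system to a subset
    of the elements."""
    herdisc = 0
    for k in range(1, len(elements) + 1):
        for subset in combinations(elements, k):
            keep = set(subset)
            restricted = [s & keep for s in sets]
            herdisc = max(herdisc, discrepancy(subset, restricted))
    return herdisc

# Hypothetical example: on the path graph 0-1-2-3 the unique shortest
# path between i and j is the vertex interval {i, ..., j}.
vertices = (0, 1, 2, 3)
paths = [frozenset(range(i, j + 1)) for i in range(4) for j in range(i + 1, 4)]

disc = discrepancy(vertices, paths)
herdisc = hereditary_discrepancy(vertices, paths)
print(disc, herdisc)
```

For intervals an alternating coloring keeps every set within imbalance 1, so this example is far from the worst case; the lower-bound constructions in the abstract are what force the larger bound.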
Full Text Available
Evaluating Stability in Massive Social Networks: Efficient Streaming Algorithms for Structural Balance

Ashvinkumar, Vikrant; Assadi, Sepehr; Deng, Chengyuan; Gao, Jie; Wang, Chen (September 2023, Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2023))

Full Text Available
Differentially Private Range Query on Shortest Paths

Deng, Chengyuan; Gao, Jie; Upadhyay, Jalaj; Wang, Chen (July 2023, Proceedings of Workshop on Algorithms and Data Structures)

Full Text Available
Evaluating Stability in Massive Social Networks: Efficient Streaming Algorithms for Structural Balance

https://doi.org/10.4230/LIPICS.APPROX/RANDOM.2023.58

Ashvinkumar, Vikrant; Assadi, Sepehr; Deng, Chengyuan; Gao, Jie; Wang, Chen (January 2023, APPROX-RANDOM)
Megow, Nicole; Smith, Adam (Eds.)
Structural balance theory studies stability in networks. Given an n-vertex complete graph G = (V,E) whose edges are labeled positive or negative, the graph is considered balanced if every triangle either consists of three positive edges (three mutual "friends"), or one positive edge and two negative edges (two "friends" with a common "enemy"). From a computational perspective, structural balance turns out to be a special case of correlation clustering with the number of clusters at most two. The two main algorithmic problems of interest are: (i) detecting whether a given graph is balanced, or (ii) finding a partition that approximates the frustration index, i.e., the minimum number of edge flips that turn the graph balanced. We study these problems in the streaming model where edges are given one by one and focus on memory efficiency. We provide randomized single-pass algorithms for: (i) determining whether an input graph is balanced with O(log n) memory, and (ii) finding a partition that induces a (1 + ε)-approximation to the frustration index with O(n ⋅ polylog(n)) memory. We further provide several new lower bounds, complementing different aspects of our algorithms such as the need for randomization or approximation. To obtain our main results, we develop a method using pseudorandom generators (PRGs) to sample edges between independently-chosen vertices in graph streaming. Furthermore, our algorithm that approximates the frustration index improves the running time of the state-of-the-art correlation clustering with two clusters (Giotis-Guruswami algorithm [SODA 2006]) from n^{O(1/ε²)} to O(n²log³n/ε² + n log n ⋅ (1/ε)^{O(1/ε⁴)}) time for (1+ε)-approximation. These results may be of independent interest.
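The two problems stated in the abstract can be illustrated with an offline baseline. This is not the paper's streaming algorithm: it stores all edge signs (quadratic memory) and computes the frustration index by exhaustive search, so it only suits tiny graphs. It relies on the fact that a complete signed graph is balanced exactly when its vertices split into two sides with positive edges inside each side and negative edges across. The function names and the example are assumptions.

```python
from itertools import combinations, product

def is_balanced(n, sign):
    """sign[(u, v)] with u < v is +1 or -1 for every pair of a complete graph."""
    # The edges at vertex 0 force the only candidate two-sided partition
    # (up to swapping the sides); then every edge is checked against it.
    side = [1] + [sign[(0, v)] for v in range(1, n)]
    return all(sign[(u, v)] == side[u] * side[v]
               for u, v in combinations(range(n), 2))

def frustration_index(n, sign):
    """Minimum number of edge flips that make the graph balanced,
    found by trying every partition into at most two sides."""
    best = None
    for rest in product((1, -1), repeat=n - 1):
        side = (1,) + rest
        flips = sum(sign[(u, v)] != side[u] * side[v]
                    for u, v in combinations(range(n), 2))
        best = flips if best is None else min(best, flips)
    return best

# Hypothetical example: two camps {0, 1} and {2, 3}.
n = 4
camp = [0, 0, 1, 1]
balanced_sign = {(u, v): 1 if camp[u] == camp[v] else -1
                 for u, v in combinations(range(n), 2)}

# Turn the friendship (0, 1) into an enmity: triangle 0-1-2 becomes all negative.
perturbed_sign = dict(balanced_sign)
perturbed_sign[(0, 1)] = -1

print(is_balanced(n, balanced_sign), frustration_index(n, balanced_sign))
print(is_balanced(n, perturbed_sign), frustration_index(n, perturbed_sign))
```

The exhaustive search makes the definition of the frustration index explicit; the streaming results in the abstract are precisely about avoiding this memory and time cost.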
Full Text Available
