NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The Johnson-Lindenstrauss Lemma for Clustering and Subspace Approximation: From Coresets to Dimension Reduction

Charikar, Moses; Waingarten, Erik (January 2025, Proceedings of the 2025 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA))

Free, publicly-accessible full text available January 12, 2026
A Quasi-Monte Carlo Data Structure for Smooth Kernel Evaluations

Charikar, Moses; Kapralov, Michael; Waingarten, Erik (January 2024, Society for Industrial and Applied Mathematics)

Full Text Available
Distortion in metric matching with ordinal preferences

https://doi.org/10.1145/3580507.3597740

Anari, Nima; Charikar, Moses; Ramakrishnan, Prasanna (July 2023, ACM)

Suppose that we have $$n$$ agents and $$n$$ items which lie in a shared metric space. We would like to match the agents to items such that the total distance from agents to their matched items is as small as possible. However, instead of having direct access to distances in the metric, we only have each agent's ranking of the items in order of distance. Given this limited information, what is the minimum possible worst-case approximation ratio (known as the \emph{distortion}) that a matching mechanism can guarantee? Previous work by \citet{CFRF+16} proved that the (deterministic) Serial Dictatorship mechanism has distortion at most $$2^n - 1$. We improve this by providing a simple deterministic mechanism that has distortion $O(n^2)$. We also provide the first nontrivial lower bound on this problem, showing that any matching mechanism (deterministic or randomized) must have worst-case distortion $$\Omega(\log n)$$. In addition to these new bounds, we show that a large class of truthful mechanisms derived from Deferred Acceptance all have worst-case distortion at least $2^n - 1$, and we find an intriguing connection between \emph{thin matchings} (analogous to the well-known thin trees conjecture) and the distortion gap between deterministic and randomized mechanisms.
more » « less
Full Text Available
Fast Algorithms for a New Relaxation of Optimal Transport

Charikar, Moses; Chen, Beidi; Re, Christopher; Waingarten, Erik (July 2023, Proceedings of Machine Learning Research)

Full Text Available
Distributed algorithms from arboreal ants for the shortest path problem

https://doi.org/10.1073/pnas.2207959120

Garg, Shivam; Shiragur, Kirankumar; Gordon, Deborah M.; Charikar, Moses (February 2023, Proceedings of the National Academy of Sciences)

Colonies of the arboreal turtle ant create networks of trails that link nests and food sources on the graph formed by branches and vines in the canopy of the tropical forest. Ants put down a volatile pheromone on the edges as they traverse them. At each vertex, the next edge to traverse is chosen using a decision rule based on the current pheromone level. There is a bidirectional flow of ants around the network. In a previous field study, it was observed that the trail networks approximately minimize the number of vertices, thus solving a variant of the popular shortest path problem without any central control and with minimal computational resources. We propose a biologically plausible model, based on a variant of the reinforced random walk on a graph, which explains this observation and suggests surprising algorithms for the shortest path problem and its variants. Through simulations and analysis, we show that when the rate of flow of ants does not change, the dynamics converges to the path with the minimum number of vertices, as observed in the field. The dynamics converges to the shortest path when the rate of flow increases with time, so the colony can solve the shortest path problem merely by increasing the flow rate. We also show that to guarantee convergence to the shortest path, bidirectional flow and a decision rule dividing the flow in proportion to the pheromone level are necessary, but convergence to approximately short paths is possible with other decision rules.
more » « less
Full Text Available
Simple, Scalable and Effective Clustering via One-Dimensional Projections

Charikar, Moses; Henzinger, Monika; Hu, Lunjia; Vötsch, Maximilian; Waingarten, Erik (January 2023, Advances in Neural Information Processing Systems)
Oh, A; Naumann, T; Globerson, A; Saenko, K; Hardt, M; Levine, S (Ed.)
Full Text Available
Polylogarithmic Sketches for Clustering

https://doi.org/10.4230/LIPIcs.ICALP.2022.38

Charikar, Moses; Waingarten, Erik (January 2022, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Bojańczyk, Mikołaj; Merelli, Emanuela; Woodruff, David P (Ed.)
Given n points in 𝓁_p^d, we consider the problem of partitioning points into k clusters with associated centers. The cost of a clustering is the sum of p-th powers of distances of points to their cluster centers. For p ∈ [1,2], we design sketches of size poly(log(nd),k,1/ε) such that the cost of the optimal clustering can be estimated to within factor 1+ε, despite the fact that the compressed representation does not contain enough information to recover the cluster centers or the partition into clusters. This leads to a streaming algorithm for estimating the clustering cost with space poly(log(nd),k,1/ε). We also obtain a distributed memory algorithm, where the n points are arbitrarily partitioned amongst m machines, each of which sends information to a central party who then computes an approximation of the clustering cost. Prior to this work, no such streaming or distributed-memory algorithm was known with sublinear dependence on d for p ∈ [1,2).
more » « less
Full Text Available
Near-Optimal Explainable k-Means for All Dimensions

https://doi.org/10.1137/1.9781611977073.101

Charikar, Moses; Hu, Lunjia (January 2022, Proceedings of the annual ACMSIAM Symposium on Discrete Algorithms)

Full Text Available
On the Efficient Implementation of High Accuracy Optimality of Profile Maximum Likelihood

Charikar, Moses; Jiang, Zhihao; Shiragur, Kirankumar; Sidford, Aaron (January 2022, Advances in Neural Information Processing Systems 35 (NeurIPS 2022))

Full Text Available
Brief Announcement: A Randomness-efficient Massively Parallel Algorithm for Connectivity

https://doi.org/10.1145/3465084.3467951

Charikar, Moses; Ma, Weiyun; Tan, Li-Yang (July 2021, PODC'21: Proceedings of the 2021 ACM Symposium on Principles of Distributed Computing)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records