NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Fast parallel sampling under isoperimetry

Anari, Nima; Chewi, Sinho; Vuong, Thuy-Duong (June 2024, Proceedings of Thirty Seventh Conference on Learning Theory)

Full Text Available
Trickle-Down in Localization Schemes and Applications

https://doi.org/10.1145/3618260.3649622

Anari, Nima; Koehler, Frederic; Vuong, Thuy-Duong (June 2024, ACM)

Full Text Available
Parallel Sampling via Counting

https://doi.org/10.1145/3618260.3649744

Anari, Nima; Gao, Ruiquan; Rubinstein, Aviad (June 2024, ACM)

Full Text Available
Universality of Spectral Independence with Applications to Fast Mixing in Spin Glasses

Anari, Nima; Jain, Vishesh; Koehler, Frederic; Pham, Huy_Tuan; Vuong, Thuy-Duong (April 2024, Proceedings of the 2024 ACM-SIAM Symposium on Discrete Algorithms, SODA 2024)

Full Text Available
Parallel Sampling of Diffusion Models

Shih, Andy; Belkhale, Suneel; Ermon, Stefano; Sadigh, Dorsa; Anari, Nima (March 2024, Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023)

Full Text Available
Distortion in metric matching with ordinal preferences

https://doi.org/10.1145/3580507.3597740

Anari, Nima; Charikar, Moses; Ramakrishnan, Prasanna (July 2023, ACM)

Suppose that we have $$n$$ agents and $$n$$ items which lie in a shared metric space. We would like to match the agents to items such that the total distance from agents to their matched items is as small as possible. However, instead of having direct access to distances in the metric, we only have each agent's ranking of the items in order of distance. Given this limited information, what is the minimum possible worst-case approximation ratio (known as the \emph{distortion}) that a matching mechanism can guarantee? Previous work by \citet{CFRF+16} proved that the (deterministic) Serial Dictatorship mechanism has distortion at most $$2^n - 1$. We improve this by providing a simple deterministic mechanism that has distortion $O(n^2)$. We also provide the first nontrivial lower bound on this problem, showing that any matching mechanism (deterministic or randomized) must have worst-case distortion $$\Omega(\log n)$$. In addition to these new bounds, we show that a large class of truthful mechanisms derived from Deferred Acceptance all have worst-case distortion at least $2^n - 1$, and we find an intriguing connection between \emph{thin matchings} (analogous to the well-known thin trees conjecture) and the distortion gap between deterministic and randomized mechanisms.
more » « less
Full Text Available
Quadratic Speedups in Parallel Sampling from Determinantal Distributions

https://doi.org/10.1145/3558481.3591104

Anari, Nima; Burgess, Callum; Tian, Kevin; Vuong, Thuy-Duong (June 2023, ACM)

We study the problem of parallelizing sampling from distributions related to determinants: symmetric, nonsymmetric, and partition-constrained determinantal point processes, as well as planar perfect matchings. For these distributions, the partition function, a.k.a.\ the count, can be obtained via matrix determinants, a highly parallelizable computation; Csanky proved it is in NC. However, parallel counting does not automatically translate to parallel sampling, as classic reductions between the two are inherently sequential. We show that a nearly quadratic parallel speedup over sequential sampling can be achieved for all the aforementioned distributions. If the distribution is supported on subsets of size $$k$$ of a ground set, we show how to approximately produce a sample in $$\widetilde{O}(k^{\frac{1}{2} + c})$$ time with polynomially many processors for any constant $c>0$. In the two special cases of symmetric determinantal point processes and planar perfect matchings, our bound improves to $$\widetilde{O}(\sqrt k)$$ and we show how to sample exactly in these cases. As our main technical contribution, we fully characterize the limits of batching for the steps of sampling-to-counting reductions. We observe that only $O(1)$ steps can be batched together if we strive for exact sampling, even in the case of nonsymmetric determinantal point processes. However, we show that for approximate sampling, $$\widetilde{\Omega}(k^{\frac{1}{2}-c})$$ steps can be batched together, for any entropically independent distribution, which includes all mentioned classes of determinantal point processes. Entropic independence and related notions have been the source of breakthroughs in Markov chain analysis in recent years, so we expect our framework to prove useful for distributions beyond those studied in this work.
more » « less
Full Text Available
Parallel Discrete Sampling via Continuous Walks

https://doi.org/10.1145/3564246.3585207

Anari, Nima; Huang, Yizhi; Liu, Tianyu; Vuong, Thuy-Duong; Xu, Brian; Yu, Katherine (June 2023, ACM)

We develop a framework for sampling from discrete distributions $$\mu$$ on the hypercube $$\{\pm 1\}^n$$ by sampling from continuous distributions supported on $$\mathbb{R}^n$$ obtained by convolution with spherical Gaussians. We show that for well-studied families of discrete distributions $$\mu$$, convolving $$\mu$$ with Gaussians yields well-conditioned log-concave distributions, as long as the variance of the Gaussian is above an $O(1)$ threshold. We then reduce the task of sampling from $$\mu$$ to sampling from Gaussian-convolved distributions. Our reduction is based on a stochastic process widely studied under different names: backward diffusion in diffusion models, and stochastic localization. We discretize this process in a novel way that allows for high accuracy and parallelism. As our main application, we resolve open questions Anari, Hu, Saberi, and Schild raised on the parallel sampling of distributions that admit parallel counting. We show that determinantal point processes can be sampled via RNC algorithms, that is in time $$\log(n)^{O(1)}$$ using $$n^{O(1)}$$ processors. For a wider class of distributions, we show our framework yields Quasi-RNC sampling, i.e., $$\log(n)^{O(1)}$$ time using $$n^{O(\log n)}$$ processors. This wider class includes non-symmetric determinantal point processes and random Eulerian tours in digraphs, the latter nearly resolving another open question raised by prior work. Of potentially independent interest, we introduce and study a notion of smoothness for discrete distributions that we call transport stability, which we use to control the propagation of error in our framework. Additionally, we connect transport stability to constructions of optimally mixing local random walks and concentration inequalities.
more » « less
Full Text Available
Optimal Sublinear Sampling of Spanning Trees and Determinantal Point Processes via Average-Case Entropic Independence

https://doi.org/10.1109/FOCS54457.2022.00019

Anari, Nima; Liu, Yang P.; Vuong, Thuy-Duong (October 2022, 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS))

We design fast algorithms for repeatedly sampling from strongly Rayleigh distributions, which include as special cases random spanning tree distributions and determinantal point processes. For a graph $G=(V, E)$, we show how to approximately sample uniformly random spanning trees from $$G$$ in $$\widetilde{O}(\lvert V\rvert)$$\footnote{Throughout, $$\widetilde{O}(\cdot)$$ hides polylogarithmic factors in $$n$$.} time per sample after an initial $$\widetilde{O}(\lvert E\rvert)$$ time preprocessing. This is the first nearly-linear runtime in the output size, which is clearly optimal. For a determinantal point process on $$k$$-sized subsets of a ground set of $$n$$ elements, defined via an $$n\times n$$ kernel matrix, we show how to approximately sample in $$\widetilde{O}(k^\omega)$$ time after an initial $$\widetilde{O}(nk^{\omega-1})$$ time preprocessing, where $$\omega<2.372864$$ is the matrix multiplication exponent. The time to compute just the weight of the output set is simply $$\simeq k^\omega$$, a natural barrier that suggests our runtime might be optimal for determinantal point processes as well. As a corollary, we even improve the state of the art for obtaining a single sample from a determinantal point process, from the prior runtime of $$\widetilde{O}(\min\{nk^2, n^\omega\})$$ to $$\widetilde{O}(nk^{\omega-1})$$. In our main technical result, we achieve the optimal limit on domain sparsification for strongly Rayleigh distributions. In domain sparsification, sampling from a distribution $$\mu$$ on $$\binom{[n]}{k}$$ is reduced to sampling from related distributions on $$\binom{[t]}{k}$$ for $$t\ll n$$. We show that for strongly Rayleigh distributions, the domain size can be reduced to nearly linear in the output size $$t=\widetilde{O}(k)$$, improving the state of the art from $$t= \widetilde{O}(k^2)$$ for general strongly Rayleigh distributions and the more specialized $$t=\widetilde{O}(k^{1.5})$$ for spanning tree distributions. Our reduction involves sampling from $$\widetilde{O}(1)$$ domain-sparsified distributions, all of which can be produced efficiently assuming approximate overestimates for marginals of $$\mu$$ are known and stored in a convenient data structure. Having access to marginals is the discrete analog of having access to the mean and covariance of a continuous distribution, or equivalently knowing ``isotropy'' for the distribution, the key behind optimal samplers in the continuous setting based on the famous Kannan-Lov\'asz-Simonovits (KLS) conjecture. We view our result as analogous in spirit to the KLS conjecture and its consequences for sampling, but rather for discrete strongly Rayleigh measures.
more » « less
Full Text Available
From Sampling to Optimization on Discrete Domains with Applications to Determinant Maximization

Anari, Nima; Vuong, Thuy-Duong (July 2022, Proceedings of Thirty Fifth Conference on Learning Theory)
Loh, Po-Ling; Raginsky, Maxim (Ed.)
We establish a connection between sampling and optimization on discrete domains. For a family of distributions $$\mu$$ defined on size $$k$$ subsets of a ground set of elements, that is closed under external fields, we show that rapid mixing of natural local random walks implies the existence of simple approximation algorithms to find $$\max \mu(\cdot)$$. More precisely, we show that if $$t$$-step down-up random walks have spectral gap at least inverse polynomially large, then $$t$$-step local search finds $$\max \mu(\cdot)$$ within a factor of $$k^{O(k)}$$. As the main application of our result, we show that $$2$$-step local search achieves a nearly-optimal $$k^{O(k)}$$-factor approximation for MAP inference on nonsymmetric $$k$$-DPPs. This is the first nontrivial multiplicative approximation algorithm for this problem. In our main technical result, we show that an exchange inequality, a concept rooted in discrete convex analysis, can be derived from fast mixing of local random walks. We further advance the state of the art on the mixing of random walks for nonsymmetric DPPs and more generally sector-stable distributions, by obtaining the tightest possible bound on the step size needed for polynomial-time mixing of random walks. We bring the step size down by a factor of $$2$$ compared to prior works, and consequently get a quadratic improvement on the runtime of local search steps; this improvement is potentially of independent interest in sampling applications.
more » « less
Full Text Available

« Prev Next »

Search for: All records