Search for: All records

Award ID contains: 1750362

Total Resources: 16

Resource Type
Conference Paper: 11
Conference Proceeding: 0
Dataset: 0
Journal Article: 5
Workshop Report: 0

Availability
Full Text / Resource Available: 16
Citation Only: 0

Fundamental limits for rank-one matrix estimation with groupwise heteroskedasticity

Behne, J. ( January 2022 , International Conference on Artificial Intelligence and Statistics)

Low-rank matrix recovery problems involving high-dimensional and heterogeneous data appear in applications throughout statistics and machine learning. The contribution of this paper is to establish the fundamental limits of recovery for a broad class of these problems. In particular, we study the problem of estimating a rank-one matrix from Gaussian observations where different blocks of the matrix are observed under different noise levels. In the setting where the number of blocks is fixed while the number of variables tends to infinity, we prove asymptotically exact formulas for the minimum mean-squared error in estimating both the matrix and underlying factors. These results are based on a novel reduction from the low-rank matrix tensor product model (with homogeneous noise) to a rank-one model with heteroskedastic noise. As an application of our main result, we show that recently proposed methods based on applying principal component analysis (PCA) to weighted combinations of the data are optimal in some settings but sub-optimal in others. We also provide numerical results comparing our asymptotic formulas with the performance of methods based on weighted PCA, gradient descent, and approximate message passing.
Full Text Available
k-Sliced Mutual Information: A Quantitative Study of Scalability with Dimension

Goldfeld, Z. ; Greenewald, K. ; Nuradha, T. ; Reeves, G. ( January 2022 , Conference on Neural Information Processing Systems)

Full Text Available
Gaussian Approximation of Quantization Error for Estimation From Compressed Data

https://doi.org/10.1109/TIT.2021.3083271

Kipnis, Alon ; Reeves, Galen ( August 2021 , IEEE Transactions on Information Theory)

Full Text Available
Convergence of Gaussian-smoothed optimal transport distance with sub-gamma distributions and dependent samples

Zhang, Yixing ; Cheng, Xiuyuan ; Reeves, Galen ( April 2021 , International Conference on Artificial Intelligence and Statistics)

The Gaussian-smoothed optimal transport (GOT) framework, recently proposed by Goldfeld et al., scales to high dimensions in estimation and provides an alternative to entropy regularization. This paper provides convergence guarantees for estimating the GOT distance under more general settings. For the Gaussian-smoothed $p$-Wasserstein distance in $d$ dimensions, our results require only the existence of a moment greater than $d + 2p$. For the special case of sub-gamma distributions, we quantify the dependence on the dimension $d$ and establish a phase transition with respect to the scale parameter. We also prove convergence for dependent samples, only requiring a condition on the pairwise dependence of the samples measured by the covariance of the feature map of a kernel space. A key step in our analysis is to show that the GOT distance is dominated by a family of kernel maximum mean discrepancy (MMD) distances with a kernel that depends on the cost function as well as the amount of Gaussian smoothing. This insight provides further interpretability for the GOT framework and also introduces a class of kernel MMD distances with desirable properties. The theoretical results are supported by numerical experiments.
Full Text Available
A Two-Moment Inequality with Applications to Rényi Entropy and Mutual Information

https://doi.org/10.3390/e22111244

Reeves, Galen ( November 2020 , Entropy)
This paper explores some applications of a two-moment inequality for the integral of the rth power of a function, where 0 …
Full Text Available
Information-Theoretic Limits for the Matrix Tensor Product

https://doi.org/10.1109/JSAIT.2020.3040598

Reeves, Galen ( November 2020 , IEEE Journal on Selected Areas in Information Theory)
Information-theoretic limits of a multiview low-rank symmetric spiked matrix model

https://doi.org/10.1109/ISIT44484.2020.9173970

Barbier, Jean ; Reeves, Galen ( June 2020 , 2020 IEEE International Symposium on Information Theory (ISIT))
We consider a generalization of an important class of high-dimensional inference problems, namely spiked symmetric matrix models, often used as probabilistic models for principal component analysis. Such paradigmatic models have recently attracted a lot of attention from a number of communities due to their phenomenological richness with statistical-to-computational gaps, while remaining tractable. We rigorously establish the information-theoretic limits through the proof of single-letter formulas for the mutual information and minimum mean-square error. On the technical side, we improve the recently introduced adaptive interpolation method so that it can be used to study low-rank models (i.e., estimation problems of "tall matrices") in full generality, an important step towards the rigorous analysis of more complicated inference and learning models.
Full Text Available
The all-or-nothing phenomenon in sparse linear regression

https://doi.org/10.4171/MSL/22

Reeves, Galen ; Xu, Jiaming ; Zadik, Ilias ( January 2020 , Mathematical Statistics and Learning)

Full Text Available
All-or-Nothing Phenomena: From Single-Letter to High Dimensions

https://doi.org/10.1109/CAMSAP45676.2019.9022473

Reeves, Galen ; Xu, Jiaming ; Zadik, Ilias ( December 2019 , 2019 IEEE 8th International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP))

We consider the problem of estimating a $p$-dimensional vector $\beta$ from $n$ observations $Y = X\beta + W$, where $\beta_{j} \overset{\mathrm{i.i.d.}}{\sim} \pi$ for a real-valued distribution $\pi$ with zero mean and unit variance, $X_{ij} \overset{\mathrm{i.i.d.}}{\sim} \mathcal{N}(0,1)$, and $W_{i} \overset{\mathrm{i.i.d.}}{\sim} \mathcal{N}(0, \sigma^{2})$. In the asymptotic regime where $n/p \rightarrow \delta$ and $p/\sigma^{2} \rightarrow \mathsf{snr}$ for two fixed constants $\delta, \mathsf{snr} \in (0, \infty)$ as $p \rightarrow \infty$, the limiting (normalized) minimum mean-squared error (MMSE) has been characterized by a single-letter (additive Gaussian scalar) channel. In this paper, we show that if the MMSE function of the single-letter channel converges to a step function, then the limiting MMSE of estimating $\beta$ converges to a step function which jumps from 1 to 0 at a critical threshold. Moreover, we establish that the limiting mean-squared error of the (MSE-optimal) approximate message passing algorithm also converges to a step function with a larger threshold, providing evidence for the presence of a computational-statistical gap between the two thresholds.
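The observation model in this abstract can be simulated in a few lines. The following is a minimal sketch, not the authors' code: the function name `sample_linear_model` is hypothetical, and the Rademacher prior is an assumption chosen only because it is one distribution with zero mean and unit variance. The noise variance is set to $\sigma^{2} = p/\mathsf{snr}$, matching the stated scaling $p/\sigma^{2} \rightarrow \mathsf{snr}$.

```python
import numpy as np

def sample_linear_model(n, p, snr, rng):
    """Draw (Y, X, beta) from Y = X beta + W with sigma^2 = p / snr."""
    sigma2 = p / snr
    # Assumed prior pi: Rademacher (+1/-1), which has zero mean and unit variance.
    beta = rng.choice([-1.0, 1.0], size=p)
    # i.i.d. standard Gaussian design matrix, as in the abstract.
    X = rng.standard_normal((n, p))
    # i.i.d. Gaussian noise with variance sigma^2.
    W = np.sqrt(sigma2) * rng.standard_normal(n)
    return X @ beta + W, X, beta

rng = np.random.default_rng(0)
# Example sizes (illustrative only): delta = n / p = 2, snr = 4.
Y, X, beta = sample_linear_model(n=200, p=100, snr=4.0, rng=rng)
```

Any other zero-mean, unit-variance prior could be substituted for the Rademacher draw without changing the rest of the sketch.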
Full Text Available
Gaussian Mixture Models for Stochastic Block Models with Non-Vanishing Noise

https://doi.org/10.1109/CAMSAP45676.2019.9022612

Mathews, Heather ; Mayya, Vaishakhi ; Volfovsky, Alexander ; Reeves, Galen ( December 2019 , 2019 IEEE 8th International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP))

Community detection tasks have received a lot of attention across statistics, machine learning, and information theory with work concentrating on providing theoretical guarantees for different methodological approaches to the stochastic block model. Recent work on community detection has focused on modeling the spectral embedding of a network using Gaussian mixture models (GMMs) in scaling regimes where the ability to detect community memberships improves with the size of the network. However, these regimes are not very realistic. This paper provides tractable methodology motivated by new theoretical results for networks with non-vanishing noise. We present a procedure for community detection using novel GMMs that incorporate truncation and shrinkage effects. We provide empirical validation of this new representation as well as experimental results using a large email dataset.
Full Text Available
