NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Lasso with Latents: Efficient Estimation, Covariate Rescaling, and Computational-Statistical Gaps

Kelner, Jonathan; Koehler, Frederic; Meka, Raghu; Rohatgi, Dhruv (February 2025, stat.ML)

t is well-known that the statistical performance of Lasso can suffer significantly when the covariates of interest have strong correlations. In particular, the prediction error of Lasso becomes much worse than computationally inefficient alternatives like Best Subset Selection. Due to a large conjectured computational-statistical tradeoff in the problem of sparse linear regression, it may be impossible to close this gap in general. In this work, we propose a natural sparse linear regression setting where strong correlations between covariates arise from unobserved latent variables. In this setting, we analyze the problem caused by strong correlations and design a surprisingly simple fix. While Lasso with standard normalization of covariates fails, there exists a heterogeneous scaling of the covariates with which Lasso will suddenly obtain strong provable guarantees for estimation. Moreover, we design a simple, efficient procedure for computing such a "smart scaling." The sample complexity of the resulting "rescaled Lasso" algorithm incurs (in the worst case) quadratic dependence on the sparsity of the underlying signal. While this dependence is not information-theoretically necessary, we give evidence that it is optimal among the class of polynomial-time algorithms, via the method of low-degree polynomials. This argument reveals a new connection between sparse linear regression and a special version of sparse PCA with a near-critical negative spike. The latter problem can be thought of as a real-valued analogue of learning a sparse parity. Using it, we also establish the first computational-statistical gap for the closely related problem of learning a Gaussian Graphical Model.
more » « less
Free, publicly-accessible full text available February 26, 2026
Semi-Random Matrix Completion via Flow-Based Adaptive Reweighting

Kelner, Jonathan A; Li, Jerry; Liu, Allen; Sidford, Aaron; Tian, Kevin (December 2024, Neural Information Processing Systems)

Full Text Available
Semi-Random Matrix Completion via Flow-Based Adaptive Reweighting

Kelner, Jonathan; Li, J; Liu, Allen; Sidford, Aaron; Tian, Kevin (November 2024, NeurIPS 2024 https://openreview.net/forum?id=XZp1uP0hh2)

We consider the well-studied problem of completing a rank- , -incoherent matrix from incomplete observations. We focus on this problem in the semi-random setting where each entry is independently revealed with probability at least . Whereas multiple nearly-linear time algorithms have been established in the more specialized fully-random setting where each entry is revealed with probablity exactly , the only known nearly-linear time algorithm in the semi-random setting is due to [CG18], whose sample complexity has a polynomial dependence on the inverse accuracy and condition number and thus cannot achieve high-accuracy recovery. Our main result is the first high-accuracy nearly-linear time algorithm for solving semi-random matrix completion, and an extension to the noisy observation setting. Our result builds upon the recent short-flat decomposition framework of [KLLST23a, KLLST23b] and leverages fast algorithms for flow problems on graphs to solve adaptive reweighting subproblems efficiently
more » « less
Full Text Available
Sampling Polytopes with Riemannian HMC: Faster Mixing via the Lewis Weights Barrier

Gatmiry, Khashayar; Kelner, Jonathan; Vempala, Santosh S (July 2024, Conference on Learning Theory 2024)

Full Text Available
Lasso with Latents: Efficient Estimation, Covariate Rescaling, and Computational-Statistical Gaps

Kelner, Jonathan; Koehler, Frederic; Meka, Raghu; Rohatgi, Dhruv (June 2024, Journal of machine learning research)
Sampling Polytopes with Riemannian HMC: Faster Mixing via the Lewis Weights Barrier

Gatmiry, Khashayar; Kelner, Jonathan A; Vempala, Santosh S (June 2024, Proceedings of Machine Learning Research)

Full Text Available
Lasso with Latents: Efficient Estimation, Covariate Rescaling, and Computational-Statistical Gaps

Kelner, Jonathan A; Koehler, Frederic; Meka, Raghu; Rohatgi, Dhruv (June 2024, Proceedings of Machine Learning Research)

Full Text Available
Feature Adaptation for Sparse Linear Regression

Kelner, Jonathan A; Koehler, Frederic; Meka, Raghu; Rohatgi, Dhruv (December 2023, Advances in neural information processing systems)

Full Text Available
Matrix Completion in Almost-Verification Time

https://doi.org/10.1109/FOCS57990.2023.00129

Kelner, Jonathan A.; Li, Jerry; Liu, Allen; Sidford, Aaron; Tian, Kevin (November 2023, IEEE)

Full Text Available
Semi-Random Sparse Recovery in Nearly-Linear Time

Kelner, Jonathan; Li, Jerry; Liu, Allen X; Sidford, Aaron; Tian, Kevin (July 2023, Proceedings of Machine Learning Research)

Full Text Available

« Prev Next »

Search for: All records