NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Alternating minimization for generalized rank-1 matrix sensing: sharp predictions from a random initialization

https://doi.org/10.1093/imaiai/iaae025

Chandrasekher, Kabir_Aladin; Lou, Mengqi; Pananjady, Ashwin (September 2024, Information and Inference: A Journal of the IMA)

Abstract We consider the problem of estimating the factors of a rank-$$1$$ matrix with i.i.d. Gaussian, rank-$$1$$ measurements that are nonlinearly transformed and corrupted by noise. Considering two prototypical choices for the nonlinearity, we study the convergence properties of a natural alternating update rule for this non-convex optimization problem starting from a random initialization. We show sharp convergence guarantees for a sample-split version of the algorithm by deriving a deterministic one-step recursion that is accurate even in high-dimensional problems. Notably, while the infinite-sample population update is uninformative and suggests exact recovery in a single step, the algorithm—and our deterministic one-step prediction—converges geometrically fast from a random initialization. Our sharp, non-asymptotic analysis also exposes several other fine-grained properties of this problem, including how the nonlinearity and noise level affect convergence behaviour. On a technical level, our results are enabled by showing that the empirical error recursion can be predicted by our deterministic one-step updates within fluctuations of the order $$n^{-1/2}$$ when each iteration is run with $$n$$ observations. Our technique leverages leave-one-out tools originating in the literature on high-dimensional $$M$$-estimation and provides an avenue for sharply analyzing complex iterative algorithms from a random initialization in other high-dimensional optimization problems with random data.
more » « less
What governs attitudes toward artificial intelligence adoption and governance?

https://doi.org/10.1093/scipol/scac056

O’Shaughnessy, Matthew R.; Schiff, Daniel S.; Varshney, Lav R.; Rozell, Christopher J.; Davenport, Mark A. (October 2022, Science and Public Policy)

Abstract Designing effective and inclusive governance and public communication strategies for artificial intelligence (AI) requires understanding how stakeholders reason about its use and governance. We examine underlying factors and mechanisms that drive attitudes toward the use and governance of AI across six policy-relevant applications using structural equation modeling and surveys of both US adults (N = 3,524) and technology workers enrolled in an online computer science master’s degree program (N = 425). We find that the cultural values of individualism, egalitarianism, general risk aversion, and techno-skepticism are important drivers of AI attitudes. Perceived benefit drives attitudes toward AI use but not its governance. Experts hold more nuanced views than the public and are more supportive of AI use but not its regulation. Drawing on these findings, we discuss challenges and opportunities for participatory AI governance, and we recommend that trustworthy AI governance be emphasized as strongly as trustworthy AI.
more » « less
Computationally efficient reductions between some statistical models

Lou, Mengqi; Bresler, Guy; Pananjady, Ashwin (February 2025, Proceedings of Machine Learning Research)

Free, publicly-accessible full text available February 27, 2026
Just Wing It: Near-Optimal Estimation of Missing Mass in a Markovian Sequence

Pananjady, Ashwin; Muthukumar, Vidya; Thangaraj, Andrew (October 2024, Journal of machine learning research)

Full Text Available
Learning the Eye of the Beholder: Statistical Modeling and Estimation for Personalized Color Perception

https://doi.org/10.1109/Allerton63246.2024.10735273

Chen, Xuanzhou; Xu, Austin; Wang, Jingyan; Pananjady, Ashwin (September 2024, Annual Allerton Conference on Communication Control and Computing)

Full Text Available
One-shot inverse reinforcement learning for stochastic linear bandits

Guha, Etash; James, Jim; Acharya, Krishna; Muthukumar, Vidya; Pananjady, Ashwin (July 2024, International Conference on Uncertainty in Artificial Intelligence (UAI))

Full Text Available
Do Algorithms and Barriers for Sparse Principal Component Analysis Extend to Other Structured Settings?

https://doi.org/10.1109/TSP.2024.3421618

Wang, Guanyi; Lou, Mengqi; Pananjady, Ashwin (January 2024, IEEE Transactions on Signal Processing)

Full Text Available
Active metric learning and classification using similarity queries

Nadagouda, Namrata; Xu, Austin; Davenport, Mark A. (August 2023, Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence)

Active learning is commonly used to train label-efficient models by adaptively selecting the most informative queries. However, most active learning strategies are designed to either learn a representation of the data (e.g., embedding or metric learning) or perform well on a task (e.g., classification) on the data. However, many machine learning tasks involve a combination of both representation learning and a task-specific goal. Motivated by this, we propose a novel unified query framework that can be applied to any problem in which a key component is learning a representation of the data that reflects similarity. Our approach builds on similarity or nearest neighbor (NN) queries which seek to select samples that result in improved embeddings. The queries consist of a reference and a set of objects, with an oracle selecting the object most similar (i.e., nearest) to the reference. In order to reduce the number of solicited queries, they are chosen adaptively according to an information theoretic criterion. We demonstrate the effectiveness of the proposed strategy on two tasks - active metric learning and active classification - using a variety of synthetic and real world datasets. In particular, we demonstrate that actively selected NN queries outperform recently developed active triplet selection methods in a deep metric learning setting. Further, we show that in classification, actively selecting class labels can be reformulated as a process of selecting the most informative NN query, allowing direct application of our method.
more » « less
Full Text Available
Modeling and Correcting Bias in Sequential Evaluation

https://doi.org/10.1145/3580507.3597747

Wang, Jingyan; Pananjady, Ashwin (July 2023, 24th ACM Conference on Economics and Computation)

Full Text Available
Sharp analysis of EM for learning mixtures of pairwise differences

Dhawan, Abhishek; Mao, Cheng; Pananjady, Ashwin (July 2023, Proceedings of Thirty Sixth Conference on Learning Theory)

We consider a symmetric mixture of linear regressions with random samples from the pairwise comparison design, which can be seen as a noisy version of a type of Euclidean distance geometry problem. We analyze the expectation-maximization (EM) algorithm locally around the ground truth and establish that the sequence converges linearly, providing an $$\ell_\infty$$-norm guarantee on the estimation error of the iterates. Furthermore, we show that the limit of the EM sequence achieves the sharp rate of estimation in the $$\ell_2$$-norm, matching the information-theoretically optimal constant. We also argue through simulation that convergence from a random initialization is much more delicate in this setting, and does not appear to occur in general. Our results show that the EM algorithm can exhibit several unique behaviors when the covariate distribution is suitably structured.
more » « less
Full Text Available

« Prev Next »

Search for: All records