NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Mitigating over-exploration in latent space optimization using LES

Ronen, Omer; Humayun, Ahmed_Imtiaz; Baraniuk, Richard; Balestriero, Randall; Yu, Bin (May 2025, ICML)

We develop Latent Exploration Score (LES) to mitigate over-exploration in Latent Space Op- timization (LSO), a popular method for solv- ing black-box discrete optimization problems. LSO utilizes continuous optimization within the latent space of a Variational Autoencoder (VAE) and is known to be susceptible to over- exploration, which manifests in unrealistic solu- tions that reduce its practicality. LES leverages the trained decoder’s approximation of the data distribution, and can be employed with any VAE decoder–including pretrained ones–without addi- tional training, architectural changes or access to the training data. Our evaluation across five LSO benchmark tasks and twenty-two VAE mod- els demonstrates that LES always enhances the quality of the solutions while maintaining high objective values, leading to improvements over ex- isting solutions in most cases. We believe that new avenues to LSO will be opened by LES’ ability to identify out of distribution areas, differentiability, and computational tractability.
more » « less
Free, publicly-accessible full text available May 1, 2026
DeepTensor: Low-Rank Tensor Decomposition With Deep Network Priors

https://doi.org/10.1109/TPAMI.2024.3450575

Saragadam, Vishwanath; Balestriero, Randall; Veeraraghavan, Ashok; Baraniuk, Richard G (December 2024, IEEE Transactions on Pattern Analysis and Machine Intelligence)

DeepTensor is a computationally efficient framework for low-rank decomposition of matrices and tensors using deep generative networks. We decompose a tensor as the product of low-rank tensor factors where each low-rank tensor is generated by a deep network (DN) that is trained in a self-supervised manner to minimize the mean-square approximation error. Our key observation is that the implicit regularization inherent in DNs enables them to capture nonlinear signal structures that are out of the reach of classical linear methods like the singular value decomposition (SVD) and principal components analysis (PCA). We demonstrate that the performance of DeepTensor is robust to a wide range of distributions and a computationally efficient drop-in replacement for the SVD, PCA, nonnegative matrix factorization (NMF), and similar decompositions by exploring a range of real-world applications, including hyperspectral image denoising, 3D MRI tomography, and image classification.
more » « less
Free, publicly-accessible full text available December 1, 2025
The Common Intuition to Transfer Learning Can Win or Lose: Case Studies for Linear Regression

https://doi.org/10.1137/23M1563062

Dar, Yehuda; LeJeune, Daniel; Baraniuk, Richard G (June 2024, SIAM Journal on Mathematics of Data Science)

Full Text Available
Learning Transferable Features for Implicit Neural Representations

Vyas, Kushal; Humayun, Ahmed I; Dashpute, Aniket; Baraniuk, Richard B; Veeraraghavan, Ashok; Balakrishnan, Guha (September 2024, ArXiv)

Implicit neural representations (INRs) have demonstrated success in a variety of applications, including inverse problems and neural rendering. An INR is typically trained to capture one signal of interest, resulting in learned neural features that are highly attuned to that signal. Assumed to be less generalizable, we explore the aspect of transferability of such learned neural features for fitting similar signals. We introduce a new INR training framework, STRAINER that learns transferrable features for fitting INRs to new signals from a given distribution, faster and with better reconstruction quality. Owing to the sequential layer-wise affine operations in an INR, we propose to learn transferable representations by sharing initial encoder layers across multiple INRs with independent decoder layers. At test time, the learned encoder representations are transferred as initialization for an otherwise randomly initialized INR. We find STRAINER to yield extremely powerful initialization for fitting images from the same domain and allow for ≈+10dB gain in signal quality early on compared to an untrained INR itself. STRAINER also provides a simple way to encode data-driven priors in INRs. We evaluate STRAINER on multiple in-domain and out-of-domain signal fitting tasks and inverse problems and further provide detailed analysis and discussion on the transferability of STRAINER's features.
more » « less
Full Text Available
PIDformer: transformer meets control theory

Nguyen, Tam; Uribe, Cesar A; Nguyen, Tan M; Baraniuk, Richard G (July 2024, JMLR.org)

Full Text Available
ScaLES: Scalable Latent Exploration Score for Pre-Trained Generative Networks

Ronen, Omer; Humayun, Ahmed; Balestriero, Randall; Baraniuk, Richard; Yu, Bin (June 2024, Arxiv)

Full Text Available
“Implicit Neural Representations and the Algebra of Complex Wavelets,

Roddenberry, T Mitchell; Saragadam, Vishwanath; de_Hoop, Marten; Baraniuk, Richard G (May 2024, International Conference on Learning Representations (ICLR))

Full Text Available
Asymptotics of the Sketched Pseudoinverse

https://doi.org/10.1137/22M1530264

LeJeune, Daniel; Patil, Pratik; Javadi, Hamid; Baraniuk, Richard G; Tibshirani, Ryan J (March 2024, SIAM Journal on Mathematics of Data Science)

We take a random matrix theory approach to random sketching and show an asymptotic first-order equivalence of the regularized sketched pseudoinverse of a positive semidefinite matrix to a certain evaluation of the resolvent of the same matrix. We focus on real-valued regularization and extend previous results on an asymptotic equivalence of random matrices to the real setting, providing a precise characterization of the equivalence even under negative regularization, including a precise characterization of the smallest nonzero eigenvalue of the sketched matrix. We then further characterize the second-order equivalence of the sketched pseudoinverse. We also apply our results to the analysis of the sketch-and-project method and to sketched ridge regression. Last, we prove that these results generalize to asymptotically free sketching matrices, obtaining the resulting equivalence for orthogonal sketching matrices and comparing our results to several common sketches used in practice.
more » « less
Full Text Available
Mitigating Over-smoothing in Transformers via Regularized Nonlocal Functionals

Nguyen, Tam; Nguyen, Tan M; Baraniuk, Richard (December 2023, Advances in Neural Information Processing Systems (NeurIPS))

Full Text Available
CLASS: A Design Framework for Building Intelligent Tutoring Systems Based on Learning Science principles

https://doi.org/10.18653/v1/2023.findings-emnlp.130

Sonkar, Shashank; Liu, Naiming; Mallick, Debshila; Baraniuk, Richard (December 2023, Findings of the Association for Computational Linguistics: EMNLP 2023)

« Prev Next »

Search for: All records