NSF PAR Search | NSF Public Access Repository

Compositional Reasoning with Transformers, RNNs, and Chain of Thought

Yehudai, Gilad; Amsel, Noah; Bruna, Joan (December 2025, Neural Information Processing Systems)

Free, publicly-accessible full text available December 9, 2026
The Generative Leap: Sharp Sample Complexity for Efficiently Learning Gaussian Multi-Index Models

Damian, Alex; Lee, Jason; Bruna, Joan (December 2025, Neural Information Processing Systems)

Free, publicly-accessible full text available December 9, 2026
Survey on algorithms for multi-index models

Bruna, Joan; Hsu, Daniel (September 2025, Statistical science)

Free, publicly-accessible full text available September 18, 2026
Thermalizer: Stable autoregressive neural emulation of spatiotemporal chaos

Pedersen, Chris; Zanna, Laure; Bruna, Joan (July 2025, International Conference on Machine Learning)

Free, publicly-accessible full text available July 21, 2026
Propagation of Chaos in One-hidden-layer Neural Networks beyond Logarithmic Time

Glasgow, Margalit; Wu, Denny; Bruna, Joan (July 2025, Conference on Learning Theory)

Free, publicly-accessible full text available July 1, 2026
Distributional Associations vs In-Context Reasoning: A Study of Feed-forward and Attention Layers

Chen, Lei; Bietti, Alberto; Bruna, Joan (April 2025, International Conference on Learning Representations)

Free, publicly-accessible full text available April 24, 2026
Posterior Sampling with Denoising Oracles via Tilted Transport

Bruna, Joan; Han, Jiequn (December 2024, Neural Information Processing Systems)

Full Text Available
Stochastic Optimal Control Matching

Domingo-Enrich, Carles; Han, Jiequn; Amos, Brandon; Bruna, Joan; Chen, Ricky (December 2024, Neural Information Processing Systems)

Full Text Available
Computational-Statistical Gaps in Gaussian Single-Index Models

Damian, Alex; Pillaud-Vivien, Loucas; Lee, Jason; Bruna, Joan (June 2024, Conference on Learning Theory (COLT))

Single-Index Models are high-dimensional regression problems with planted structure, whereby labels depend on an unknown one-dimensional projection of the input via a generic, non-linear, and potentially non-deterministic transformation. As such, they encompass a broad class of statistical inference tasks, and provide a rich template to study statistical and computational trade-offs in the high-dimensional regime. While the information-theoretic sample complexity to recover the hidden direction is linear in the dimension d, we show that computationally efficient algorithms, both within the Statistical Query (SQ) and the Low-Degree Polynomial (LDP) framework, necessarily require Ω(d^{k⋆/2}) samples, where k⋆ is a “generative” exponent associated with the model that we explicitly characterize. Moreover, we show that this sample complexity is also sufficient, by establishing matching upper bounds using a partial-trace algorithm. Therefore, our results provide evidence of a sharp computational-to-statistical gap (under both the SQ and LDP class) whenever k⋆ > 2. To complete the study, we construct smooth and Lipschitz deterministic target functions with arbitrarily large generative exponents k⋆.
Full Text Available
Symmetric Single-Index Learning

Zweig, Aaron; Bruna, Joan (May 2024, International Conference on Learning Representations (ICLR))

Few neural architectures lend themselves to provable learning with gradient based methods. One popular model is the single-index model, in which labels are produced by composing an unknown linear projection with a possibly unknown scalar link function. Learning this model with SGD is relatively well-understood, whereby the so-called information exponent of the link function governs a polynomial sample complexity rate. However, extending this analysis to deeper or more complicated architectures remains challenging. In this work, we consider single index learning in the setting of symmetric neural networks. Under analytic assumptions on the activation and maximum degree assumptions on the link function, we prove that gradient flow recovers the hidden planted direction, represented as a finitely supported vector in the feature space of power sum polynomials. We characterize a notion of information exponent adapted to our setting that controls the efficiency of learning.
Full Text Available
