NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Towards Learning High-Precision Least Squares Algorithms with Sequence Models

Liu, Jerry; Grogan, Jessica; Dugan, Owen; Rao, Ashish; Arora, Simran; Rudra, Atri; Re, Chris (April 2025, The Thirteenth International Conference on Learning Representations (ICLR), 2025)

Free, publicly-accessible full text available April 24, 2026
FastPDB: Towards Bag-Probabilistic Queries at Interactive Speeds

https://doi.org/10.1145/3709691

Huber, Aaron; Kennedy, Oliver; Rudra, Atri; Zhao, Zhuoyue; Feng, Su; Glavic, Boris (February 2025, Proceedings of the ACM on Management of Data)

Probabilistic databases (PDBs) provide users with a principled way to query data that is incomplete or imprecise. In this work, we study computing expected multiplicities of query results over probabilistic databases under bag semantics which has PTIME data complexity. However, does this imply that bag probabilistic databases are practical? We strive to answer this question from both a theoretical as well as a systems perspective. We employ concepts from fine-grained complexity to demonstrate that exact bag probabilistic query processing is fundamentally less efficient than deterministic bag query evaluation, but that fast approximations are possible by sampling monomials from a circuit representation of a result tuple's lineage. A remaining issue, however, is that constructing such circuits, while in PTIME, can nonetheless have significant overhead. To avoid this cost, we utilize approximate query processing techniques to directly sample monomials without materializing lineage upfront. Our implementation inFastPDBprovides accurate anytime approximation of probabilistic query answers and scales to datasets orders of magnitude larger than competing methods.
more » « less
Free, publicly-accessible full text available February 10, 2026
Simple linear attention language models balance the recall-throughput tradeoff

Arora, Simran; Eyuboglu, Sabri; Zhang, Michael; Timalsina, Aman; Alberti, Silas; Zou, James; Rudra, Atri; Re, Christopher (July 2024, Proceedings of the 41st International Conference on Machine Learning)

Full Text Available
Zoology: Measuring and Improving Recall in Efficient Language Models

Arora, Simran; Eyuboglu, Sabri; Timalsina, Aman; Johnson, Isys; Poli, Michael; Zou, James; Rudra, Atri; Ré, Christopher (May 2024, Proceedings of 12th International Conference on Learning Representations (ICLR))

Full Text Available
Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture

Fu, Daniel Y.; Arora, Simran; Grogan, Jessica; Johnson, Isys; Eyuboglu, Sabri; Thomas, Armin W.; Spector, Benjamin; Poli, Michael; Rudra, Atri; Ré, Christopher (December 2023, Proceedings of the 36th Neural Information Processing Systems Conference (NeurIPS))

Full Text Available
How to Train Your HiPPO: State Space Models with Generalized Orthogonal Basis Projections

Gu, Albert; Johnson, Isys; Timalsina, Aman; Rudra, Atri; Ré, Christopher (May 2023, Proceedings of the 11th International Conference on Learning Representations (ICLR))

Full Text Available
Hungry Hungry Hippos: Towards Language Modeling with State Space Models

Dao, Tri; Fu, Daniel Y.; Saab, Khaled K.; Thomas, Armin W.; Rudra, Atri; Ré, Christopher (May 2023, Proceedings of the 11th International Conference on Learning Representations (ICLR))

Full Text Available
Simple Hardware-Efficient Long Convolutions for Sequence Modeling

Fu, Daniel Y.; Epstein, Elliot L.; Nguyen, Eric; Thomas, Armin W.; Zhang, Michael; Dao, Tri; Rudra, Atri; Ré, Christopher (July 2023, Proceedings of the 40th International Conference on Machine Learning (ICML))

Full Text Available
Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions

Massaroli, Stefano; Poli, Michael; Fu, Daniel Y.; Kumbong, Hermann; Parnichkun, Rom N.; Timalsina, Aman; Romero, David W.; McIntyre, Quinn; Chen, Beidi; Rudra, Atri; et al (December 2023, Proceedings of the 36th Neural Information Processing Systems Conference (NeurIPS))

Full Text Available
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

Dao, Tri; Fu, Daniel Y.; Ermon, Stefano; Rudra, Atri; Ré, Christopher (December 2022, Proceedings of the 35th Neural Information Processing Systems Conference (NeurIPS))

Full Text Available

« Prev Next »

Search for: All records