Interpreting function and fitness effects in diverse plant genomes requires transferable models. Language models (LMs) pretrained on large-scale biological sequences can capture evolutionary conservation and, through fine-tuning on limited labeled data, offer better cross-species prediction than supervised models. We introduce PlantCaduceus, a plant DNA LM that learns evolutionary conservation patterns in 16 angiosperm genomes by modeling both DNA strands simultaneously. When fine-tuned on a small set of labeled Arabidopsis data for tasks such as predicting translation initiation/termination sites and splice donor/acceptor sites, PlantCaduceus demonstrated remarkable transferability to maize, which diverged 160 Mya. The model outperformed the best existing DNA language model by 1.45-fold in maize splice donor prediction and 7.23-fold in maize translation initiation site prediction. In variant effect prediction, PlantCaduceus showed performance comparable to state-of-the-art protein LMs. Mutations predicted to be deleterious by PlantCaduceus showed threefold lower average minor allele frequencies than those identified by multiple sequence alignment-based methods. Additionally, PlantCaduceus successfully identifies well-known causal variants in both Arabidopsis and maize. Overall, PlantCaduceus is a versatile DNA LM that can accelerate plant genomics and crop breeding applications.
Free, publicly accessible full text available June 17, 2026
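The abstract describes scoring variants with a DNA language model. A common way to do this (the specifics of PlantCaduceus's scoring are not given here, so this is a generic, hypothetical sketch) is a log-likelihood ratio: mask the variant position, query the model for per-nucleotide probabilities, and compare the alternate allele to the reference. The `probs` array below is mock data standing in for real model output.

```python
import numpy as np

NUCS = "ACGT"

def variant_effect_score(probs, pos, ref, alt):
    """Log-likelihood ratio of the alternate vs. reference allele at `pos`.

    `probs` is an (L, 4) array of per-position nucleotide probabilities,
    e.g. obtained by masking each position and querying a pretrained DNA
    language model. Strongly negative scores mean the model disfavours
    the alternate allele (a candidate deleterious variant).
    """
    p = probs[pos]
    return float(np.log(p[NUCS.index(alt)]) - np.log(p[NUCS.index(ref)]))

# Mock probability profile for a 5-bp window; position 2 is strongly
# conserved as 'G', the other positions are unconstrained.
probs = np.array([
    [0.25, 0.25, 0.25, 0.25],
    [0.25, 0.25, 0.25, 0.25],
    [0.01, 0.01, 0.97, 0.01],  # conserved G
    [0.25, 0.25, 0.25, 0.25],
    [0.25, 0.25, 0.25, 0.25],
])

# Mutating the conserved G to T yields a strongly negative score,
# while swapping alleles at an unconstrained position scores ~0.
score = variant_effect_score(probs, pos=2, ref="G", alt="T")
```

With real model output, such scores can be thresholded and compared against population minor allele frequencies, as the abstract does.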
Diffusion models have gained traction as powerful algorithms for synthesizing high-quality images. Central to these algorithms is the diffusion process, a set of equations that maps data to noise in a way that can significantly affect performance. In this paper, we explore whether the diffusion process can be learned from data. Our work is grounded in Bayesian inference and seeks to improve log-likelihood estimation by casting the learned diffusion process as an approximate variational posterior that yields a tighter lower bound (ELBO) on the likelihood. A widely held assumption is that the ELBO is invariant to the noise process: our work dispels this assumption and proposes multivariate learned adaptive noise (MuLAN), a learned diffusion process that applies noise at different rates across an image. Our method consists of three components: a multivariate noise schedule, adaptive input-conditional diffusion, and auxiliary variables; these components ensure that the ELBO is no longer invariant to the choice of the noise schedule, as it is in previous works. Empirically, MuLAN sets a new state of the art in density estimation on CIFAR-10 and ImageNet while matching the performance of previous state-of-the-art models with 50% fewer steps. Code, a blog post, and a video tutorial are available on the project page: https://s-sahoo.com/MuLAN
Free, publicly accessible full text available December 9, 2025
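The key idea of a multivariate noise schedule is that each pixel can have its own signal-to-noise ratio at a given diffusion time. The sketch below is not MuLAN's implementation, just a minimal illustration in the variance-preserving parameterization, where a per-dimension log-SNR array `gamma` replaces the usual scalar schedule (a scalar `gamma` recovers the standard shared schedule).

```python
import numpy as np

def forward_diffuse(x, gamma, rng=None):
    """Sample z_t ~ q(z_t | x) under a per-dimension noise schedule.

    `gamma` has the same shape as `x` and gives each dimension's
    log-SNR at time t. Variance-preserving parameterization:
    alpha^2 = sigmoid(gamma), sigma^2 = sigmoid(-gamma), so
    alpha^2 + sigma^2 = 1 elementwise.
    """
    rng = rng or np.random.default_rng()
    alpha = np.sqrt(1.0 / (1.0 + np.exp(-gamma)))  # sqrt(sigmoid(gamma))
    sigma = np.sqrt(1.0 / (1.0 + np.exp(gamma)))   # sqrt(sigmoid(-gamma))
    eps = rng.standard_normal(x.shape)
    return alpha * x + sigma * eps, eps

# Example: noise the left half of an image faster than the right half
# by giving it a lower log-SNR at this timestep.
x = np.ones((4, 4))
gamma = np.concatenate([np.full((4, 2), -2.0), np.full((4, 2), 2.0)], axis=1)
z_t, eps = forward_diffuse(x, gamma)
```

Learning `gamma` as a function of time (and, in the input-conditional case, of the data itself) is what makes the ELBO sensitive to the schedule and hence optimizable.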