NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Protein Design with Guided Discrete Diffusion

Gruver, N; Stanton, S; Frey, N; Rudner, T; Hotzel, I; Lafrance-Vanasse, J; Rajpal, A; Cho, K; Wilson, AG (December 2023, Advances in Neural Information Processing Systems)

Full Text Available
Protein Design with Guided Discrete Diffusion

Gruver, N; Stanton, S; Frey, Nathan C; Rudner, Tim G; Hotzel, I; Lafrance-Vanasse, J; Rajpal, A; Cho, K; Wilson, Andrew G (December 2023, Advances in Neural Information Processing Systems)

A popular approach to protein design is to combine a generative model with a discriminative model for conditional sampling. The generative model samples plausible sequences while the discriminative model guides a search for sequences with high fitness. Given its broad success in conditional sampling, classifier-guided diffusion modeling is a promising foundation for protein design, leading many to develop guided diffusion models for structure with inverse folding to recover sequences. In this work, we propose diffusioN Optimized Sampling (NOS), a guidance method for discrete diffusion models that follows gradients in the hidden states of the denoising network. NOS makes it possible to perform design directly in sequence space, circumventing significant limitations of structure-based methods, including scarce data and challenging inverse design. Moreover, we use NOS to generalize LaMBO, a Bayesian optimization procedure for sequence design that facilitates multiple objectives and edit-based constraints. The resulting method, LaMBO-2, enables discrete diffusions and stronger performance with limited edits through a novel application of saliency maps. We apply LaMBO-2 to a real-world protein design task, optimizing antibodies for higher expression yield and binding affinity to several therapeutic targets under locality and developability constraints, attaining a 99% expression rate and 40% binding rate in exploratory in vitro experiments.
more » « less
Full Text Available
Bayesian Optimization with Conformal Prediction Sets

Stanton, S.; Maddox, W.; Wilson, A.G. (January 2023, Artificial Intelligence and Statistics)

Bayesian optimization is a coherent, ubiquitous approach to decision-making under uncertainty, with applications including multi-arm bandits, active learning, and black-box optimization. Bayesian optimization selects decisions (i.e. objective function queries) with maximal expected utility with respect to the posterior distribution of a Bayesian model, which quantifies reducible, epistemic uncertainty about query outcomes. In practice, subjectively implausible outcomes can occur regularly for two reasons: 1) model misspecification and 2) covariate shift. Conformal prediction is an uncertainty quantification method with coverage guarantees even for misspecified models and a simple mechanism to correct for covariate shift. We propose conformal Bayesian optimization, which directs queries towards regions of search space where the model predictions have guaranteed validity, and investigate its behavior on a suite of black-box optimization tasks and tabular ranking tasks. In many cases we find that query coverage can be significantly improved without harming sample-efficiency.
more » « less
Full Text Available
Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders

Stanton, S; Maddox, W; Gruver, N; Maffettone, P; Delaney, E; Greenside, P; Wilson, AG. (July 2022, International Conference on Machine Learning)
Kernel Interpolation for Scalable Online Gaussian Processes

Stanton, S; Maddox, W; Delbridge, I; Wilson, A.G. (January 2021, Artificial Intelligence and Statistics (AISTATS))

Full Text Available
Kernel Interpolation for Scalable Online Gaussian Processes

Stanton, S.; Maddox, W.J.; Delbridge, I.; Wilson, A.G. (January 2021, Proceedings of The 24th International Conference on Artificial Intelligence and Statistics)
null (Ed.)
Gaussian processes (GPs) provide a gold standard for performance in online settings, such as sample-efficient control and black box optimization, where we need to update a posterior distribution as we acquire data in a sequential fashion. However, updating a GP posterior to accommodate even a single new observation after having observed n points incurs at least O(n) computations in the exact setting. We show how to use structured kernel interpolation to efficiently recycle computations for constant-time O(1) online updates with respect to the number of points n, while retaining exact inference. We demonstrate the promise of our approach in a range of online regression and classification settings, Bayesian optimization, and active sampling to reduce error in malaria incidence forecasting.
more » « less
Full Text Available
On the model-based stochastic value gradient for continuous reinforcement learning

Amos, B; Stanton, S; Yarats, D; Wilson, A.G. (January 2021, Learning for Dynamics and Control (L4DC))
null (Ed.)
For over a decade, model-based reinforcement learning has been seen as a way to leverage control-based domain knowledge to improve the sample-efficiency of reinforcement learning agents. While model-based agents are conceptually appealing, their policies tend to lag behind those of model-free agents in terms of final reward, especially in non-trivial environments. In response, researchers have proposed model-based agents with increasingly complex components, from ensembles of probabilistic dynamics models, to heuristics for mitigating model error. In a reversal of this trend, we show that simple model-based agents can be derived from existing ideas that not only match, but outperform state-of-the-art model-free agents in terms of both sample-efficiency and final reward. We find that a model-free soft value estimate for policy evaluation and a model-based stochastic value gradient for policy improvement is an effective combination, achieving state-of-the-art results on a high-dimensional humanoid control task, which most model-based agents are unable to solve. Our findings suggest that model-based policy evaluation deserves closer attention.
more » « less
Full Text Available
Conditioning Sparse Variational Gaussian Processes for Online Decision-making

Maddox, W. J.; Stanton, S.; Wilson, A. G. (January 2021, Advances in neural information processing systems)

With a principled representation of uncertainty and closed form posterior updates, Gaussian processes (GPs) are a natural choice for online decision making. However, Gaussian processes typically require at least O(n2) computations for n training points, limiting their general applicability. Stochastic variational Gaussian processes (SVGPs) can provide scalable inference for a dataset of fixed size, but are difficult to efficiently condition on new data. We propose online variational conditioning (OVC), a procedure for efficiently conditioning SVGPs in an online setting that does not require re-training through the evidence lower bound with the addition of new data. OVC enables the pairing of SVGPs with advanced look-ahead acquisition functions for black-box optimization, even with non-Gaussian likelihoods. We show OVC provides compelling performance in a range of applications including active learning of malaria incidence, and reinforcement learning on MuJoCo simulated robotic control tasks.
more » « less
Full Text Available
Generalizing Convolutional Neural Networks for Equivarianceto Lie Groups on Arbitrary Continuous Data

Finzi, M; Stanton, S; Izmailov, P; Wilson, A.G. (January 2020, International Conference on Machine Learning)

The translation equivariance of convolutional layers enables convolutional neural networks to generalize well on image problems. While translation equivariance provides a powerful inductive bias for images, we often additionally desire equivariance to other transformations, such as rotations, especially for non-image data. We propose a general method to construct a convolutional layer that is equivariant to transformations from any specified Lie group with a surjective exponential map. Incorporating equivariance to a new group requires implementing only the group exponential and logarithm maps, enabling rapid prototyping. Showcasing the simplicity and generality of our method, we apply the same model architecture to images, ball-and-stick molecular data, and Hamiltonian dynamical systems. For Hamiltonian systems, the equivariance of our models is especially impactful, leading to exact conservation of linear and angular momentum.
more » « less
Full Text Available
Generalizing Convolutional Networks for Equivariance to Lie Groups on Arbitrary Continuous Data.

Finzi, M; Stanton, S; Izmailov, P; Wilson, A.G. (January 2020, Proceedings of the International Conference on Machine Vision and Machine Learning)

The translation equivariance of convolutional layers enables convolutional neural networks to generalize well on image problems. While translation equivariance provides a powerful inductive bias for images, we often additionally desire equivariance to other transformations, such as rotations, especially for non-image data. We propose a general method to construct a convolutional layer that is equivariant to transformations from any specified Lie group with a surjective exponential map. Incorporating equivariance to a new group requires implementing only the group exponential and logarithm maps, enabling rapid prototyping. Showcasing the simplicity and generality of our method, we apply the same model architecture to images, ball-and-stick molecular data, and Hamiltonian dynamical systems. For Hamiltonian systems, the equivariance of our models is especially impactful, leading to exact conservation of linear and angular momentum.
more » « less
Full Text Available

« Prev Next »

Search for: All records