Title: Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders
Bayesian optimization (BayesOpt) is a gold standard for query-efficient continuous optimization. However, its adoption for drug design has been hindered by the discrete, high-dimensional nature of the decision variables. We develop a new approach (LaMBO) which jointly trains a denoising autoencoder with a discriminative multi-task Gaussian process head, allowing gradient-based optimization of multi-objective acquisition functions in the latent space of the autoencoder. These acquisition functions allow LaMBO to balance the explore-exploit tradeoff over multiple design rounds, and to balance objective tradeoffs by optimizing sequences at many different points on the Pareto frontier. We evaluate LaMBO on two small-molecule design tasks, and introduce new tasks optimizing in silico and in vitro properties of large-molecule fluorescent proteins. In our experiments LaMBO outperforms genetic optimizers and does not require a large pretraining corpus, demonstrating that BayesOpt is practical and effective for biological sequence design.
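
As a rough illustration of the core mechanic, the sketch below runs gradient ascent on an acquisition function in the latent space of an autoencoder and decodes the result back to a discrete sequence. The module shapes, the toy quadratic acquisition, and the untrained networks are all illustrative assumptions, not the authors' LaMBO implementation.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
LATENT_DIM, VOCAB, SEQ_LEN = 16, 21, 50          # hypothetical sizes

# Stand-ins for the trained denoising autoencoder's encoder and decoder.
encoder = nn.Sequential(nn.Flatten(), nn.Linear(VOCAB * SEQ_LEN, LATENT_DIM))
decoder = nn.Sequential(nn.Linear(LATENT_DIM, VOCAB * SEQ_LEN),
                        nn.Unflatten(1, (SEQ_LEN, VOCAB)))

def acquisition(z):
    # Placeholder for a multi-objective acquisition value computed through
    # the GP head (e.g. expected hypervolume improvement); a smooth toy here.
    return -(z ** 2).sum(dim=-1)

# Encode a seed sequence, then ascend the acquisition surface with gradients;
# this is possible because z is continuous even though sequences are discrete.
x = torch.randn(1, SEQ_LEN, VOCAB)               # one-hot-like seed sequence
z = encoder(x).detach().requires_grad_(True)
opt = torch.optim.Adam([z], lr=0.1)
for _ in range(100):
    opt.zero_grad()
    (-acquisition(z).sum()).backward()           # maximize the acquisition
    opt.step()

candidate = decoder(z).argmax(dim=-1)            # decode to a token sequence
print(candidate.shape)                           # torch.Size([1, 50])
```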
Award ID(s):
1922658
NSF-PAR ID:
10342936
Author(s) / Creator(s):
Date Published:
Journal Name:
International Conference on Machine Learning
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Optimizing a black-box function that is expensive to evaluate emerges in a gamut of machine learning and artificial intelligence applications, including drug discovery, policy optimization in robotics, and hyperparameter tuning of learning models, to name a few. Bayesian optimization (BO) provides a principled framework to find the global optimum of such functions using a limited number of function evaluations. BO relies on a statistical surrogate model to actively select new query points; this surrogate is typically captured by a Gaussian process (GP). Unlike most existing approaches, which hinge on a single GP surrogate with a pre-selected kernel function that may confine the expressiveness of the sought function, especially under a limited evaluation budget, the present work puts forth a weighted ensemble of GPs (EGP) as the surrogate model. Building on the advocated Gaussian mixture (GM) posterior, the EGP framework adapts to the best-fitted surrogate model as data arrive on the fly, offering a richer function space. For the acquisition of the next evaluation point, the EGP-based posterior is coupled with an adaptive expected improvement (EI) criterion to balance exploration and exploitation of the search space. Numerical tests on a set of benchmark synthetic functions and two robotic tasks demonstrate the impressive benefits of the proposed approach.
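A toy sketch of the ensemble idea: two RBF kernels with different lengthscales are weighted by their marginal likelihoods, and EI is computed under the resulting mixture posterior. The kernels, the weight update, and the 1-D setup are simplifications of the idea, not the paper's exact EGP algorithm.

```python
import numpy as np
from scipy.stats import norm

def rbf(X1, X2, ls):
    # Squared-exponential kernel with lengthscale ls (unit output scale).
    d = X1[:, None] - X2[None, :]
    return np.exp(-0.5 * (d / ls) ** 2)

def gp_posterior(X, y, Xs, ls, noise=1e-4):
    K = rbf(X, X, ls) + noise * np.eye(len(X))
    Ks = rbf(X, Xs, ls)
    Kinv = np.linalg.inv(K)
    mu = Ks.T @ Kinv @ y
    var = np.maximum(1.0 - np.sum(Ks * (Kinv @ Ks), axis=0), 1e-12)
    # Log marginal likelihood sets this kernel's weight in the ensemble.
    lml = -0.5 * (y @ Kinv @ y + np.linalg.slogdet(K)[1]
                  + len(X) * np.log(2 * np.pi))
    return mu, var, lml

X = np.array([0.1, 0.4, 0.9]); y = np.sin(6 * X)     # toy observations
Xs = np.linspace(0, 1, 200)                          # candidate query points
models = [gp_posterior(X, y, Xs, ls) for ls in (0.05, 0.3)]

w = np.exp([m[2] for m in models]); w /= w.sum()     # adaptive model weights

best, ei = y.max(), np.zeros_like(Xs)
for (mu, var, _), wk in zip(models, w):
    s = np.sqrt(var)
    g = (mu - best) / s
    ei += wk * s * (g * norm.cdf(g) + norm.pdf(g))   # weighted per-GP EI

print(Xs[np.argmax(ei)])                             # next point to evaluate
```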
  2. The design of machine learning systems often requires trading off different objectives, for example, prediction error and energy consumption for deep neural networks (DNNs). Typically, no single design performs well in all objectives; therefore, finding Pareto-optimal designs is of interest. The search for Pareto-optimal designs involves evaluating designs in an iterative process, and the measurements are used to evaluate an acquisition function that guides the search. However, measuring different objectives incurs different costs: for example, measuring the prediction error of a DNN is orders of magnitude more expensive than measuring the energy consumption of a pre-trained DNN, since evaluating error requires re-training the network. Current state-of-the-art methods do not consider this difference in evaluation cost, potentially incurring expensive objective evaluations during optimization. In this paper, we develop a novel decoupled and cost-aware multi-objective optimization algorithm, which we call Flexible Multi-Objective Bayesian Optimization (FlexiBO), to address this issue. For each design evaluation, FlexiBO selects the objective with the higher relative gain by weighting the improvement of the hypervolume of the Pareto region with the measurement cost of each objective. This strategy balances the expense of collecting new information against the knowledge gained through objective evaluations, preventing FlexiBO from performing expensive measurements for little to no gain. We evaluate FlexiBO on seven state-of-the-art DNNs for image recognition, natural language processing (NLP), and speech-to-text translation. Our results indicate that, given the same total experimental budget, FlexiBO discovers designs with 4.8% to 12.4% lower hypervolume error than the best state-of-the-art multi-objective optimization method.
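The decoupled selection rule can be sketched in a few lines: pick the single objective whose expected hypervolume gain per unit measurement cost is largest. The gain and cost numbers below are made up for illustration.

```python
# Sketch of a decoupled, cost-aware selection rule in the spirit of FlexiBO.
def select_objective(expected_hv_gain, cost):
    """expected_hv_gain[i]: predicted Pareto-hypervolume improvement from
    measuring objective i at the chosen design; cost[i]: the price of that
    measurement (e.g. retraining a DNN vs. profiling its energy)."""
    ratios = {k: expected_hv_gain[k] / cost[k] for k in expected_hv_gain}
    return max(ratios, key=ratios.get)

# Measuring error requires retraining (expensive); energy is cheap to profile.
gain = {"prediction_error": 0.30, "energy": 0.05}
cost = {"prediction_error": 100.0, "energy": 1.0}
print(select_objective(gain, cost))   # -> "energy": cheap information wins here
```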
  3. Avidan, S. (Ed.)
    The subpopulation shifting challenge, in which some subpopulations of a category are not seen during training, severely limits the classification performance of state-of-the-art convolutional neural networks. To mitigate this practical issue, we explore incremental subpopulation learning (ISL), which adapts the original model by incrementally learning the unseen subpopulations without retaining the seen population data. Striking a good balance between subpopulation learning and seen-population forgetting is the main challenge in ISL, but it is not well studied by existing approaches. Existing incremental learners simply use a pre-defined, fixed hyperparameter to balance the learning objective and the forgetting regularization, so their learning is usually biased towards one side in the long run. In this paper, we propose a novel two-stage learning scheme that explicitly disentangles acquisition and forgetting to achieve a better balance: in the first "gain-acquisition" stage, we progressively learn a new classifier based on a margin-enforce loss, which gives hard samples and subpopulations larger weight in classifier updates rather than updating all of the population uniformly; in the second "counter-forgetting" stage, we search for the proper combination of the new and old classifiers by optimizing a novel objective based on proxies of forgetting and acquisition. We benchmark representative state-of-the-art non-exemplar-based incremental learning methods on a large-scale subpopulation shifting dataset for the first time. Under almost all the challenging ISL protocols, we outperform other methods by a large margin, demonstrating the ability of our approach to alleviate subpopulation shifting (code is released at https://github.com/wuyujack/ISL).
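A toy sketch of the second, "counter-forgetting" stage: grid-search a convex combination of the old and new classifier weights against a proxy objective. The accuracy and weight-drift proxies and the 0.1 trade-off factor are illustrative assumptions, not the paper's exact formulation.

```python
import torch

torch.manual_seed(0)

def combine(old_w, new_w, alpha):
    # Convex combination of the two classifiers' weight matrices.
    return alpha * new_w + (1 - alpha) * old_w

def proxy_score(w, old_w, X_new, y_new):
    acquisition = (X_new @ w).argmax(1).eq(y_new).float().mean()  # new-subpopulation fit
    forgetting = torch.norm(w - old_w)                            # drift from the old model
    return (acquisition - 0.1 * forgetting).item()

old_w = torch.randn(8, 3); new_w = torch.randn(8, 3)              # toy linear classifiers
X_new = torch.randn(32, 8)                                        # unseen-subpopulation features
y_new = torch.randint(0, 3, (32,))

best_alpha = max((a / 10 for a in range(11)),
                 key=lambda a: proxy_score(combine(old_w, new_w, a),
                                           old_w, X_new, y_new))
print(best_alpha)
```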
  4. Abstract

    We present a framework, which we call Molecule Deep Q-Networks (MolDQN), for molecule optimization that combines domain knowledge of chemistry with state-of-the-art reinforcement learning techniques (double Q-learning and randomized value functions). We directly define modifications on molecules, thereby ensuring 100% chemical validity. Further, we operate without pre-training on any dataset, avoiding possible bias from the choice of that set. MolDQN achieves comparable or better performance than several other recently published algorithms on benchmark molecular optimization tasks. However, we also argue that many of these tasks are not representative of real optimization problems in drug discovery. Inspired by problems faced during medicinal chemistry lead optimization, we extend our model with multi-objective reinforcement learning, which maximizes drug-likeness while maintaining similarity to the original molecule. We further show the path taken through chemical space to optimize a molecule, in order to understand how the model works.

     
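The multi-objective reward described above might be scalarized as a weighted blend of drug-likeness (QED) and Tanimoto similarity to the starting molecule, as in this sketch (requires RDKit; the weight w and the fingerprint choice are assumptions, not MolDQN's exact setup).

```python
from rdkit import Chem, DataStructs
from rdkit.Chem import QED

def reward(smiles, start_smiles, w=0.5):
    # Scalarized multi-objective reward: w * drug-likeness + (1-w) * similarity.
    mol, start = Chem.MolFromSmiles(smiles), Chem.MolFromSmiles(start_smiles)
    if mol is None:                      # invalid edits never arise in MolDQN,
        return 0.0                       # since actions are valid modifications
    sim = DataStructs.TanimotoSimilarity(Chem.RDKFingerprint(mol),
                                         Chem.RDKFingerprint(start))
    return w * QED.qed(mol) + (1 - w) * sim

print(reward("CCO", "CCN"))   # ethanol scored against ethylamine
```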
  5. Motivated by performance optimization of large-scale graph processing systems that distribute the graph across multiple machines, we consider the balanced graph partitioning problem. Compared to most of the previous work, we study the multi-dimensional variant in which balance according to multiple weight functions is required. As we demonstrate by experimental evaluation, such multi-dimensional balance is essential for achieving performance improvements for typical distributed graph processing workloads. We propose a new scalable technique for the multi-dimensional balanced graph partitioning problem. It is based on applying randomized projected gradient descent to a non-convex continuous relaxation of the objective. We show how to implement the new algorithm efficiently in both theory and practice utilizing various approaches for the projection step. Experiments with large-scale graphs containing up to hundreds of billions of edges indicate that our algorithm has superior performance compared to the state of the art.
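A toy sketch of the continuous-relaxation idea on a single balance dimension: relax the one-hot assignment matrix to rows on the probability simplex, take noisy gradient steps on a soft cut objective, and project back. The paper's algorithm additionally handles multi-dimensional balance constraints in the projection step and scales to graphs with billions of edges; everything below is a minimal illustration.

```python
import numpy as np

def project_simplex(V):
    """Euclidean projection of each row of V onto the probability simplex."""
    n, k = V.shape
    U = np.sort(V, axis=1)[:, ::-1]                   # rows sorted descending
    css = np.cumsum(U, axis=1) - 1
    rho = np.sum(U > css / np.arange(1, k + 1), axis=1)
    theta = css[np.arange(n), rho - 1] / rho
    return np.maximum(V - theta[:, None], 0)

rng = np.random.default_rng(0)
n, k = 6, 2
A = rng.integers(0, 2, (n, n)); A = np.triu(A, 1); A = A + A.T  # toy graph
X = project_simplex(rng.random((n, k)))                          # soft labels

for _ in range(200):
    # Soft cut objective: minimize expected number of cut edges, i.e.
    # maximize sum_ij A_ij <x_i, x_j>; its gradient (symmetric A) is 2*A@X.
    grad = -2 * A @ X
    X = project_simplex(X - 0.05 * grad + 0.01 * rng.normal(size=X.shape))

print(X.round(2))   # rows are near-one-hot, i.e. an (unbalanced) partition
```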