Attention-based generative models for de novo molecular design

Dollar, Orion; Joshi, Nisarg; Beck, David A.; Pfaendtner, Jim

doi:10.1039/D1SC01050F

Attention mechanisms have led to many breakthroughs in sequential data modeling but have yet to be incorporated into any generative algorithms for molecular design. Here we explore the impact of adding self-attention layers to generative β-VAE models and show that those with attention are able to learn a complex “molecular grammar” while improving performance on downstream tasks such as accurately sampling from the latent space (“model memory”) or exploring novel chemistries not present in the training data. There is a notable relationship between a model's architecture, the structure of its latent memory and its performance during inference. We demonstrate that there is an unavoidable tradeoff between model exploration and validity that is a function of the complexity of the latent memory. However, novel sampling schemes may be used that optimize this tradeoff. We anticipate that attention will play an important role in future molecular design algorithms that can make efficient use of the detailed molecular substructures learned by the transformer.

Award ID(s): 1934292

PAR ID: 10282032

Date Published: 2021-06-23

Journal Name: Chemical Science

Volume: 12

Issue: 24

ISSN: 2041-6520

Page Range / eLocation ID: 8362-8372

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript
Journal Article: https://doi.org/10.1039/D1SC01050F
