De novo molecule design towards biased properties via a deep generative framework and iterative transfer learning

Sattari, Kianoosh; Li, Dawei; Kalita, Bhupalee; Xie, Yunchao; Lighvan, Fatemeh Barmaleki; Isayev, Olexandr; Lin, Jian

doi:10.1039/D3DD00210A

De novo design of molecules with targeted properties represents a new frontier in molecule development. Despite enormous progress, two main challenges remain: (i) generating novel molecules conditioned on targeted, continuous property values; (ii) obtaining molecules with property values beyond the range of the training data. To tackle these challenges, we propose a reinforced regressional and conditional generative adversarial network (RRCGAN) to generate chemically valid molecules with a targeted HOMO–LUMO energy gap (ΔEH–L) as a proof-of-concept study. As validated by density functional theory (DFT) calculations, 75% of the generated molecules have a relative error (RE) of <20% with respect to the targeted ΔEH–L values. To bias the generation toward ΔEH–L values beyond the range of the original training molecules, transfer learning was applied to iteratively retrain the RRCGAN model. After just two iterations, the mean ΔEH–L of the generated molecules increases from 5.9 eV in the initial training dataset to 8.7 eV. Qualitative and quantitative analyses reveal that the model has successfully captured the underlying structure–property relationship, which agrees well with established physical and chemical rules. These results present a trustworthy, purely data-driven methodology for the highly efficient generation of novel molecules with different targeted properties.
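The relative-error criterion quoted in the abstract can be illustrated with a minimal sketch. It assumes RE is defined as |DFT value − target| / |target|, which the abstract does not spell out; the function names and the numbers below are hypothetical and are not data from the paper.

```python
def relative_error(dft_gap, target_gap):
    """Relative error of a DFT-computed gap against its target (assumed definition)."""
    if target_gap == 0:
        raise ValueError("target_gap must be nonzero")
    return abs(dft_gap - target_gap) / abs(target_gap)


def fraction_within(dft_gaps, target_gaps, threshold=0.20):
    """Fraction of molecules whose relative error is strictly below `threshold`."""
    if len(dft_gaps) != len(target_gaps):
        raise ValueError("inputs must have the same length")
    if not dft_gaps:
        raise ValueError("inputs must not be empty")
    errors = [relative_error(d, t) for d, t in zip(dft_gaps, target_gaps)]
    return sum(1 for e in errors if e < threshold) / len(errors)


# Hypothetical HOMO-LUMO gaps in eV, for illustration only (not from the paper).
example_dft_gaps = [5.0, 6.6, 9.0, 4.0, 8.0]
example_target_gaps = [5.5, 6.0, 7.0, 4.1, 6.0]

print(fraction_within(example_dft_gaps, example_target_gaps))
```

With these made-up inputs, three of the five molecules fall under the 20% threshold. The same function applied to real DFT results would give the kind of summary statistic the abstract reports.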

Award ID(s): 2154428

PAR ID: 10408410

Author(s) / Creator(s): Sattari, Kianoosh; Li, Dawei; Kalita, Bhupalee; Xie, Yunchao; Lighvan, Fatemeh Barmaleki; Isayev, Olexandr; Lin, Jian

Publisher / Repository: RSC

Date Published: 2024-02-14

Journal Name: Digital Discovery

Volume: 3

Issue: 2

ISSN: 2635-098X

Page Range / eLocation ID: DOI: 10.26434/chemrxiv-2023-0zv2f-v2

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article:
https://doi.org/10.1039/D3DD00210A
