3M-Diffusion: Latent Multi-Modal Diffusion for Language-Guided Molecular Structure Generation

Zhu, H; Xiao, T; Honavar, V

Citation Details

Generating molecular structures with desired properties is a critical task with broad applications in drug discovery and materials design. We propose 3M-Diffusion, a novel multi-modal molecular graph generation method, to generate diverse, ideally novel molecular structures with desired properties. 3M-Diffusion encodes molecular graphs into a graph latent space which it then aligns with the text space learned by encoder based LLMs from textual descriptions. It then reconstructs the molecular structure and atomic attributes based on the given text descriptions using the molecule decoder. It then learns a probabilistic mapping from the text space to the latent molecular graph space using a diffusion model. The results of our extensive experiments on several datasets demonstrate that 3M-Diffusion can generate high-quality, novel and diverse molecular graphs that semantically match the textual description provided. The code is available on github. more »

Award ID(s):: 2020243

PAR ID:: 10566043

Author(s) / Creator(s):: Zhu, H; Xiao, T; Honavar, V

Publisher / Repository:: openreview.net

Date Published:: 2024-10-07

Subject(s) / Keyword(s):: Materials discovery Large Language Models Machine Learning

Format(s):: Medium: X

Location:: Philadelphia, PA, USA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this