NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Small molecule generation via disentangled representation learning

https://doi.org/10.1093/bioinformatics/btac296

Du, Yuanqi; Guo, Xiaojie; Wang, Yinkai; Shehu, Amarda; Zhao, Liang; Xu, ed., Jinbo (May 2022, Bioinformatics)

Abstract MotivationExpanding our knowledge of small molecules beyond what is known in nature or designed in wet laboratories promises to significantly advance cheminformatics, drug discovery, biotechnology and material science. In silico molecular design remains challenging, primarily due to the complexity of the chemical space and the non-trivial relationship between chemical structures and biological properties. Deep generative models that learn directly from data are intriguing, but they have yet to demonstrate interpretability in the learned representation, so we can learn more about the relationship between the chemical and biological space. In this article, we advance research on disentangled representation learning for small molecule generation. We build on recent work by us and others on deep graph generative frameworks, which capture atomic interactions via a graph-based representation of a small molecule. The methodological novelty is how we leverage the concept of disentanglement in the graph variational autoencoder framework both to generate biologically relevant small molecules and to enhance model interpretability. ResultsExtensive qualitative and quantitative experimental evaluation in comparison with state-of-the-art models demonstrate the superiority of our disentanglement framework. We believe this work is an important step to address key challenges in small molecule generation with deep generative frameworks. Availability and implementationTraining and generated data are made available at https://ieee-dataport.org/documents/dataset-disentangled-representation-learning-interpretable-molecule-generation. All code is made available at https://anonymous.4open.science/r/D-MolVAE-2799/. Supplementary informationSupplementary data are available at Bioinformatics online.
more » « less
SparseLLM: Towards Global Pruning of Pre-trained Language Models

Bai, Guangji; Li, Yijiang; Ling, Chen; Kim, Kibaek; Zhao, Liang (December 2024, NeurIPS)

Full Text Available
Controllable Data Generation by Deep Learning: A Review

https://doi.org/10.1145/3648609

Wang, Shiyu; Du, Yuanqi; Guo, Xiaojie; Pan, Bo; Qin, Zhaohui; Zhao, Liang (October 2024, ACM Computing Surveys)

Designing and generating new data under targeted properties has been attracting various critical applications such as molecule design, image editing and speech synthesis. Traditional hand-crafted approaches heavily rely on expertise experience and intensive human efforts, yet still suffer from the insufficiency of scientific knowledge and low throughput to support effective and efficient data generation. Recently, the advancement of deep learning has created the opportunity for expressive methods to learn the underlying representation and properties of data. Such capability provides new ways of determining the mutual relationship between the structural patterns and functional properties of the data and leveraging such relationships to generate structural data, given the desired properties. This article is a systematic review that explains this promising research area, commonly known as controllable deep data generation. First, the article raises the potential challenges and provides preliminaries. Then the article formally defines controllable deep data generation, proposes a taxonomy on various techniques and summarizes the evaluation metrics in this specific domain. After that, the article introduces exciting applications of controllable deep data generation, experimentally analyzes and compares existing works. Finally, this article highlights the promising future directions of controllable deep data generation and identifies five potential challenges.
more » « less
Full Text Available
Going Beyond XAI: A Systematic Survey for Explanation-Guided Learning

https://doi.org/10.1145/3644073

Gao, Yuyang; Gu, Siyi; Jiang, Junji; Hong, Sungsoo Ray; Yu, Dazhou; Zhao, Liang (July 2024, ACM Computing Surveys)

As the societal impact of Deep Neural Networks (DNNs) grows, the goals for advancing DNNs become more complex and diverse, ranging from improving a conventional model accuracy metric to infusing advanced human virtues such as fairness, accountability, transparency, and unbiasedness. Recently, techniques in Explainable Artificial Intelligence (XAI) have been attracting considerable attention and have tremendously helped Machine Learning (ML) engineers in understand AI models. However, at the same time, we started to witness the emerging need beyond XAI among AI communities; based on the insights learned from XAI, how can we better empower ML engineers in steering their DNNs so that the model’s reasonableness and performance can be improved as intended? This article provides a timely and extensive literature overview of the field Explanation-Guided Learning (EGL), a domain of techniques that steer the DNNs’ reasoning process by adding regularization, supervision, or intervention on model explanations. In doing so, we first provide a formal definition of EGL and its general learning paradigm. Second, an overview of the key factors for EGL evaluation, as well as summarization and categorization of existing evaluation procedures and metrics for EGL are provided. Finally, the current and potential future application areas and directions of EGL are discussed, and an extensive experimental study is presented aiming at providing comprehensive comparative studies among existing EGL models in various popular application domains, such as Computer Vision and Natural Language Processing domains. Additional resources related to event prediction are included in the article website:https://kugaoyang.github.io/EGL/
more » « less
Full Text Available
Deep Generative Model for Periodic Graphs

Wang, Shiyu; Guo, Xiaojie; Zhao, Liang (May 2023, The Thirty-Sixth Annual Conference on Neural Information Processing Systems (NeurIPS 2022))

Full Text Available
A Systematic Survey on Deep Generative Models for Graph Generation

https://doi.org/10.1109/TPAMI.2022.3214832

Guo, Xiaojie; Zhao, Liang (April 2023, IEEE Transactions on Pattern Analysis and Machine Intelligence)

Full Text Available
Dynamic Activation of Clients and Parameters for Federated Learning over Heterogeneous Graphs

https://doi.org/10.1109/ICDE55515.2023.00126

Gu, Zishan; Zhang, Ke; Bai, Guangji; Chen, Liang; Zhao, Liang; Yang, Carl (April 2023, The 39th IEEE International Conference on Data Engineering (ICDE 2023))

Full Text Available
"Temporal Domain Generalization with Drift-Aware Dynamic Neural Networks."

Bai, Guangji; Chen Ling; Liang Zhao. (March 2023, In The Eleventh International Conference on Learning Representations. 2022.)

Full Text Available
Open-ended Commonsense Reasoning with Unrestricted Answer Candidates

https://doi.org/10.18653/v1/2023.findings-emnlp.540

Ling, Chen; Zhang, Xuchao; Zhao, Xujiang; Liu, Yanchi; Cheng, Wei; Oishi, Mika; Osaki, Takao; Matsuda, Katsushi; Chen, Haifeng; Zhao, Liang (January 2023, Association for Computational Linguistics)

Full Text Available
Multi-objective Deep Data Generation with Correlated Property Control

Shiyu Wang, Xiaojie Guo (November 2022, The Thirty-Sixth Annual Conference on Neural Information Processing Systems (NeurIPS 2022))

Full Text Available

« Prev Next »

Search for: All records