Search for: All records

Creators/Authors contains: "Gao, Kaifu"

Algebraic graph-assisted bidirectional transformers for molecular property prediction

https://doi.org/10.1038/s41467-021-23720-w

Chen, Dong; Gao, Kaifu; Nguyen, Duc Duy; Chen, Xin; Jiang, Yi; Wei, Guo-Wei; Pan, Feng (December 2021, Nature Communications)
Abstract: The ability of molecular property prediction is of great significance to drug discovery, human health, and environmental protection. Despite considerable efforts, quantitative prediction of various molecular properties remains a challenge. Although some machine learning models, such as bidirectional encoder from transformer, can incorporate massive unlabeled molecular data into molecular representations via a self-supervised learning strategy, it neglects three-dimensional (3D) stereochemical information. Algebraic graph, specifically, element-specific multiscale weighted colored algebraic graph, embeds complementary 3D molecular information into graph invariants. We propose an algebraic graph-assisted bidirectional transformer (AGBT) framework by fusing representations generated by algebraic graph and bidirectional transformer, as well as a variety of machine learning algorithms, including decision trees, multitask learning, and deep neural networks. We validate the proposed AGBT framework on eight molecular datasets, involving quantitative toxicity, physical chemistry, and physiology datasets. Extensive numerical experiments have shown that AGBT is a state-of-the-art framework for molecular property prediction.
Full Text Available
Vaccine-escape and fast-growing mutations in the United Kingdom, the United States, Singapore, Spain, India, and other COVID-19-devastated countries

https://doi.org/10.1016/j.ygeno.2021.05.006

Wang, Rui; Chen, Jiahui; Gao, Kaifu; Wei, Guo-Wei (July 2021, Genomics)
Full Text Available
Prediction and mitigation of mutation threats to COVID-19 vaccines and antibody therapies

https://doi.org/10.1039/d1sc01203g

Chen, Jiahui; Gao, Kaifu; Wang, Rui; Wei, Guo-Wei (May 2021, Chemical Science)
Antibody therapeutics and vaccines are among our last resort to end the raging COVID-19 pandemic. They, however, are prone to over 5000 mutations on the spike (S) protein uncovered by a Mutation Tracker based on over 200 000 genome isolates. It is imperative to understand how mutations will impact vaccines and antibodies in development. In this work, we first study the mechanism, frequency, and ratio of mutations on the S protein which is the common target of most COVID-19 vaccines and antibody therapies. Additionally, we build a library of 56 antibody structures and analyze their 2D and 3D characteristics. Moreover, we predict the mutation-induced binding free energy (BFE) changes for the complexes of S protein and antibodies or ACE2. By integrating genetics, biophysics, deep learning, and algebraic topology, we reveal that most of the 462 mutations on the receptor-binding domain (RBD) will weaken the binding of S protein and antibodies and disrupt the efficacy and reliability of antibody therapies and vaccines. A list of 31 antibody disrupting mutants is identified, while many other disruptive mutations are detailed as well. We also unveil that about 65% of the existing RBD mutations, including those variants recently found in the United Kingdom (UK) and South Africa, will strengthen the binding between the S protein and human angiotensin-converting enzyme 2 (ACE2), resulting in more infectious COVID-19 variants. We discover the disparity between the extreme values of RBD mutation-induced BFE strengthening and weakening of the bindings with antibodies and angiotensin-converting enzyme 2 (ACE2), suggesting that SARS-CoV-2 is at an advanced stage of evolution for human infection, while the human immune system is able to produce optimized antibodies. This discovery, unfortunately, implies the vulnerability of current vaccines and antibody drugs to new mutations. Our predictions were validated by comparison with more than 1400 deep mutations on the S protein RBD. 
Our results show the urgent need to develop new mutation-resistant vaccines and antibodies and to prepare for seasonal vaccinations.
Full Text Available
Review of COVID-19 Antibody Therapies

https://doi.org/10.1146/annurev-biophys-062920-063711

Chen, Jiahui; Gao, Kaifu; Wang, Rui; Nguyen, Duc Duy; Wei, Guo-Wei (May 2021, Annual Review of Biophysics)
In the global health emergency caused by coronavirus disease 2019 (COVID-19), efficient and specific therapies are urgently needed. Compared with traditional small-molecular drugs, antibody therapies are relatively easy to develop; they are as specific as vaccines in targeting severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2); and they have thus attracted much attention in the past few months. This article reviews seven existing antibodies for neutralizing SARS-CoV-2 with 3D structures deposited in the Protein Data Bank (PDB). Five 3D antibody structures associated with the SARS-CoV spike (S) protein are also evaluated for their potential in neutralizing SARS-CoV-2. The interactions of these antibodies with the S protein receptor-binding domain (RBD) are compared with those between angiotensin-converting enzyme 2 and RBD complexes. Due to the orders of magnitude in the discrepancies of experimental binding affinities, we introduce topological data analysis, a variety of network models, and deep learning to analyze the binding strength and therapeutic potential of the 14 antibody–antigen complexes. The current COVID-19 antibody clinical trials, which are not limited to the S protein target, are also reviewed.
Full Text Available
Generative Network Complex for the Automated Generation of Drug-like Molecules

https://doi.org/10.1021/acs.jcim.0c00599

Gao, Kaifu; Nguyen, Duc Duy; Tu, Meihua; Wei, Guo-Wei (December 2020, Journal of Chemical Information and Modeling)
Full Text Available
Analysis of SARS-CoV-2 mutations in the United States suggests presence of four substrains and novel variants

https://doi.org/10.1038/s42003-021-01754-6

Wang, Rui; Chen, Jiahui; Gao, Kaifu; Hozumi, Yuta; Yin, Changchuan; Wei, Guo-Wei (February 2021, Communications Biology)

Abstract: SARS-CoV-2 has been mutating since it was first sequenced in early January 2020. Here, we analyze 45,494 complete SARS-CoV-2 genome sequences in the world to understand their mutations. Among them, 12,754 sequences are from the United States. Our analysis suggests the presence of four substrains and eleven top mutations in the United States. These eleven top mutations belong to 3 disconnected groups. The first and second groups consisting of 5 and 8 concurrent mutations are prevailing, while the other group with three concurrent mutations gradually fades out. Moreover, we reveal that female immune systems are more active than those of males in responding to SARS-CoV-2 infections. One of the top mutations, 27964C > T-(S24L) on ORF8, has an unusually strong gender dependence. Based on the analysis of all mutations on the spike protein, we uncover that two of four SARS-CoV-2 substrains in the United States become potentially more infectious.

Unveiling the molecular mechanism of SARS-CoV-2 main protease inhibition from 137 crystal structures using algebraic topology and deep learning

https://doi.org/10.1039/d0sc04641h

Nguyen, Duc Duy; Gao, Kaifu; Chen, Jiahui; Wang, Rui; Wei, Guo-Wei (November 2020, Chemical Science)
Currently, there is neither effective antiviral drugs nor vaccine for coronavirus disease 2019 (COVID-19) caused by acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Due to its high conservativeness and low similarity with human genes, SARS-CoV-2 main protease (Mpro) is one of the most favorable drug targets. However, the current understanding of the molecular mechanism of Mpro inhibition is limited by the lack of reliable binding affinity ranking and prediction of existing structures of Mpro–inhibitor complexes. This work integrates mathematics (i.e., algebraic topology) and deep learning (MathDL) to provide a reliable ranking of the binding affinities of 137 SARS-CoV-2 Mpro inhibitor structures. We reveal that Gly143 residue in Mpro is the most attractive site to form hydrogen bonds, followed by Glu166, Cys145, and His163. We also identify 71 targeted covalent bonding inhibitors. MathDL was validated on the PDBbind v2016 core set benchmark and a carefully curated SARS-CoV-2 inhibitor dataset to ensure the reliability of the present binding affinity prediction. The present binding affinity ranking, interaction analysis, and fragment decomposition offer a foundation for future drug discovery efforts.
Full Text Available
Repositioning of 8565 Existing Drugs for COVID-19

https://doi.org/10.1021/acs.jpclett.0c01579

Gao, Kaifu; Nguyen, Duc Duy; Chen, Jiahui; Wang, Rui; Wei, Guo-Wei (July 2020, The Journal of Physical Chemistry Letters)

Full Text Available
MathDL: mathematical deep learning for D3R Grand Challenge 4

https://doi.org/10.1007/s10822-019-00237-5

Nguyen, Duc Duy; Gao, Kaifu; Wang, Menglun; Wei, Guo-Wei (February 2020, Journal of Computer-Aided Molecular Design)

Full Text Available
Are 2D fingerprints still valuable for drug discovery?

https://doi.org/10.1039/D0CP00305K

Gao, Kaifu; Nguyen, Duc Duy; Sresht, Vishnu; Mathiowetz, Alan M.; Tu, Meihua; Wei, Guo-Wei (April 2020, Physical Chemistry Chemical Physics)

Recently, molecular fingerprints extracted from three-dimensional (3D) structures using advanced mathematics, such as algebraic topology, differential geometry, and graph theory have been paired with efficient machine learning, especially deep learning algorithms to outperform other methods in drug discovery applications and competitions. This raises the question of whether classical 2D fingerprints are still valuable in computer-aided drug discovery. This work considers 23 datasets associated with four typical problems, namely protein–ligand binding, toxicity, solubility and partition coefficient to assess the performance of eight 2D fingerprints. Advanced machine learning algorithms including random forest, gradient boosted decision tree, single-task deep neural network and multitask deep neural network are employed to construct efficient 2D-fingerprint based models. Additionally, appropriate consensus models are built to further enhance the performance of 2D-fingerprint-based methods. It is demonstrated that 2D-fingerprint-based models perform as well as the state-of-the-art 3D structure-based models for the predictions of toxicity, solubility, partition coefficient and protein–ligand binding affinity based on only ligand information. However, 3D structure-based models outperform 2D fingerprint-based methods in complex-based protein–ligand binding affinity predictions.
Full Text Available
