NSF PAR Search | NSF Public Access Repository

Deep Learning to Predict Protein Backbone Structure from High-Resolution Cryo-EM Density Maps

https://doi.org/10.1038/s41598-020-60598-y

Si, Dong; Moritz, Spencer A.; Pfab, Jonas; Hou, Jie; Cao, Renzhi; Wang, Liguo; Wu, Tianqi; Cheng, Jianlin (December 2020, Scientific Reports)

Full Text Available
Outcomes of the EMDataResource cryo-EM Ligand Modeling Challenge

https://doi.org/10.1038/s41592-024-02321-7

Lawson, Catherine L; Kryshtafovych, Andriy; Pintilie, Grigore D; Burley, Stephen K; Černý, Jiří; Chen, Vincent B; Emsley, Paul; Gobbi, Alberto; Joachimiak, Andrzej; Noreng, Sigrid; et al (July 2024, Nature Methods)

Full Text Available
Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13

https://doi.org/10.1002/prot.25697

Hou, Jie; Wu, Tianqi; Cao, Renzhi; Cheng, Jianlin (April 2019, Proteins: Structure, Function, and Bioinformatics)

Abstract Predicting residue‐residue distance relationships (e.g., contacts) has become the key direction to advance protein structure prediction since the 2014 CASP11 experiment, while deep learning has revolutionized the technology for contact and distance distribution prediction since its debut in the 2012 CASP10 experiment. During the 2018 CASP13 experiment, we enhanced our MULTICOM protein structure prediction system with three major components: contact distance prediction based on deep convolutional neural networks, distance‐driven template‐free (ab initio) modeling, and protein model ranking empowered by deep learning and contact prediction. Our experiment demonstrates that contact distance prediction and deep learning methods are the key reasons that MULTICOM was ranked 3rd out of all 98 predictors in both template‐free and template‐based structure modeling in CASP13. Deep convolutional neural networks can utilize global information in pairwise residue‐residue features such as coevolution scores to substantially improve contact distance prediction, which played a decisive role in correctly folding some free modeling and hard template‐based modeling targets. Deep learning also successfully integrated one‐dimensional structural features, two‐dimensional contact information, and three‐dimensional structural quality scores to improve protein model quality assessment, where contact prediction was demonstrated to consistently enhance the ranking of protein models for the first time. The success of the MULTICOM system clearly shows that protein contact distance prediction and model selection driven by deep learning hold the key to solving the protein structure prediction problem. However, there are still challenges in accurately predicting protein contact distance when there are few homologous sequences, folding proteins from noisy contact distances, and ranking models of hard targets.
Artificial intelligence advances for de novo molecular structure modeling in cryo‐electron microscopy

https://doi.org/10.1002/wcms.1542

Si, Dong; Nakamura, Andrew; Tang, Runbang; Guan, Haowen; Hou, Jie; Firozi, Ammaar; Cao, Renzhi; Hippe, Kyle; Zhao, Minglei (May 2021, WIREs Computational Molecular Science)

Abstract Cryo‐electron microscopy (cryo‐EM) has become a major experimental technique to determine the structures of large protein complexes and molecular assemblies, as evidenced by the 2017 Nobel Prize. Although cryo‐EM has been drastically improved to generate high‐resolution three‐dimensional maps that contain detailed structural information about macromolecules, the computational methods for using the data to automatically build structure models are lagging far behind. The traditional cryo‐EM model building approach is template‐based homology modeling. Manual de novo modeling is very time‐consuming when no template model is found in the database. In recent years, de novo cryo‐EM modeling using machine learning (ML) and deep learning (DL) has ranked among the top‐performing methods in macromolecular structure modeling. DL‐based de novo cryo‐EM modeling is an important application of artificial intelligence, with impressive results and great potential for the next generation of molecular biomedicine. Accordingly, we systematically review the representative ML/DL‐based de novo cryo‐EM modeling methods. Their significance is discussed from both practical and methodological viewpoints. We also briefly describe the background of the cryo‐EM data processing workflow. Overall, this review provides an introductory guide to modern research on artificial intelligence for de novo molecular structure modeling and future directions in this emerging field. This article is categorized under: Structure and Mechanism > Molecular Structures; Structure and Mechanism > Computational Biochemistry and Biophysics; Data Science > Artificial Intelligence/Machine Learning
Cryo-EM model validation recommendations based on outcomes of the 2019 EMDataResource challenge

https://doi.org/10.1038/s41592-020-01051-w

Lawson, Catherine L.; Kryshtafovych, Andriy; Adams, Paul D.; Afonine, Pavel V.; Baker, Matthew L.; Barad, Benjamin A.; Bond, Paul; Burnley, Tom; Cao, Renzhi; Cheng, Jianlin; et al (February 2021, Nature Methods)

Abstract This paper describes outcomes of the 2019 Cryo-EM Model Challenge. The goals were to (1) assess the quality of models that can be produced from cryogenic electron microscopy (cryo-EM) maps using current modeling software, (2) evaluate reproducibility of modeling results from different software developers and users and (3) compare performance of current metrics used for model evaluation, particularly Fit-to-Map metrics, with focus on near-atomic resolution. Our findings demonstrate the relatively high accuracy and reproducibility of cryo-EM models derived by 13 participating teams from four benchmark maps, including three forming a resolution series (1.8 to 3.1 Å). The results permit specific recommendations to be made about validating near-atomic cryo-EM structures both in the context of individual experiments and structure data archives such as the Protein Data Bank. We recommend the adoption of multiple scoring parameters to provide full and objective annotation and assessment of the model, reflective of the observed cryo-EM map density.
An analysis and evaluation of the WeFold collaborative for protein structure prediction and its pipelines in CASP11 and CASP12

https://doi.org/10.1038/s41598-018-26812-8

Keasar, Chen; McGuffin, Liam J.; Wallner, Björn; Chopra, Gaurav; Adhikari, Badri; Bhattacharya, Debswapna; Blake, Lauren; Bortot, Leandro Oliveira; Cao, Renzhi; Dhanasekaran, B. K.; et al (December 2018, Scientific Reports)

Full Text Available
The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

https://doi.org/10.1186/s13059-019-1835-8

Zhou, Naihui; Jiang, Yuxiang; Bergquist, Timothy R.; Lee, Alexandra J.; Kacsoh, Balint Z.; Crocker, Alex W.; Lewis, Kimberley A.; Georghiou, George; Nguyen, Huy N.; Hamid, Md Nafiz; et al (December 2019, Genome Biology)

Full Text Available