NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Advances in Language-Model-Informed Protein–Nucleic Acid Binding Site Prediction

https://doi.org/10.1007/978-1-0716-4623-6_9

Tarafder, Sumit; Wang, Xinyu; Roche, Rahmatullah; Bhattacharya, Debswapna (January 2025, Springer US)

Full Text Available
The landscape of RNA 3D structure modeling with transformer networks

https://doi.org/10.1093/biomethods/bpae047

Tarafder, Sumit; Roche, Rahmatullah; Bhattacharya, Debswapna (January 2024, Biology Methods and Protocols)

Abstract Transformers are a powerful subclass of neural networks catalyzing the development of a growing number of computational methods for RNA structure modeling. Here, we conduct an objective and empirical study of the predictive modeling accuracy of the emerging transformer-based methods for RNA structure prediction. Our study reveals multi-faceted complementarity between the methods and underscores some key aspects that affect the prediction accuracy.
more » « less
Full Text Available
EquiPNAS: improved protein–nucleic acid binding site prediction using protein-language-model-informed equivariant deep graph neural networks

https://doi.org/10.1093/nar/gkae039

Roche, Rahmatullah; Moussad, Bernard; Shuvo, Md Hossain; Tarafder, Sumit; Bhattacharya, Debswapna (January 2024, Nucleic Acids Research)

Abstract Protein language models (pLMs) trained on a large corpus of protein sequences have shown unprecedented scalability and broad generalizability in a wide range of predictive modeling tasks, but their power has not yet been harnessed for predicting protein–nucleic acid binding sites, critical for characterizing the interactions between proteins and nucleic acids. Here, we present EquiPNAS, a new pLM-informed E(3) equivariant deep graph neural network framework for improved protein–nucleic acid binding site prediction. By combining the strengths of pLM and symmetry-aware deep graph learning, EquiPNAS consistently outperforms the state-of-the-art methods for both protein–DNA and protein–RNA binding site prediction on multiple datasets across a diverse set of predictive modeling scenarios ranging from using experimental input to AlphaFold2 predictions. Our ablation study reveals that the pLM embeddings used in EquiPNAS are sufficiently powerful to dramatically reduce the dependence on the availability of evolutionary information without compromising on accuracy, and that the symmetry-aware nature of the E(3) equivariant graph-based neural architecture offers remarkable robustness and performance resilience. EquiPNAS is freely available at https://github.com/Bhattacharya-Lab/EquiPNAS.
more » « less
Full Text Available
The transformative power of transformers in protein structure prediction

https://doi.org/10.1073/pnas.2303499120

Moussad, Bernard; Roche, Rahmatullah; Bhattacharya, Debswapna (August 2023, Proceedings of the National Academy of Sciences)

Transformer neural networks have revolutionized structural biology with the ability to predict protein structures at unprecedented high accuracy. Here, we report the predictive modeling performance of the state-of-the-art protein structure prediction methods built on transformers for 69 protein targets from the recently concluded 15th Critical Assessment of Structure Prediction (CASP15) challenge. Our study shows the power of transformers in protein structure modeling and highlights future areas of improvement.
more » « less
Full Text Available
E(3) equivariant graph neural networks for robust and accurate protein-protein interaction site prediction

https://doi.org/10.1371/journal.pcbi.1011435

Roche, Rahmatullah; Moussad, Bernard; Shuvo, Md Hossain; Bhattacharya, Debswapna (August 2023, PLOS Computational Biology)
Li, Jinyan (Ed.)
Artificial intelligence-powered protein structure prediction methods have led to a paradigm-shift in computational structural biology, yet contemporary approaches for predicting the interfacial residues (i.e., sites) of protein-protein interaction (PPI) still rely on experimental structures. Recent studies have demonstrated benefits of employing graph convolution for PPI site prediction, but ignore symmetries naturally occurring in 3-dimensional space and act only on experimental coordinates. Here we present EquiPPIS, an E(3) equivariant graph neural network approach for PPI site prediction. EquiPPIS employs symmetry-aware graph convolutions that transform equivariantly with translation, rotation, and reflection in 3D space, providing richer representations for molecular data compared to invariant convolutions. EquiPPIS substantially outperforms state-of-the-art approaches based on the same experimental input, and exhibits remarkable robustness by attaining better accuracy with predicted structural models from AlphaFold2 than what existing methods can achieve even with experimental structures. Freely available athttps://github.com/Bhattacharya-Lab/EquiPPIS, EquiPPIS enables accurate PPI site prediction at scale.
more » « less
Full Text Available
Contact-Assisted Threading in Low-Homology Protein Modeling

Bhattacharya, Sutanu; Roche, Rahmatullah; Shuvo, Md Hossain; Moussad, Bernard; Bhattacharya, Debswapna (March 2023, Methods in Molecular Biology)
Filipek, Sławomir (Ed.)
Full Text Available
PIQLE: protein–protein interface quality estimation by deep graph learning of multimeric interaction geometries

https://doi.org/10.1093/bioadv/vbad070

Shuvo, Md Hossain; Karim, Mohimenul; Roche, Rahmatullah; Bhattacharya, Debswapna (January 2023, Bioinformatics Advances)
Gromiha, Michael (Ed.)
Abstract Motivation Accurate modeling of protein–protein interaction interface is essential for high-quality protein complex structure prediction. Existing approaches for estimating the quality of a predicted protein complex structural model utilize only the physicochemical properties or energetic contributions of the interacting atoms, ignoring evolutionarily information or inter-atomic multimeric geometries, including interaction distance and orientations. Results Here, we present PIQLE, a deep graph learning method for protein–protein interface quality estimation. PIQLE leverages multimeric interaction geometries and evolutionarily information along with sequence- and structure-derived features to estimate the quality of individual interactions between the interfacial residues using a multi-head graph attention network and then probabilistically combines the estimated quality for scoring the overall interface. Experimental results show that PIQLE consistently outperforms existing state-of-the-art methods including DProQA, TRScore, GNN-DOVE and DOVE on multiple independent test datasets across a wide range of evaluation metrics. Our ablation study and comparison with the self-assessment module of AlphaFold-Multimer repurposed for protein complex scoring reveal that the performance gains are connected to the effectiveness of the multi-head graph attention network in leveraging multimeric interaction geometries and evolutionary information along with other sequence- and structure-derived features adopted in PIQLE. Availability and implementation An open-source software implementation of PIQLE is freely available at https://github.com/Bhattacharya-Lab/PIQLE. Supplementary information Supplementary data are available at Bioinformatics Advances online.
more » « less
Full Text Available
rrQNet : Protein contact map quality estimation by deep evolutionary reconciliation

https://doi.org/10.1002/prot.26394

Roche, Rahmatullah; Bhattacharya, Sutanu; Shuvo, Md. Hossain; Bhattacharya, Debswapna (June 2022, Proteins: Structure, Function, and Bioinformatics)

Full Text Available
DisCovER : distance‐ and orientation‐based covariational threading for weakly homologous proteins

https://doi.org/10.1002/prot.26254

Bhattacharya, Sutanu; Roche, Rahmatullah; Moussad, Bernard; Bhattacharya, Debswapna (February 2022, Proteins: Structure, Function, and Bioinformatics)

Full Text Available
Hybridized distance- and contact-based hierarchical structure modeling for folding soluble and membrane proteins

https://doi.org/10.1371/journal.pcbi.1008753

Roche, Rahmatullah; Bhattacharya, Sutanu; Bhattacharya, Debswapna (February 2021, PLOS Computational Biology)
Kolodny, Rachel (Ed.)
Crystallography and NMR system (CNS) is currently a widely used method for fragment-free ab initio protein folding from inter-residue distance or contact maps. Despite its widespread use in protein structure prediction, CNS is a decade-old macromolecular structure determination system that was originally developed for solving macromolecular geometry from experimental restraints as opposed to predictive modeling driven by interaction map data. As such, the adaptation of the CNS experimental structure determination protocol for ab initio protein folding is intrinsically anomalous that may undermine the folding accuracy of computational protein structure prediction. In this paper, we propose a new CNS-free hierarchical structure modeling method called DConStruct for folding both soluble and membrane proteins driven by distance and contact information. Rigorous experimental validation shows that DConStruct attains much better reconstruction accuracy than CNS when tested with the same input contact map at varying contact thresholds. The hierarchical modeling with iterative self-correction employed in DConStruct scales at a much higher degree of folding accuracy than CNS with the increase in contact thresholds, ultimately approaching near-optimal reconstruction accuracy at higher-thresholded contact maps. The folding accuracy of DConStruct can be further improved by exploiting distance-based hybrid interaction maps at tri-level thresholding, as demonstrated by the better performance of our method in folding free modeling targets from the 12th and 13th rounds of the Critical Assessment of techniques for protein Structure Prediction (CASP) experiments compared to popular CNS- and fragment-based approaches and energy-minimization protocols, some of which even using much finer-grained distance maps than ours. Additional large-scale benchmarking shows that DConStruct can significantly improve the folding accuracy of membrane proteins compared to a CNS-based approach. These results collectively demonstrate the feasibility of greatly improving the accuracy of ab initio protein folding by optimally exploiting the information encoded in inter-residue interaction maps beyond what is possible by CNS.
more » « less
Full Text Available

« Prev Next »

Search for: All records