NSF PAR Search | NSF Public Access Repository


To pack or not to pack: revisiting protein side-chain packing in the post-AlphaFold era

https://doi.org/10.1093/bib/bbaf297

Vangaru, Sriniketh; Bhattacharya, Debswapna (June 2025, Briefings in Bioinformatics)

Abstract Protein side-chain packing (PSCP), the problem of predicting side-chain conformations given a fixed backbone structure, has important implications in the modeling of structures and interactions. However, despite the groundbreaking progress in protein structure prediction pioneered by AlphaFold, the existing PSCP methods still rely on experimental inputs, and do not leverage AlphaFold-predicted backbone coordinates to enable PSCP at scale. Here, we perform a large-scale benchmarking of the predictive performance of various PSCP methods on public datasets from multiple rounds of the Critical Assessment of Structure Prediction challenges using a diverse set of evaluation metrics. Empirical results demonstrate that the PSCP methods perform well in packing the side-chains with experimental inputs, but they fail to generalize in repacking AlphaFold-generated structures. We additionally explore the effectiveness of leveraging the self-assessment confidence scores from AlphaFold by implementing a backbone confidence-aware integrative approach. While such a protocol often leads to performance improvement by attaining modest yet statistically significant accuracy gains over the AlphaFold baseline, it does not yield consistent and pronounced improvements. Our study highlights the recent advances and remaining challenges in PSCP in the post-AlphaFold era.
NEFFy: A Versatile Tool for Computing the Number of Effective Sequences

https://doi.org/10.1093/bioinformatics/btaf222

Haghani, Maryam; Bhattacharya, Debswapna; Murali, T M (June 2025, Bioinformatics)
Cheng, Jianlin (Ed.)
Abstract Motivation: A Multiple Sequence Alignment (MSA) contains fundamental evolutionary information that is useful in the prediction of structure and function of proteins and nucleic acids. The “Number of Effective Sequences” (NEFF) quantifies the diversity of sequences of an MSA. While several tools embed NEFF calculation with various options, none are standalone tools for this purpose, and they do not offer all the available options. Results: We developed NEFFy, the first software package to integrate all these options and calculate NEFF across diverse MSA formats for proteins, RNAs, and DNAs. It surpasses existing tools in functionality without compromising computational efficiency and scalability. NEFFy also offers per-residue NEFF calculation and supports NEFF computation for MSAs of multimeric proteins, with the capability to be extended to DNAs and RNAs. Availability and Implementation: NEFFy is released as open-source software under the GNU Public License v3.0. The source code in C++ and a Python wrapper are available at https://github.com/Maryam-Haghani/NEFFy. To ensure users can fully leverage these capabilities, comprehensive documentation and examples are provided at https://Maryam-Haghani.github.io/NEFFy. Supplementary Information: Supplementary data are available at Bioinformatics online.
Free, publicly-accessible full text available June 3, 2026
Sifting through the noise: A survey of diffusion probabilistic models and their applications to biomolecules

https://doi.org/10.1016/j.jmb.2024.168818

Norton, Trevor; Bhattacharya, Debswapna (March 2025, Journal of Molecular Biology)

Free, publicly-accessible full text available March 1, 2026
EquiRank: Improved protein-protein interface quality estimation using protein language-model-informed equivariant graph neural networks

https://doi.org/10.1016/j.csbj.2024.12.015

Shuvo, Md Hossain; Bhattacharya, Debswapna (January 2025, Computational and Structural Biotechnology Journal)

Free, publicly-accessible full text available January 1, 2026
Advances in Language-Model-Informed Protein–Nucleic Acid Binding Site Prediction

https://doi.org/10.1007/978-1-0716-4623-6_9

Tarafder, Sumit; Wang, Xinyu; Roche, Rahmatullah; Bhattacharya, Debswapna (January 2025, Springer US)

Free, publicly-accessible full text available January 1, 2026
The landscape of RNA 3D structure modeling with transformer networks

https://doi.org/10.1093/biomethods/bpae047

Tarafder, Sumit; Roche, Rahmatullah; Bhattacharya, Debswapna (July 2024, Biology Methods and Protocols)

Abstract Transformers are a powerful subclass of neural networks catalyzing the development of a growing number of computational methods for RNA structure modeling. Here, we conduct an objective and empirical study of the predictive modeling accuracy of the emerging transformer-based methods for RNA structure prediction. Our study reveals multi-faceted complementarity between the methods and underscores some key aspects that affect the prediction accuracy.
lociPARSE: A Locality-aware Invariant Point Attention Model for Scoring RNA 3D Structures

https://doi.org/10.1021/acs.jcim.4c01621

Tarafder, Sumit; Bhattacharya, Debswapna (November 2024, Journal of Chemical Information and Modeling)
EquiPNAS: improved protein–nucleic acid binding site prediction using protein-language-model-informed equivariant deep graph neural networks

https://doi.org/10.1093/nar/gkae039

Roche, Rahmatullah; Moussad, Bernard; Shuvo, Md Hossain; Tarafder, Sumit; Bhattacharya, Debswapna (January 2024, Nucleic Acids Research)

Abstract Protein language models (pLMs) trained on a large corpus of protein sequences have shown unprecedented scalability and broad generalizability in a wide range of predictive modeling tasks, but their power has not yet been harnessed for predicting protein–nucleic acid binding sites, critical for characterizing the interactions between proteins and nucleic acids. Here, we present EquiPNAS, a new pLM-informed E(3) equivariant deep graph neural network framework for improved protein–nucleic acid binding site prediction. By combining the strengths of pLM and symmetry-aware deep graph learning, EquiPNAS consistently outperforms the state-of-the-art methods for both protein–DNA and protein–RNA binding site prediction on multiple datasets across a diverse set of predictive modeling scenarios ranging from using experimental input to AlphaFold2 predictions. Our ablation study reveals that the pLM embeddings used in EquiPNAS are sufficiently powerful to dramatically reduce the dependence on the availability of evolutionary information without compromising on accuracy, and that the symmetry-aware nature of the E(3) equivariant graph-based neural architecture offers remarkable robustness and performance resilience. EquiPNAS is freely available at https://github.com/Bhattacharya-Lab/EquiPNAS.
The transformative power of transformers in protein structure prediction

https://doi.org/10.1073/pnas.2303499120

Moussad, Bernard; Roche, Rahmatullah; Bhattacharya, Debswapna (August 2023, Proceedings of the National Academy of Sciences)

Transformer neural networks have revolutionized structural biology with the ability to predict protein structures at unprecedented high accuracy. Here, we report the predictive modeling performance of the state-of-the-art protein structure prediction methods built on transformers for 69 protein targets from the recently concluded 15th Critical Assessment of Structure Prediction (CASP15) challenge. Our study shows the power of transformers in protein structure modeling and highlights future areas of improvement.
Full Text Available
E(3) equivariant graph neural networks for robust and accurate protein-protein interaction site prediction

https://doi.org/10.1371/journal.pcbi.1011435

Roche, Rahmatullah; Moussad, Bernard; Shuvo, Md Hossain; Bhattacharya, Debswapna (August 2023, PLOS Computational Biology)
Li, Jinyan (Ed.)
Artificial intelligence-powered protein structure prediction methods have led to a paradigm shift in computational structural biology, yet contemporary approaches for predicting the interfacial residues (i.e., sites) of protein-protein interaction (PPI) still rely on experimental structures. Recent studies have demonstrated benefits of employing graph convolution for PPI site prediction, but ignore symmetries naturally occurring in 3-dimensional space and act only on experimental coordinates. Here we present EquiPPIS, an E(3) equivariant graph neural network approach for PPI site prediction. EquiPPIS employs symmetry-aware graph convolutions that transform equivariantly with translation, rotation, and reflection in 3D space, providing richer representations for molecular data compared to invariant convolutions. EquiPPIS substantially outperforms state-of-the-art approaches based on the same experimental input, and exhibits remarkable robustness by attaining better accuracy with predicted structural models from AlphaFold2 than what existing methods can achieve even with experimental structures. Freely available at https://github.com/Bhattacharya-Lab/EquiPPIS, EquiPPIS enables accurate PPI site prediction at scale.
Full Text Available
