Improving the generalizability of protein-ligand binding predictions with AI-Bind

Chatterjee, Ayan; Walters, Robin; Shafi, Zohair; Ahmed, Omair Shafi; Sebek, Michael; Gysi, Deisy; Yu, Rose; Eliassi-Rad, Tina; Barabási, Albert-László; Menichetti, Giulia

doi:10.1038/s41467-023-37572-z

Citation Details

Improving the generalizability of protein-ligand binding predictions with AI-Bind

Identifying novel drug-target interactions is a critical and rate-limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, here we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortcuts that leverage the topology of the protein-ligand bipartite network, rather than learning the node features. Here we introduce AI-Bind, a pipeline that combines network-based sampling strategies with unsupervised pre-training to improve binding predictions for novel proteins and ligands. We validate AI-Bind predictions via docking simulations and comparison with recent experimental evidence, and step up the process of interpreting machine learning prediction of protein-ligand binding by identifying potential active binding sites on the amino acid sequence. AI-Bind is a high-throughput approach to identify drug-target combinations with the potential of becoming a powerful tool in drug discovery. more »

Award ID(s):: 1741197

PAR ID:: 10480787

Author(s) / Creator(s):: Chatterjee, Ayan; Walters, Robin; Shafi, Zohair; Ahmed, Omair Shafi; Sebek, Michael; Gysi, Deisy; Yu, Rose; Eliassi-Rad, Tina; Barabási, Albert-László; Menichetti, Giulia

Publisher / Repository:: Springer Nature

Date Published:: 2023-12-01

Journal Name:: Nature Communications

Volume:: 14

Issue:: 1

ISSN:: 2041-1723

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1038/s41467-023-37572-z

More Like this