Cross-Modality Protein Embedding for Compound-Protein Affinity and Contact Prediction

You, Yuning; Shen, Yang

Citation Details

Compound-protein pairs dominate FDA-approved drug-target pairs and the prediction of compound-protein affinity and contact (CPAC) could help accelerate drug discovery. In this study we consider proteins as multi-modal data including 1D amino-acid sequences and (sequence-predicted) 2D residue-pair contact maps. We empirically evaluate the embeddings of the two single modalities in their accuracyand generalizability of CPAC prediction (i.e. structure-free interpretable compound-protein affinity prediction). And we rationalize their performances in both challenges of embedding individual modalities and learning generalizable embedding-label relationship. We further propose two models involving cross-modality protein embedding and establish that the one with cross interaction (thus capturing correlations among modalities) outperforms SOTAs and our single modality models in affinity, contact, and binding-site predictions for proteins never seen in the training set. more »

Award ID(s):: 1943008

PAR ID:: 10225270

Author(s) / Creator(s):: You, Yuning; Shen, Yang

Date Published:: 2020-12-12

Journal Name:: Machine Learning for Structure Biology (MLSB) Workshop at the 34th Conference on Neural Information Processing Systems

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this