TT3D: Leveraging precomputed protein 3D sequence models to predict protein–protein interactions

Sledzieski, Samuel (ORCID:0000000201703029); Devkota, Kapil (ORCID:0000000260936260); Singh, Rohit (ORCID:0000000240847340); Cowen, Lenore (ORCID:0000000166986413); Berger, Bonnie (ORCID:0000000227247228); Elofsson, ed., Arne

doi:10.1093/bioinformatics/btad663

Citation Details

TT3D: Leveraging precomputed protein 3D sequence models to predict protein–protein interactions

Abstract Motivation

High-quality computational structural models are now precomputed and available for nearly every protein in UniProt. However, the best way to leverage these models to predict which pairs of proteins interact in a high-throughput manner is not immediately clear. The recent Foldseek method of van Kempen et al. encodes the structural information of distances and angles along the protein backbone into a linear string of the same length as the protein string, using tokens from a 21-letter discretized structural alphabet (3Di).

Results

We show that using both the amino acid sequence and the 3Di sequence generated by Foldseek as inputs to our recent deep-learning method, Topsy-Turvy, substantially improves the performance of predicting protein–protein interactions cross-species. Thus TT3D (Topsy-Turvy 3D) presents a way to reuse all the computational effort going into producing high-quality structural models from sequence, while being sufficiently lightweight so that high-quality binary protein–protein interaction predictions across all protein pairs can be made genome-wide.

Availability and Implementation

TT3D is available at https://github.com/samsledje/D-SCRIPT. An archived version of the code at time of submission can be found at https://zenodo.org/records/10037674.

NSF-PAR ID:: 10473636

Author(s) / Creator(s):: Sledzieski, Samuel; Devkota, Kapil; Singh, Rohit; Cowen, Lenore; Berger, Bonnie; Elofsson, ed., Arne

Publisher / Repository:: Oxford University Press

Date Published:: 2023-10-28

Journal Name:: Bioinformatics

Volume:: 39

Issue:: 11

ISSN:: 1367-4811

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1093/bioinformatics/btad663

More Like this