Title: Computed structures of core eukaryotic protein complexes
Protein-protein interactions play critical roles in biology, but the structures of many eukaryotic protein complexes are unknown, and there are likely many interactions not yet identified. We take advantage of advances in proteome-wide amino acid coevolution analysis and deep-learning–based structure modeling to systematically identify and build accurate models of core eukaryotic protein complexes within the Saccharomyces cerevisiae proteome. We use a combination of RoseTTAFold and AlphaFold to screen through paired multiple sequence alignments for 8.3 million pairs of yeast proteins, identify 1505 likely to interact, and build structure models for 106 previously unidentified assemblies and 806 that have not been structurally characterized. These complexes, which have as many as five subunits, play roles in almost all key processes in eukaryotic cells and provide broad insights into biological function.
Award ID(s):
1937533
PAR ID:
10392063
Journal Name:
Science
Volume:
374
Issue:
6573
ISSN:
0036-8075
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Physical interactions of proteins play key functional roles in many important cellular processes. To understand molecular mechanisms of such functions, it is crucial to determine the structure of protein complexes. To complement experimental approaches, which usually take a considerable amount of time and resources, various computational methods have been developed for predicting the structures of protein complexes. In computational modeling, one of the challenges is to identify near-native structures from a large pool of generated models. Here, we developed a deep learning–based approach named Graph Neural Network–based DOcking decoy eValuation scorE (GNN-DOVE). To evaluate a protein docking model, GNN-DOVE extracts the interface area and represents it as a graph. The chemical properties of atoms and the inter-atom distances are used as features of nodes and edges in the graph, respectively. GNN-DOVE was trained, validated, and tested on docking models in the Dockground database and further tested on a combined dataset of Dockground and ZDOCK benchmark as well as a CAPRI scoring dataset. GNN-DOVE performed better than existing methods, including DOVE, which is our previous development that uses a convolutional neural network on voxelized structure models.
  2. The cluster of differentiation 36 (CD36) domain defines the characteristic ectodomain associated with class B scavenger receptor (SR-B) proteins. In bilaterians, SR-Bs play critical roles in diverse biological processes including innate immunity functions such as pathogen recognition and apoptotic cell clearance, as well as metabolic sensing associated with fatty acid uptake and cholesterol transport. Although previous studies suggest this protein family is ancient, SR-B diversity across Eukarya has not been robustly characterized. We analyzed SR-B homologs identified from the genomes and transcriptomes of 165 diverse eukaryotic species. The presence of highly conserved amino acid motifs across major eukaryotic supergroups supports the presence of an SR-B homolog in the last eukaryotic common ancestor. Our comparative analyses of SR-B protein structure identify the retention of a canonical asymmetric beta barrel tertiary structure within the CD36 ectodomain across Eukarya. We also identify multiple instances of independent lineage-specific sequence expansions in the apex region of the CD36 ectodomain, a region functionally associated with ligand-sensing. We hypothesize that a combination of both sequence expansion and structural variation in the CD36 apex region may reflect the evolution of SR-B ligand-sensing specificity between diverse eukaryotic clades.
  3. Motivation: Many important cellular processes involve physical interactions of proteins. Therefore, determining protein quaternary structures provides critical insights for understanding the molecular mechanisms by which the complexes function. To complement experimental methods, many computational methods have been developed to predict structures of protein complexes. One of the challenges in computational protein complex structure prediction is to identify near-native models from a large pool of generated models. Results: We developed a convolutional deep neural network-based approach named DOcking decoy selection with Voxel-based deep neural nEtwork (DOVE) for evaluating protein docking models. To evaluate a protein docking model, DOVE scans the protein–protein interface of the model with a 3D voxel and considers atomic interaction types and their energetic contributions as input features applied to the neural network. The deep learning models were trained and validated on docking models available in the ZDock and DockGround databases. Among the different combinations of features tested, almost all outperformed existing scoring functions. Availability and implementation: Codes available at http://github.com/kiharalab/DOVE, http://kiharalab.org/dove/. Supplementary information: Supplementary data are available at Bioinformatics online.
  4. RNA molecules often play critical roles in assisting the formation of membraneless organelles in eukaryotic cells. Yet, little is known about the organization of RNAs within membraneless organelles. Here, using super-resolution imaging and nuclear speckles as a model system, we demonstrate that different sequence domains of RNA transcripts exhibit differential spatial distributions within speckles. Specifically, we image transcripts containing a region enriched in binding motifs of serine/arginine-rich (SR) proteins and another region enriched in binding motifs of heterogeneous nuclear ribonucleoproteins (hnRNPs). We show that these transcripts localize to the outer shell of speckles, with the SR motif-rich region localizing closer to the speckle center relative to the hnRNP motif-rich region. Further, we identify that this intra-speckle RNA organization is driven by the strength of RNA-protein interactions inside and outside speckles. Our results hint at novel functional roles of nuclear speckles and likely other membraneless organelles in organizing RNA substrates for biochemical reactions. 
  5. Driving mechanisms of many biological functions in a cell include physical interactions of proteins. As protein-protein interactions (PPIs) are also important in disease development, PPIs have been highlighted in the pharmaceutical industry as possible therapeutic targets in recent years. To understand the variety of PPIs in a proteome, it is essential to establish a method that can identify similarity and dissimilarity between PPIs for inferring the binding of similar molecules, including drugs and other proteins. In this study, we developed a novel method, PPI-Surfer, which compares and quantifies similarity of local surface regions of PPIs. PPI-Surfer represents a PPI surface with overlapping surface patches, each of which is described with a three-dimensional Zernike descriptor (3DZD), a compact mathematical representation of a 3D function. 3DZD captures both the 3D shape and physicochemical properties of the protein surface. The performance of PPI-Surfer was benchmarked on datasets of PPIs, where we were able to show that PPI-Surfer finds similar potential drug binding regions that do not share sequence and structure similarity. PPI-Surfer is available at https://kiharalab.org/ppi-surfer.