

Title: Altered dynamics upon oligomerization corresponds to key functional sites
ABSTRACT

Over half of the proteins encoded by most organisms are known to function as oligomeric complexes. Oligomerization confers structural stability and alters protein dynamics. We investigate the effects of oligomerization on protein dynamics and its functional significance for a set of 145 multimeric proteins. Using coarse‐grained elastic network models, we inspect the changes in residue fluctuations upon oligomerization and then compare them with residue conservation scores to identify the functional significance of these changes. Our study reveals conservation of about ½ of the fluctuations, with ¼ of the residues increasing in mobility and ¼ having reduced fluctuations. The residues with dampened fluctuations are evolutionarily more conserved and can serve as orthosteric binding sites, indicating their importance. We also use triosephosphate isomerase as a test case to understand why certain enzymes function only in their oligomeric forms even though the monomer includes all required catalytic residues. To this end, we compare the residue communities (groups of residues whose fluctuations are highly correlated) in the monomeric and dimeric forms of the enzyme. We observe significant changes to the dynamical community architecture of the catalytic core of this enzyme. This change relates to the enzyme's functional mechanism and is seen only in the oligomeric form of the protein, which helps explain why such proteins function as oligomers. Proteins 2017; 85:1422–1434. © 2017 Wiley Periodicals, Inc.
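The comparison of residue fluctuations in a monomer and in an oligomer can be illustrated with a Gaussian network model, one common form of coarse-grained elastic network model. The following is a minimal sketch under stated assumptions, not the authors' pipeline: the 7.3 Å contact cutoff, the toy coordinates, the 10% tolerance, and the function names `gnm_fluctuations` and `classify_changes` are all hypothetical choices made for illustration.

```python
import numpy as np

def gnm_fluctuations(coords, cutoff=7.3):
    """Relative mean-square fluctuation per residue from a Gaussian network model.

    coords: (N, 3) array of residue positions (for example C-alpha atoms).
    cutoff: contact distance in angstroms (assumed value, not from the paper).
    """
    coords = np.asarray(coords, dtype=float)
    # Pairwise distances between all residue positions.
    dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
    # Contact map without self-contacts.
    contact = (dist <= cutoff) & ~np.eye(len(coords), dtype=bool)
    # Kirchhoff matrix: contact counts on the diagonal, -1 for each contact.
    kirchhoff = np.diag(contact.sum(axis=1)) - contact.astype(float)
    vals, vecs = np.linalg.eigh(kirchhoff)
    # Drop the single zero mode (assumes one connected network) and sum the
    # remaining modes; this is the diagonal of the pseudo-inverse.
    return (vecs[:, 1:] ** 2 / vals[1:]).sum(axis=1)

def classify_changes(mono, oligo, tol=0.1):
    """Label each residue by its relative fluctuation change (tol is hypothetical)."""
    rel = (oligo - mono) / mono
    return np.where(rel > tol, "increased",
                    np.where(rel < -tol, "dampened", "unchanged"))

# Toy system: a straight 10-residue chain and a dimer made of two parallel copies.
n = 10
chain_a = np.column_stack([3.8 * np.arange(n), np.zeros(n), np.zeros(n)])
chain_b = chain_a + np.array([0.0, 5.0, 0.0])

mono = gnm_fluctuations(chain_a)
# Fluctuations of chain A when it is part of the dimer network.
oligo = gnm_fluctuations(np.vstack([chain_a, chain_b]))[:n]
labels = classify_changes(mono, oligo)
print(dict(zip(*np.unique(labels, return_counts=True))))
```

With real structures, the coordinates would come from the monomeric and oligomeric forms of the same protein, and the resulting labels could then be compared against per-residue conservation scores.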

NSF-PAR ID:
10026684
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Journal Name:
Proteins: Structure, Function, and Bioinformatics
Volume:
85
Issue:
8
ISSN:
0887-3585
Page Range / eLocation ID:
p. 1422-1434
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Protein functional constraints are manifest as superfamily and functional-subgroup conserved residues, and as pairwise correlations. Deep Analysis of Residue Constraints (DARC) aids the visualization of these constraints, characterizes how they correlate with each other and with structure, and estimates statistical significance. This can identify determinants of protein functional specificity, as we illustrate for bacterial DNA clamp loader ATPases. These load ring-shaped sliding clamps onto DNA to keep polymerase attached during replication and contain one δ, three γ, and one δ’ AAA+ subunits semi-circularly arranged in the order δ-γ123-δ’. Only γ is active, though both γ and δ’ functionally influence an adjacent γ subunit. DARC identifies, as functionally-congruent features linking allosterically the ATP, DNA, and clamp binding sites: residues distinctive of γ and of γ/δ’ that mutually interact in trans, centered on the catalytic base; several γ/δ’-residues and six γ/δ’-covariant residue pairs within the DNA binding N-termini of helices α2 and α3; and γ/δ’-residues associated with the α2 C-terminus and the clamp-binding loop. Most notable is a trans-acting γ/δ’ hydroxyl group that 99% of other AAA+ proteins lack. Mutation of this hydroxyl to a methyl group impedes clamp binding and opening, DNA binding, and ATP hydrolysis—implying a remarkably clamp-loader-specific function.

  2. Human RNA‐binding motif 3 protein (RBM3) is a cold‐shock protein which functions in various aspects of global protein synthesis, cell proliferation and apoptosis by interacting with the components of the basal translational machinery. RBM3 plays important roles in tumour progression and cancer metastasis, and has also been shown to be involved in neuroprotection and the endoplasmic reticulum stress response. Here, we have solved the solution NMR structure of the N‐terminal 84‐residue RNA recognition motif (RRM) of RBM3. The remaining residues are rich in RGG and YGG motifs and are disordered. The RRM domain adopts a βαββαβ topology, which is found in many RNA‐binding proteins. NMR‐monitored titration experiments and molecular dynamics simulations show that the beta‐sheet and two loops form the RNA‐binding interface. Hydrogen bonds and pi–pi and pi–cation contacts are the key interactions between the RNA and the RRM domain. NMR, size exclusion chromatography and chemical cross‐linking experiments show that RBM3 forms oligomers in solution, which is favoured by a decrease in temperature, thus potentially linking oligomerization to its function as a cold‐shock protein. Temperature‐dependent NMR studies revealed that oligomerization of the RRM domain occurs via nonspecific interactions. Overall, this study provides a detailed structural analysis of the RRM domain of RBM3, its interaction with RNA and the molecular basis of its temperature‐dependent oligomerization.

  3. Abstract Summary

    A new dynamic community identifier (DCI) is presented that relies upon protein residue dynamic cross-correlations generated by Gaussian elastic network models to identify those residue clusters exhibiting motions within a protein. A number of examples of communities are shown for diverse proteins, including GPCRs. It is a tool that can immediately simplify and clarify the most essential functional moving parts of any given protein. Proteins usually can be subdivided into groups of residues that move as communities. These are usually densely packed local sub-structures, but in some cases can be physically distant residues identified to be within the same community. The set of these communities for each protein are the moving parts. The ways in which these are organized overall can aid in understanding many aspects of functional dynamics and allostery. DCI enables a more direct understanding of functions including enzyme activity, action across membranes and changes in the community structure from mutations or ligand binding. The DCI server is freely available on a web site (https://dci.bb.iastate.edu/).

    Supplementary information

    Supplementary data are available at Bioinformatics online.

  4. Abstract Motivation

    The analysis of sequence conservation patterns has been widely utilized to identify functionally important (catalytic and ligand-binding) protein residues for over a half-century. Despite decades of development, on average state-of-the-art non-template-based functional residue prediction methods must predict ∼25% of a protein’s total residues to correctly identify half of the protein’s functional site residues. The overwhelming proportion of false positives results in reported ‘F-Scores’ of ∼0.3. We investigated the limits of current approaches, focusing on the so-far neglected impact of the specific choice of homologs included in multiple sequence alignments (MSAs).

    Results

    The limits of conservation-based functional residue prediction were explored by surveying the binding sites of 1023 proteins. A straightforward conservation analysis of MSAs composed of randomly selected homologs sampled from a PSI-BLAST search achieves average F-Scores of ∼0.3, a performance matching that reported by state-of-the-art methods, which often consider additional features for the prediction in a machine learning setting. Interestingly, we found that a simple combinatorial MSA sampling algorithm will in almost every case produce an MSA with an optimal set of homologs whose conservation analysis reaches average F-Scores of ∼0.6, doubling state-of-the-art performance. We also show that this is nearly at the theoretical limit of possible performance given the agreement between different binding site definitions. Additionally, we showcase the progress in this direction made by Selection of Alignment by Maximal Mutual Information (SAMMI), an information-theory-based approach to identifying biologically informative MSAs. This work highlights the importance and the unused potential of optimally composed MSAs for conservation analysis.

    Supplementary information

    Supplementary data are available at Bioinformatics online.

  5. Entropy should directly reflect the extent of disorder in proteins. By clustering structurally related proteins and studying the multiple sequence alignments (MSAs) of the sequences in these clusters, we were able to link sequence, structure, and disorder information. We introduced several parameters as measures of fluctuations at a given MSA site and used these as representative of the sequence and structure entropy at that site. In general, we found a tendency for negative correlations between disorder and structure, and significant positive correlations between disorder and the fluctuations in the system. We also found evidence for residue-type conservation for those residues proximate to potentially disordered sites. Mutation at the disordered site itself appears to be allowed. In addition, we found a positive correlation between disorder and accessible surface area, confirming that disordered residues occur in exposed regions of proteins. Finally, we also found that fluctuations in the dihedral angles at the original mutated residue are positively correlated with disorder, while dihedral angle fluctuations in spatially proximal residues are negatively correlated with disorder. Our results seem to indicate permissible variability in the disordered site, but greater rigidity in the parts of the protein with which the disordered site interacts. This is another indication that disordered residues are involved in protein function.
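The last abstract above treats variability at a multiple sequence alignment site as a form of entropy. One standard way to quantify that idea is the Shannon entropy of each alignment column. The sketch below is an illustration only; it is not the set of parameters the authors introduced, and the toy alignment and the function name `column_entropy` are hypothetical.

```python
import math
from collections import Counter

def column_entropy(msa, gap_chars="-"):
    """Shannon entropy in bits for each column of an alignment, ignoring gaps."""
    entropies = []
    for column in zip(*msa):
        # Count residue types in this column, skipping gap characters.
        counts = Counter(c for c in column if c not in gap_chars)
        total = sum(counts.values())
        if total == 0:
            entropies.append(0.0)  # column made only of gaps
            continue
        h = sum(-(k / total) * math.log2(k / total) for k in counts.values())
        entropies.append(abs(h))  # abs() avoids a cosmetic -0.0
    return entropies

# Toy alignment (hypothetical): four aligned sequences of equal length.
msa = ["ACDE", "ACDF", "AGDG", "A-DH"]
entropies = column_entropy(msa)
print(entropies)
```

A fully conserved column gives an entropy of zero, and a column in which every sequence carries a different residue gives the maximum for that number of sequences.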