skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Engineering gain‐of‐function mutants of a WW domain by dynamics and structural analysis
Abstract Proteins gain optimal fitness such as foldability and function through evolutionary selection. However, classical studies have found that evolutionarily designed protein sequences alone cannot guarantee foldability, or at least not without considering local contacts associated with the initial folding steps. We previously showed that foldability and function can be restored by removing frustration in the folding energy landscape of a model WW domain protein, CC16, which was designed based on Statistical Coupling Analysis (SCA). Substitutions ensuring the formation of five local contacts identified as “on‐path” were selected using the closest homolog native folded sequence, N21. Surprisingly, the resulting sequence, CC16‐N21, bound to Group I peptides, while N21 did not. Here, we identified single‐point mutations that enable N21 to bind a Group I peptide ligand through structure and dynamic‐based computational design. Comparison of the docked position of the CC16‐N21/ligand complex with the N21 structure showed that residues at positions 9 and 19 are important for peptide binding, whereas the dynamic profiles identified position 10 as allosterically coupled to the binding site and exhibiting different dynamics between N21 and CC16‐N21. We found that swapping these positions in N21 with matched residues from CC16‐N21 recovers nature‐like binding affinity to N21. This study validates the use of dynamic profiles as guiding principles for affecting the binding affinity of small proteins.  more » « less
Award ID(s):
1901709
PAR ID:
10451573
Author(s) / Creator(s):
 ;  ;  ;  ;  ;  ;  
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
Protein Science
Volume:
32
Issue:
9
ISSN:
0961-8368
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract AlphaFold2 has revolutionized protein structure prediction from amino‐acid sequence. In addition to protein structures, high‐resolution dynamics information about various protein regions is important for understanding protein function. Although AlphaFold2 has neither been designed nor trained to predict protein dynamics, it is shown here how the information returned by AlphaFold2 can be used to predict dynamic protein regions at the individual residue level. The approach, which is termed cdsAF2, uses the 3D protein structure returned by AlphaFold2 to predict backbone NMR NHS2order parameters using a local contact model that takes into account the contacts made by each peptide plane along the backbone with its environment. By combining for each residue AlphaFold2's pLDDT confidence score for the structure prediction accuracy with the predictedS2value using the local contact model, an estimator is obtained that semi‐quantitatively captures many of the dynamics features observed in experimental backbone NMR NHS2order parameter profiles. The method is demonstrated for a set nine proteins of different sizes and variable amounts of dynamics and disorder. 
    more » « less
  2. Abstract Protein–protein interactions that involve recognition of short peptides are critical in cellular processes. Protein–peptide interaction surface areas are relatively small and shallow, and there are often overlapping specificities in families of peptide‐binding domains. Therefore, dissecting selectivity determinants can be challenging. PDZ domains are a family of peptide‐binding domains located in several intracellular signaling and trafficking pathways. These domains are also directly targeted by pathogens, and a hallmark of many oncogenic viral proteins is a PDZ‐binding motif. However, amidst sequences that target PDZ domains, there is a wide spectrum in relative promiscuity. For example, the viral HPV16 E6 oncoprotein recognizes over double the number of PDZ domain‐containing proteins as the cystic fibrosis transmembrane conductance regulator (CFTR) in the cell, despite similar PDZ targeting‐sequences and identical motif residues. Here, we determine binding affinities for PDZ domains known to bind either HPV16 E6 alone or both CFTR and HPV16 E6, using peptides matching WT and hybrid sequences. We also use energy minimization to model PDZ–peptide complexes and use sequence analyses to investigate this difference. We find that while the majority of single mutations had marginal effects on overall affinity, the additive effect on the free energy of binding accurately describes the selectivity observed. Taken together, our results describe how complex and differing PDZ interactomes can be programmed in the cell. 
    more » « less
  3. Abstract Charged residues on the surface of proteins are critical for both protein stability and interactions. However, many proteins contain binding regions with a high net charge that may destabilize the protein but are useful for binding to oppositely charged targets. We hypothesized that these domains would be marginally stable, as electrostatic repulsion would compete with favorable hydrophobic collapse during folding. Furthermore, by increasing the salt concentration, we predict that these protein folds would be stabilized by mimicking some of the favorable electrostatic interactions that take place during target binding. We varied the salt and urea concentrations to probe the contributions of electrostatic and hydrophobic interactions for the folding of the yeast SH3 domain found in Abp1p. The SH3 domain was significantly stabilized with increased salt concentrations due to Debye–Huckel screening and a nonspecific territorial ion‐binding effect. Molecular dynamics and NMR show that sodium ions interact with all 15 acidic residues but do little to change backbone dynamics or overall structure. Folding kinetics experiments show that the addition of urea or salt primarily affects the folding rate, indicating that almost all the hydrophobic collapse and electrostatic repulsion occur in the transition state. After the transition state formation, modest yet favorable short‐range salt bridges are formed along with hydrogen bonds, as the native state fully folds. Thus, hydrophobic collapse offsets electrostatic repulsion to ensure this highly charged binding domain can still fold and be ready to bind to its charged peptide targets, a property that is likely evolutionarily conserved over 1 billion years. 
    more » « less
  4. Abstract Chaperones are essential to the co-translational folding of most proteins. However, the principles of co-translational chaperone interaction throughout the proteome are poorly understood, as current methods are restricted to few substrates and cannot capture nascent protein folding or chaperone binding sites, precluding a comprehensive understanding of productive and erroneous protein biosynthesis. Here, by integrating genome-wide selective ribosome profiling, single-molecule tools, and computational predictions using AlphaFold we show that the binding of the mainE. colichaperones involved in co-translational folding, Trigger Factor (TF) and DnaK correlates with “unsatisfied residues” exposed on nascent partial folds – residues that have begun to form tertiary structure but cannot yet form all native contacts due to ongoing translation. This general principle allows us to predict their co-translational binding across the proteome based on sequence only, which we verify experimentally. The results show that TF and DnaK stably bind partially folded rather than unfolded conformers. They also indicate a synergistic action of TF guiding intra-domain folding and DnaK preventing premature inter-domain contacts, and reveal robustness in the larger chaperone network (TF, DnaK, GroEL). Given the complexity of translation, folding, and chaperone functions, our predictions based on general chaperone binding rules indicate an unexpected underlying simplicity. 
    more » « less
  5. Hydrogen bonds (HB)s are the most abundant motifs in biological systems. They play a key role in determining protein–ligand binding affinity and selectivity. We designed two pharmaceutically beneficial HB databases, database A including ca. 12,000 protein–ligand complexes with ca. 22,000 HBs and their geometries, and database B including ca. 400 protein–ligand complexes with ca. 2200 HBs, their geometries, and bond strengths determined via our local vibrational mode analysis. We identified seven major HB patterns, which can be utilized as a de novo QSAR model to predict the binding affinity for a specific protein–ligand complex. Glycine was reported as the most abundant amino acid residue in both donor and acceptor profiles, and N–H⋯O was the most frequent HB type found in database A. HBs were preferred to be in the linear range, and linear HBs were identified as the strongest. HBs with HB angles in the range of 100–110°, typically forming intramolecular five-membered ring structures, showed good hydrophobic properties and membrane permeability. Utilizing database B, we found a generalized Badger’s relationship for more than 2200 protein–ligand HBs. In addition, the strength and occurrence maps between each amino acid residue and ligand functional groups open an attractive possibility for a novel drug-design approach and for determining drug selectivity and affinity, and they can also serve as an important tool for the hit-to-lead process. 
    more » « less