Title: The physical and evolutionary energy landscapes of devolved protein sequences corresponding to pseudogenes
Protein evolution is guided by structural, functional, and dynamical constraints ensuring organismal viability. Pseudogenes are genomic sequences identified in many eukaryotes that lack translational activity due to sequence degradation and thus over time have undergone “devolution.” Previously pseudogenized genes sometimes regain their protein-coding function, suggesting they may still encode robust folding energy landscapes despite multiple mutations. We study both the physical folding landscapes of protein sequences corresponding to human pseudogenes using the Associative Memory, Water Mediated, Structure and Energy Model, and the evolutionary energy landscapes obtained using direct coupling analysis (DCA) on their parent protein families. We found that generally mutations that have occurred in pseudogene sequences have disrupted their native global network of stabilizing residue interactions, making it harder for them to fold if they were translated. In some cases, however, energetic frustration has apparently decreased when the functional constraints were removed. We analyzed this unexpected situation for Cyclophilin A, Profilin-1, and Small Ubiquitin-like Modifier 2 Protein. Our analysis reveals that when such mutations in the pseudogene ultimately stabilize folding, at the same time, they likely alter the pseudogenes’ former biological activity, as estimated by DCA. We localize most of these stabilizing mutations generally to normally frustrated regions required for binding to other partners.
Award ID(s):
1943442
PAR ID:
10554688
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
National Academy of Sciences
Date Published:
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
121
Issue:
21
ISSN:
0027-8424
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Proteins are constantly undergoing folding and unfolding transitions, with rates that determine their homeostasis in vivo and modulate their biological function. The ability to optimize these rates without affecting overall native stability is hence highly desirable for protein engineering and design. The great challenge is, however, that mutations generally affect folding and unfolding rates with inversely complementary fractions of the net free energy change they inflict on the native state. Here we address this challenge by targeting the folding transition state (FTS) of chymotrypsin inhibitor 2 (CI2), a very slow and stable two‐state folding protein with an FTS known to be refractory to change by mutation. We first discovered that the CI2's FTS is energetically taxed by the desolvation of several, highly conserved, charges that form a buried salt bridge network in the native structure. Based on these findings, we designed a CI2 variant that bears just four mutations and aims to selectively stabilize the FTS. This variant has >250‐fold faster rates in both directions and hence identical native stability, demonstrating the success of our FTS‐centric design strategy. With an optimized FTS, CI2 also becomes 250‐fold more sensitive to proteolytic degradation by its natural substrate chymotrypsin, and completely loses its activity as inhibitor. These results indicate that CI2 has been selected through evolution to have a very unstable FTS in order to attain the kinetic stability needed to effectively function as protease inhibitor. Moreover, the CI2 case showcases that protein (un)folding rates can critically pivot around a few key residues‐interactions, which can strongly modify the general effects of known structural factors such as domain size and fold topology. 
From a practical standpoint, our results suggest that future efforts should perhaps focus on identifying such critical residues‐interactions in proteins as the best strategy to significantly improve our ability to predict and engineer protein (un)folding rates.
  2. Cellulose, the main component of the plant cell wall, is synthesized by the multimeric cellulose synthase (CESA) complex (CSC). In plant cells, CSCs are assembled in the endoplasmic reticulum or Golgi and transported through the endomembrane system to the plasma membrane (PM). However, how CESA catalytic activity or conserved motifs around the catalytic core influence vesicle trafficking or protein dynamics is not well understood. Here, we used yellow fluorescent protein (YFP)-tagged AtCESA6 and created 18 mutants in key motifs of the catalytic domain to analyze how they affected seedling growth, cellulose biosynthesis, complex formation, and CSC dynamics and trafficking in Arabidopsis thaliana. Seedling growth and cellulose content were reduced by nearly all mutations. Moreover, mutations in most conserved motifs slowed CSC movement in the PM as well as delivery of CSCs to the PM. Interestingly, mutations in the DDG and QXXRW motifs affected YFP-CESA6 abundance in the Golgi. These mutations also perturbed post-Golgi trafficking of CSCs. The 18 mutations were divided into 2 groups based on their phenotypes; we propose that Group I mutations cause CSC trafficking defects, whereas Group II mutations, especially in the QXXRW motif, affect protein folding and/or CSC rosette formation. Collectively, our results demonstrate that the CESA6 catalytic domain is essential for cellulose biosynthesis as well as CSC formation, protein folding and dynamics, and vesicle trafficking.
  3. Packing interaction is a critical driving force in the folding of helical membrane proteins. Despite the importance, packing defects (i.e., cavities including voids, pockets, and pores) are prevalent in membrane-integral enzymes, channels, transporters, and receptors, playing essential roles in function. Then, a question arises regarding how the two competing requirements, packing for stability vs. cavities for function, are reconciled in membrane protein structures. Here, using the intramembrane protease GlpG of Escherichia coli as a model and cavity-filling mutation as a probe, we tested the impacts of native cavities on the thermodynamic stability and function of a membrane protein. We find several stabilizing mutations which induce substantial activity reduction without distorting the active site. Notably, these mutations are all mapped onto the regions of conformational flexibility and functional importance, indicating that the cavities facilitate functional movement of GlpG while compromising the stability. Experiment and molecular dynamics simulation suggest that the stabilization is induced by the coupling between enhanced protein packing and weakly unfavorable lipid desolvation, or solely by favorable lipid solvation on the cavities. Our result suggests that, stabilized by the relatively weak interactions with lipids, cavities are accommodated in membrane proteins without severe energetic cost, which, in turn, serve as a platform to fine-tune the balance between stability and flexibility for optimal activity.
  4. Cells adapt to changing environments. Perturb a cell and it returns to a point of homeostasis. Perturb a population and it evolves toward a fitness peak. We review quantitative models of the forces of adaptation and their visualizations on landscapes. While some adaptations result from single mutations or few-gene effects, others are more cooperative, more delocalized in the genome, and more universal and physical. For example, homeostasis and evolution depend on protein folding and aggregation, energy and protein production, protein diffusion, molecular motor speeds and efficiencies, and protein expression levels. Models provide a way to learn about the fitness of cells and cell populations by making and testing hypotheses.
  5. Background/Objectives: Somatic and genetic mutations in glutathione peroxidases (GPxs), including GPx7 and GPx8, have been linked to intellectual disability, microcephaly, and various tumors. GPx7 and GPx8 evolved the latest among the GPx enzymes and are present in the endoplasmic reticulum. Although lacking a glutathione binding domain, GPx7 and GPx8 possess peroxidase activity that helps the body respond to cellular stress. However, the protein mutations in these peroxidases remain relatively understudied. Methods: By elucidating the structural and stability consequences of missense mutations, this study aims to provide insights into the pathogenic mechanisms involved in different cancers, thereby aiding clinical diagnosis, treatment strategies, and the development of targeted therapies. We performed saturated computational mutagenesis to analyze 2926 and 3971 missense mutations of GPx7 and GPx8, respectively. Results: The results indicate that G153H and G153F in GPx7 are highly destabilizing, while E93M and W142F are stabilizing. In GPx8, N74W and G173W caused the most instability while S70I and S119P increased stability. Our analysis shows that highly destabilizing somatic and genetic mutations are more likely pathogenic compared to stabilizing mutations. Conclusions: This comprehensive analysis of missense mutations in GPx7 and GPx8 provides critical insights into their impact on protein structure and stability, contributing to a deeper understanding of the roles of somatic mutations in cancer development and progression. These findings can inform more precise clinical diagnostics and targeted treatment approaches for cancers. 