Collagens are the foundational component of diverse tissues, including skin, bone, cartilage, and basement membranes, and are the most abundant protein class in animals. The fibrillar collagens are large, complex, multidomain proteins, all containing the characteristic triple helix motif. The most prevalent collagens are heterotrimeric, meaning that cells express at least two distinctive procollagen polypeptides that must assemble into specific heterotrimer compositions. The molecular mechanisms ensuring correct heterotrimeric assemblies are poorly understood – even for the most common collagen, type-I. The longstanding paradigm is that assembly is controlled entirely by the ~30 kDa globular C-propeptide (C-Pro) domain. Still, this dominating model for procollagen assembly has left many questions unanswered. Here, we show that the C-Pro paradigm is incomplete. In addition to the critical role of the C-Pro domain in templating assembly, we find that the amino acid sequence near the C terminus of procollagen’s triple-helical domain plays an essential role in defining procollagen assembly outcomes. These sequences near the C terminus of the triple-helical domain encode conformationally stabilizing features that ensure only desirable C-Pro-mediated trimeric templates are committed to irreversible triple-helix folding. Incorrect C-Pro trimer assemblies avoid commitment to triple-helix formation thanks to destabilizing features in the amino acid sequences of their triple helix. Incorrect C-Pro assemblies are consequently able to dissociate and search for new binding partners. These findings provide a distinctive perspective on the mechanism of procollagen assembly, revealing the molecular basis by which incorrect homotrimer assemblies are avoided and setting the stage for a deeper understanding of the biogenesis of this ubiquitous protein.
more »
« less
Effects of terminal tripeptide units on mechanical properties of collagen triple helices
Collagen, a vital protein that provides strength to various body tissues, has a triple helix structure containing three polypeptide chains. The chains are composed mostly of a tripeptide of glycine (G), proline (P), and hydroxyproline (O). Using molecular dynamics simulations and theoretical analysis, the study examines the mechanical response of collagen triple helix structures, made up of three different tripeptide units, when subjected to different fracture loading modes. The results show that collagen with GPO tripeptide units at their C-terminal are mechanically stronger than the POG and OGP units with a single amino-acid frame shift. Our work shows that the N-terminal has less effect on collagen fracture than the C-terminal. The differences in mechanical response are explained by the heterogenous rigidity of the amino acid backbone and the resulting shear lag effect near the terminal. The findings have potential applications in developing tough synthetic collagen for building materials and may stimulate further studies on the connection between terminal repeats and the mechanical-thermal behavior of other structural proteins such as silk, elastin, fibrin, and keratin.
more »
« less
- Award ID(s):
- 2145392
- PAR ID:
- 10466884
- Publisher / Repository:
- Elsevier
- Date Published:
- Journal Name:
- Extreme Mechanics Letters
- Volume:
- 64
- Issue:
- C
- ISSN:
- 2352-4316
- Page Range / eLocation ID:
- 102075
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Protein farnesylation is a post-translational modification where a 15-carbon farnesyl isoprenoid is appended to the C-terminal end of a protein by farnesyltransferase (FTase). This process often causes proteins to associate with the membrane and participate in signal transduction pathways. The most common substrates of FTase are proteins that have C-terminal tetrapeptide CaaX box sequences where the cysteine is the site of modification. However, recent work has shown that five amino acid sequences can also be recognized, including the pentapeptides CMIIM and CSLMQ. In this work, peptide libraries were initially used to systematically vary the residues in those two parental sequences using an assay based on Matrix Assisted Laser Desorption Ionization–Mass Spectrometry (MALDI-MS). In addition, 192 pentapeptide sequences from the human proteome were screened using that assay to discover additional extended CaaaX-box motifs. Selected hits from that screening effort were rescreened using an in vivo yeast reporter protein assay. The X-ray crystal structure of CMIIM bound to FTase was also solved, showing that the C-terminal tripeptide of that sequence interacted with the enzyme in a similar manner as the C-terminal tripeptide of CVVM, suggesting that the tripeptide comprises a common structural element for substrate recognition in both tetrapeptide and pentapeptide sequences. Molecular dynamics simulation of CMIIM bound to FTase further shed light on the molecular interactions involved, showing that a putative catalytically competent Zn(II)-thiolate species was able to form. Bioinformatic predictions of tetrapeptide (CaaX-box) reactivity correlated well with the reactivity of pentapeptides obtained from in vivo analysis, reinforcing the importance of the C-terminal tripeptide motif. This analysis provides a structural framework for understanding the reactivity of extended CaaaX-box motifs and a method that may be useful for predicting the reactivity of additional FTase substrates bearing CaaaX-box sequences.more » « less
-
null (Ed.)Assembling peptides allow the creation of structurally complex materials, where amino acid selection influences resulting properties. We present a synergistic approach of experiments and simulations for examining the influence of natural and non-natural amino acid substitutions via incorporation of charged residues and a reactive handle on the thermal stability and assembly of multifunctional collagen mimetic peptides (CMPs). Experimentally, we observed inclusion of charged residues significantly decreased the melting temperature of CMP triple helices with further destabilization upon inclusion of the reactive handle. Atomistic simulations of a single CMP triple helix in explicit water showed increased residue-level and helical structural fluctuations caused by the inclusion of the reactive handle; however, these atomistic simulations cannot be used to predict changes in CMP melting transition. Coarse-grained (CG) simulations of CMPs at experimentally relevant solution conditions, showed, qualitatively, the same trends as experiments in CMP melting transition temperature with CMP design. These simulations show that when charged residues are included electrostatic repulsions significantly destabilize the CMP triple helix and that an additional inclusion of a reactive handle does not significantly change the melting transition. Based on findings from both experiments and simulations, the sequence design was refined for increased CMP triple helix thermal stability, and the reactive handle was utilized for the incorporation of the assembled CMPs within covalently crosslinked hydrogels. Overall, a unique approach was established for predicting stability of CMP triple helices for various sequences prior to synthesis, providing molecular insights for sequence design towards the creation of bulk nanostructured soft biomaterials.more » « less
-
Abstract Cis-peptide bonds are rare in proteins, and building blocks less favorable to the trans-conformer have been considered destabilizing. Although proline tolerates the cis-conformer modestly among all amino acids, for collagen, the most prevalent proline-abundant protein, all peptide bonds must be trans to form its hallmark triple-helix structure. Here, using host-guest collagen mimetic peptides (CMPs), we discover that surprisingly, even the cis-enforcing peptoid residues (N-substituted glycines) form stable triple-helices. Our interrogations establish that these peptoid residues entropically stabilize the triple-helix by pre-organizing individual peptides into a polyproline-II helix. Moreover, noting that the cis-demanding peptoid residues drastically reduce the folding rate, we design a CMP whose triple-helix formation can be controlled by peptoid cis-trans isomerization, enabling direct targeting of fibrotic remodeling in myocardial infarction in vivo. These findings elucidate the principles of peptoid cis-trans isomerization in protein folding and showcase the exploitation of cis-amide-favoring residues in building programmable and functional peptidomimetics.more » « less
-
Flory’s random coil model assumes that conformational fluctuations of amino acid residues in unfolded poly(oligo)peptides and proteins are uncorrelated (isolated pair hypothesis, IPH). This implies that conformational energies, entropies and solvation free energies are all additive. Nearly 25 years ago, analyses of coil libraries cast some doubt on this notion, in that they revealed that aromatic, but also β-branched side chains, could change the 3J(HNHCα) coupling of their neighbors. Since then, multiple bioinformatical, computational and experimental studies have revealed that conformational propensities of amino acids in unfolded peptides and proteins depend on their nearest neighbors. We used recently reported and newly obtained Ramachandran plots of tetra- and pentapeptides with non-terminal homo- and heterosequences of amino acid residues to quantitatively determine nearest neighbor coupling between them with a Ising type model. Results reveal that, depending on the choice of amino acid residue pairs, nearest neighbor interactions either stabilize or destabilize pairs of polyproline II and β-strand conformations. This leads to a redistribution of population between these conformations and a reduction in conformational entropy. Interactions between residues in polyproline II and turn(helix)-forming conformations seem to be cooperative in most cases, but the respective interaction parameters are subject to large statistical errors.more » « less