The design of completely synthetic proteins from first principles— de novo protein design—is challenging. This is because, despite recent advances in computational protein–structure prediction and design, we do not understand fully the sequence-to-structure relationships for protein folding, assembly, and stabilization. Antiparallel 4-helix bundles are amongst the most studied scaffolds for de novo protein design. We set out to re-examine this target, and to determine clear sequence-to-structure relationships, or design rules, for the structure. Our aim was to determine a common and robust sequence background for designing multiple de novo 4-helix bundles. In turn, this could be used in chemical and synthetic biology to direct protein–protein interactions and as scaffolds for functional protein design. Our approach starts by analyzing known antiparallel 4-helix coiled-coil structures to deduce design rules. In terms of the heptad repeat, abcdefg — i.e. , the sequence signature of many helical bundles—the key features that we identify are: a = Leu, d = Ile, e = Ala, g = Gln, and the use of complementary charged residues at b and c. Next, we implement these rules in the rational design of synthetic peptides to form antiparallel homo- and heterotetramers. Finally, we use the sequence of the homotetramer to derive in one step a single-chain 4-helix-bundle protein for recombinant production in E. coli . All of the assembled designs are confirmed in aqueous solution using biophysical methods, and ultimately by determining high-resolution X-ray crystal structures. Our route from peptides to proteins provides an understanding of the role of each residue in each design.
more »
« less
Rationally seeded computational protein design of ɑ-helical barrels
Abstract Computational protein design is advancing rapidly. Here we describe efficient routes starting from validated parallel and antiparallel peptide assemblies to design two families of α-helical barrel proteins with central channels that bind small molecules. Computational designs are seeded by the sequences and structures of defined de novo oligomeric barrel-forming peptides, and adjacent helices are connected by loop building. For targets with antiparallel helices, short loops are sufficient. However, targets with parallel helices require longer connectors; namely, an outer layer of helix–turn–helix–turn–helix motifs that are packed onto the barrels. Throughout these computational pipelines, residues that define open states of the barrels are maintained. This minimizes sequence sampling, accelerating the design process. For each of six targets, just two to six synthetic genes are made for expression inEscherichia coli. On average, 70% of these genes express to give soluble monomeric proteins that are fully characterized, including high-resolution structures for most targets that match the design models with high accuracy.
more »
« less
- Award ID(s):
- 2019598
- PAR ID:
- 10621826
- Publisher / Repository:
- Nature Chemical Biology
- Date Published:
- Journal Name:
- Nature Chemical Biology
- Volume:
- 20
- Issue:
- 8
- ISSN:
- 1552-4450
- Page Range / eLocation ID:
- 991 to 999
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Peptide-based helical barrels are a noteworthy building block for hierarchical assembly, with a hydrophobic cavity that can serve as a host for cargo. In this study, disulfide-stapled helical barrels were synthesized containing ligands for metal ions on the hydrophilic face of each amphiphilic peptide helix. The major product of the disulfide-stapling reaction was found to be composed of five amphiphilic peptides, thereby going from a 16-amino-acid peptide to a stapled 80-residue protein in one step. The structure of this pentamer, 5HB1, was optimized in silico, indicating a significant hydrophobic cavity of ~6 Å within a helical barrel. Metal-ion-promoted assembly of the helical barrel building blocks generated higher order assemblies with a three-dimensional (3D) matrix morphology. The matrix was decorated with hydrophobic dyes and His-tagged proteins both before and after assembly, taking advantage of the hydrophobic pocket within the helical barrels and coordination sites within the metal ion-peptide framework. As such, this peptide-based biomaterial has potential for a number of biotechnology applications, including supplying small molecule and protein growth factors during cell and tissue growth within the matrix.more » « less
-
Abstract Structures at serine‐proline sites in proteins were analyzed using a combination of peptide synthesis with structural methods and bioinformatics analysis of the PDB. Dipeptides were synthesized with the proline derivative (2S,4S)‐(4‐iodophenyl)hydroxyproline [hyp(4‐I‐Ph)]. The crystal structure of Boc‐Ser‐hyp(4‐I‐Ph)‐OMe had two molecules in the unit cell. One molecule exhibitedcis‐proline and a type VIa2 β‐turn (BcisD). Thecis‐proline conformation was stabilized by a C–H/O interaction between Pro C–Hαand the Ser side‐chain oxygen. NMR data were consistent with stabilization ofcis‐proline by a C–H/O interaction in solution. The other crystallographically observed molecule hadtrans‐Pro and both residues in the PPII conformation. Two conformations were observed in the crystal structure of Ac‐Ser‐hyp(4‐I‐Ph)‐OMe, with Ser adopting PPII in one and the β conformation in the other, each with Pro in the δ conformation andtrans‐Pro. Structures at Ser‐Pro sequences were further examined via bioinformatics analysis of the PDB and via DFT calculations. Ser‐Pro versus Ala–Pro sequences were compared to identify bases for Ser stabilization of local structures. C–H/O interactions between the Ser side‐chain Oγand Pro C–Hαwere observed in 45% of structures with Ser‐cis‐Pro in the PDB, with nearly all Ser‐cis‐Pro structures adopting a type VI β‐turn. 53% of Ser‐trans‐Pro sequences exhibited main‐chain COi•••HNi+3or COi•••HNi+4hydrogen bonds, with Ser as theiresidue and Pro as thei + 1 residue. These structures were overwhelmingly either type I β‐turns or N‐terminal capping motifs on α‐helices or 310‐helices. These results indicate that Ser‐Pro sequences are particularly potent in favoring these structures. In each, Ser is in either the PPII or β conformation, with the Ser Oγcapable of engaging in a hydrogen bond with the amide N–H of thei + 2 (type I β‐turn or 310‐helix; Serχ1t) ori + 3 (α‐helix; Serχ1g+) residue. Non‐prolinecisamide bonds can also be stabilized by C–H/O interactions.more » « less
-
a) Abstract DnaK is a prokaryotic Hsp70 chaperone, with numerous functions in helping to fold nascent polypeptides and more generally in proteostasis. It also restores native structures to heat-shocked proteins in an ATP-hydrolysis-dependent manner. The structures of DnaK complexes with nucleotides, co-chaperones andsmallpeptides have already been resolved. However, there are no structures of DnaK complexes with larger, mostly folded substrates, such as firefly luciferase (Fluc, 61 kDa), which impedes the understanding of the mechanism through which DnaK refolds such large proteins. Here, we generated a model of a DnaK-firefly luciferase complex with Alphafold3, and examined its dynamics with all-atom molecular dynamics simulations. In this complex, Fluc is immobilized under the DnaK alpha-helical lid against the NBD, not the SBDβ, contrary to the data reported in the literature for model peptides. The DnaK lid is positioned strategically over Fluc’s helix 405-411, which we recently determined to be the first (and likely the only) helix melted in Fluc at 42 °C. We simulated the interaction between DnaK and the helix in its native and misfolded state and found that during the lid translocation toward the SBDβ, only the melted helix follows the lid and is actively pulled out from Fluc, while the native helix is not dislocated. These observations suggest a new model for the DnaK chaperone mechanism, where the alpha helical lid forms hydrogen bonds to the protein segment to be structurally tested. Lid pulls out only highly deformable misfolded helices, allowing them to refold into their native structures, and does not pull out those that are correctly folded because they are not deformable. Broader Audience Statementc) DnaK is a model chaperone, which can reactivate thermally denatured proteins. Even though a plethora of significant findings about DnaK structure, dynamics and interactions with its co-chaperone have been accumulated over 30 years, the exact molecular mechanism by which DnaK refolds misfolded proteins remains a mystery. This work exploited the ability of the Alphafold3 platform to generate an atomistic model for a complex between DnaK and Firefly luciferase and used molecular dynamics simulations to directly capture how DnaK may assist denatured proteins by mechanically pulling out their misfolded helices. This study provides a new insight into the DnaK mechanism.more » « less
-
Trent, M Stephen; Konovalova, Anna (Ed.)ABSTRACT Almost all integral membrane proteins that reside in the outer membrane (OM) of gram-negative bacteria contain a closed amphipathic β sheet (“β barrel”) that serves as a membrane anchor. The membrane integration of β barrel structures is catalyzed by a highly conserved heterooligomer called thebarrelassemblymachine (BAM). Although charged residues that are exposed to the lipid bilayer are infrequently found in outer membrane protein β barrels, the β barrels of OmpC/OmpF-type trimeric porins produced by Enterobacterales contain multiple conserved lipid-facing basic residues located near the extracellular side of the OM. Here, we show that these residues are required for the efficient insertion of theEscherichia coliOmpC protein into the OMin vivo. We found that the mutation of multiple basic residues to glutamine or alanine slowed insertion and reduced insertion efficiency. Furthermore, molecular dynamics simulations provided evidence that the basic residues promote the formation of hydrogen bonds and salt bridges with lipopolysaccharide, a unique glycolipid located exclusively in the outer leaflet of the OM. Taken together, our results support a model in which hydrophilic interactions between OmpC and LPS help to anchor the protein in the OM when the local environment is perturbed by BAM during membrane insertion and suggest a surprising role for membrane lipids in the insertion reaction.IMPORTANCEThe assembly (folding and membrane insertion) of bacterial outer membrane proteins (OMPs) is an essential cellular process that is a potential target for novel antibiotics. A heterooligomer called thebarrelassemblymachine (BAM) plays a major role in catalyzing OMP assembly. Here, we show that a group of highly conserved lipid-facing basic residues inEscherichia coliOmpC, a member of a major family of abundant OMPs known as trimeric porins, is required for the efficient integration of the protein into the outer membrane (OM). Based on our work and previous studies, we propose that the basic residues form interactions with a unique OM lipid (lipopolysaccharide) that promotes the insertion reaction. Our results provide strong evidence that interactions between specific membrane lipids and at least a subset of OMPs are required to supplement the activity of BAM and facilitate the integration of the proteins into the membrane.more » « less
An official website of the United States government

