The design of completely synthetic proteins from first principles— de novo protein design—is challenging. This is because, despite recent advances in computational protein–structure prediction and design, we do not understand fully the sequence-to-structure relationships for protein folding, assembly, and stabilization. Antiparallel 4-helix bundles are amongst the most studied scaffolds for de novo protein design. We set out to re-examine this target, and to determine clear sequence-to-structure relationships, or design rules, for the structure. Our aim was to determine a common and robust sequence background for designing multiple de novo 4-helix bundles. In turn, this could be used in chemical and synthetic biology to direct protein–protein interactions and as scaffolds for functional protein design. Our approach starts by analyzing known antiparallel 4-helix coiled-coil structures to deduce design rules. In terms of the heptad repeat, abcdefg — i.e. , the sequence signature of many helical bundles—the key features that we identify are: a = Leu, d = Ile, e = Ala, g = Gln, and the use of complementary charged residues at b and c. Next, we implement these rules in the rational design of synthetic peptides to form antiparallel homo- and heterotetramers. Finally, we use the sequence of the homotetramer to derive in one step a single-chain 4-helix-bundle protein for recombinant production in E. coli . All of the assembled designs are confirmed in aqueous solution using biophysical methods, and ultimately by determining high-resolution X-ray crystal structures. Our route from peptides to proteins provides an understanding of the role of each residue in each design.
more »
« less
Rationally seeded computational protein design of ɑ-helical barrels
Abstract Computational protein design is advancing rapidly. Here we describe efficient routes starting from validated parallel and antiparallel peptide assemblies to design two families of α-helical barrel proteins with central channels that bind small molecules. Computational designs are seeded by the sequences and structures of defined de novo oligomeric barrel-forming peptides, and adjacent helices are connected by loop building. For targets with antiparallel helices, short loops are sufficient. However, targets with parallel helices require longer connectors; namely, an outer layer of helix–turn–helix–turn–helix motifs that are packed onto the barrels. Throughout these computational pipelines, residues that define open states of the barrels are maintained. This minimizes sequence sampling, accelerating the design process. For each of six targets, just two to six synthetic genes are made for expression inEscherichia coli. On average, 70% of these genes express to give soluble monomeric proteins that are fully characterized, including high-resolution structures for most targets that match the design models with high accuracy.
more »
« less
- Award ID(s):
- 2019598
- PAR ID:
- 10621826
- Publisher / Repository:
- Nature Chemical Biology
- Date Published:
- Journal Name:
- Nature Chemical Biology
- Volume:
- 20
- Issue:
- 8
- ISSN:
- 1552-4450
- Page Range / eLocation ID:
- 991 to 999
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Peptide-based helical barrels are a noteworthy building block for hierarchical assembly, with a hydrophobic cavity that can serve as a host for cargo. In this study, disulfide-stapled helical barrels were synthesized containing ligands for metal ions on the hydrophilic face of each amphiphilic peptide helix. The major product of the disulfide-stapling reaction was found to be composed of five amphiphilic peptides, thereby going from a 16-amino-acid peptide to a stapled 80-residue protein in one step. The structure of this pentamer, 5HB1, was optimized in silico, indicating a significant hydrophobic cavity of ~6 Å within a helical barrel. Metal-ion-promoted assembly of the helical barrel building blocks generated higher order assemblies with a three-dimensional (3D) matrix morphology. The matrix was decorated with hydrophobic dyes and His-tagged proteins both before and after assembly, taking advantage of the hydrophobic pocket within the helical barrels and coordination sites within the metal ion-peptide framework. As such, this peptide-based biomaterial has potential for a number of biotechnology applications, including supplying small molecule and protein growth factors during cell and tissue growth within the matrix.more » « less
-
Abstract Structures at serine‐proline sites in proteins were analyzed using a combination of peptide synthesis with structural methods and bioinformatics analysis of the PDB. Dipeptides were synthesized with the proline derivative (2S,4S)‐(4‐iodophenyl)hydroxyproline [hyp(4‐I‐Ph)]. The crystal structure of Boc‐Ser‐hyp(4‐I‐Ph)‐OMe had two molecules in the unit cell. One molecule exhibitedcis‐proline and a type VIa2 β‐turn (BcisD). Thecis‐proline conformation was stabilized by a C–H/O interaction between Pro C–Hαand the Ser side‐chain oxygen. NMR data were consistent with stabilization ofcis‐proline by a C–H/O interaction in solution. The other crystallographically observed molecule hadtrans‐Pro and both residues in the PPII conformation. Two conformations were observed in the crystal structure of Ac‐Ser‐hyp(4‐I‐Ph)‐OMe, with Ser adopting PPII in one and the β conformation in the other, each with Pro in the δ conformation andtrans‐Pro. Structures at Ser‐Pro sequences were further examined via bioinformatics analysis of the PDB and via DFT calculations. Ser‐Pro versus Ala–Pro sequences were compared to identify bases for Ser stabilization of local structures. C–H/O interactions between the Ser side‐chain Oγand Pro C–Hαwere observed in 45% of structures with Ser‐cis‐Pro in the PDB, with nearly all Ser‐cis‐Pro structures adopting a type VI β‐turn. 53% of Ser‐trans‐Pro sequences exhibited main‐chain COi•••HNi+3or COi•••HNi+4hydrogen bonds, with Ser as theiresidue and Pro as thei + 1 residue. These structures were overwhelmingly either type I β‐turns or N‐terminal capping motifs on α‐helices or 310‐helices. These results indicate that Ser‐Pro sequences are particularly potent in favoring these structures. In each, Ser is in either the PPII or β conformation, with the Ser Oγcapable of engaging in a hydrogen bond with the amide N–H of thei + 2 (type I β‐turn or 310‐helix; Serχ1t) ori + 3 (α‐helix; Serχ1g+) residue. Non‐prolinecisamide bonds can also be stabilized by C–H/O interactions.more » « less
-
The extreme 5′-end of the enterovirus RNA genome contains a conserved cloverleaf-like domain that recruits 3CD and PCBP proteins required for initiating genome replication. Here, we report the crystal structure at 1.9 Å resolution of this domain from the CVB3 genome in complex with an antibody chaperone. The RNA folds into an antiparallel H-type four-way junction comprising four subdomains with co-axially stacked sA-sD and sB-sC helices. Long-range interactions between a conserved A40 in the sC-loop and Py-Py helix within the sD subdomain organize near-parallel orientations of the sA-sB and sC-sD helices. Our NMR studies confirm that these long-range interactions occur in solution and without the chaperone. The phylogenetic analyses indicate that our crystal structure represents a conserved architecture of enteroviral cloverleaf-like domains, including the A40 and Py-Py interactions. The protein binding studies further suggest that the H-shape architecture provides a ready-made platform to recruit 3CD and PCBP2 for viral replication.more » « less
-
a) Abstract DnaK is a prokaryotic Hsp70 chaperone, with numerous functions in helping to fold nascent polypeptides and more generally in proteostasis. It also restores native structures to heat-shocked proteins in an ATP-hydrolysis-dependent manner. The structures of DnaK complexes with nucleotides, co-chaperones andsmallpeptides have already been resolved. However, there are no structures of DnaK complexes with larger, mostly folded substrates, such as firefly luciferase (Fluc, 61 kDa), which impedes the understanding of the mechanism through which DnaK refolds such large proteins. Here, we generated a model of a DnaK-firefly luciferase complex with Alphafold3, and examined its dynamics with all-atom molecular dynamics simulations. In this complex, Fluc is immobilized under the DnaK alpha-helical lid against the NBD, not the SBDβ, contrary to the data reported in the literature for model peptides. The DnaK lid is positioned strategically over Fluc’s helix 405-411, which we recently determined to be the first (and likely the only) helix melted in Fluc at 42 °C. We simulated the interaction between DnaK and the helix in its native and misfolded state and found that during the lid translocation toward the SBDβ, only the melted helix follows the lid and is actively pulled out from Fluc, while the native helix is not dislocated. These observations suggest a new model for the DnaK chaperone mechanism, where the alpha helical lid forms hydrogen bonds to the protein segment to be structurally tested. Lid pulls out only highly deformable misfolded helices, allowing them to refold into their native structures, and does not pull out those that are correctly folded because they are not deformable. Broader Audience Statementc) DnaK is a model chaperone, which can reactivate thermally denatured proteins. Even though a plethora of significant findings about DnaK structure, dynamics and interactions with its co-chaperone have been accumulated over 30 years, the exact molecular mechanism by which DnaK refolds misfolded proteins remains a mystery. This work exploited the ability of the Alphafold3 platform to generate an atomistic model for a complex between DnaK and Firefly luciferase and used molecular dynamics simulations to directly capture how DnaK may assist denatured proteins by mechanically pulling out their misfolded helices. This study provides a new insight into the DnaK mechanism.more » « less
An official website of the United States government

