

Title: A hybrid coarse-grained model for structure, solvation and assembly of lipid-like peptides
Reconstituted photosynthetic proteins which are activated upon exposure to solar energy hold enormous potential for powering future solid state devices and solar cells. The functionality and integration of these proteins into such devices has been successfully enabled by lipid-like peptides. Yet, a fundamental understanding of the organization of these peptides with respect to the photosynthetic proteins and themselves remains unknown and is critical for guiding the design of such light-activated devices. This study investigates the relative organization of one such peptide sequence V6K2 (V: valine and K: lysine) within assemblies. Given the expansive spatiotemporal scales associated with this study, a hybrid coarse-grained (CG) model which captures the structure, conformation and aggregation of the peptide is adopted. The CG model uses a combination of iterative Boltzmann inversion and force matching to provide insight into the relative organization of V6K2 in assemblies. The CG model reproduces the structure of a V6K2 peptide sequence along with its all-atom (AA) solvation structure. The relative organization of multiple peptides in an assembly, as captured by CG simulations, is in agreement with corresponding results from AA simulations. Also, a backmapping procedure reintroduces the AA details of the peptides within the aggregates captured by the CG model to demonstrate the relative organization of the peptides. Furthermore, a large number of peptides self-assemble into an elongated micelle in the CG simulation, which is consistent with experimental findings. The coarse-graining procedure is tested for transferability to longer peptide sequences, and hence can be extended to other amphiphilic peptide sequences.
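The abstract names iterative Boltzmann inversion (IBI) as one ingredient of the CG model. As a minimal sketch of the textbook IBI update only, and not the authors' implementation, the step below corrects a tabulated pair potential by k_B T ln(g_current / g_target) at each distance bin. The function name, the damping factor, and the choice of kcal/mol units are assumptions made for illustration.

```python
import math

KB = 0.0019872041  # Boltzmann constant in kcal/(mol K); unit choice is an assumption


def ibi_update(potential, g_current, g_target, temperature, damping=1.0, eps=1e-12):
    """Apply one IBI correction to a tabulated pair potential.

    potential, g_current, g_target are equal-length lists sampled on the same
    distance grid. Bins where either RDF is (near) zero are left unchanged,
    because the logarithm is undefined there.
    """
    updated = []
    for v, gc, gt in zip(potential, g_current, g_target):
        if gc > eps and gt > eps:
            # Too much CG structure (gc > gt) raises the potential; too little lowers it.
            v = v + damping * KB * temperature * math.log(gc / gt)
        updated.append(v)
    return updated
```

In practice the update is repeated, re-running the CG simulation between steps, until the CG radial distribution function matches the all-atom target within tolerance.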
Award ID(s):
1654325
NSF-PAR ID:
10317950
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Physical Chemistry Chemical Physics
Volume:
24
Issue:
3
ISSN:
1463-9076
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. This data set for the manuscript entitled "Design of Peptides that Fold and Self-Assemble on Graphite" includes all files needed to run and analyze the simulations described in this manuscript in the molecular dynamics software NAMD, as well as the output of the simulations. The files are organized into directories corresponding to the figures of the main text and supporting information. They include molecular model structure files (NAMD psf or Amber prmtop format), force field parameter files (in CHARMM format), initial atomic coordinates (pdb format), NAMD configuration files, Colvars configuration files, NAMD log files, and NAMD output including restart files (in binary NAMD format) and trajectories in dcd format (downsampled to 10 ns per frame). Analysis is controlled by shell scripts (Bash-compatible) that call VMD Tcl scripts or Python scripts. These scripts and their output are also included.

    Version: 2.0

    Changes versus version 1.0 are the addition of the free energy of folding, adsorption, and pairing calculations (Sim_Figure-7) and shifting of the figure numbers to accommodate this addition.


    Conventions Used in These Files
    ===============================

    Structure Files
    ----------------
    - graph_*.psf or sol_*.psf (original NAMD (XPLOR?) format psf file including atom details (type, charge, mass), as well as definitions of bonds, angles, dihedrals, and impropers for each dipeptide.)

    - graph_*.pdb or sol_*.pdb (initial coordinates before equilibration)
    - repart_*.psf (same as the above psf files, but the masses of non-water hydrogen atoms have been repartitioned by VMD script repartitionMass.tcl)
    - freeTop_*.pdb (same as the above pdb files, but the carbons of the lower graphene layer have been placed at a single z value and marked for restraints in NAMD)
    - amber_*.prmtop (combined topology and parameter files for Amber force field simulations)
    - repart_amber_*.prmtop (same as the above prmtop files, but the masses of non-water hydrogen atoms have been repartitioned by ParmEd)
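The repart_* files above hold hydrogen masses that have been repartitioned. As a rough illustration of what that operation does, and not a reproduction of repartitionMass.tcl or ParmEd, the sketch below raises each selected hydrogen to a target mass and subtracts the same amount from its bonded heavy atom, so the total mass is unchanged. The function name, the data layout, and the 3.024 amu target are assumptions.

```python
def repartition_hydrogen_mass(masses, is_hydrogen, bonds, skip=frozenset(), h_mass=3.024):
    """Return a new mass list with hydrogen mass repartitioning applied.

    masses      -- list of atomic masses (amu)
    is_hydrogen -- list of booleans, True where the atom is a hydrogen
    bonds       -- iterable of (i, j) index pairs
    skip        -- atom indices to leave untouched (e.g. water atoms)
    h_mass      -- target hydrogen mass (assumed value)
    """
    new = list(masses)
    for a, b in bonds:
        if a in skip or b in skip:
            continue  # e.g. water molecules are not repartitioned
        if is_hydrogen[a] == is_hydrogen[b]:
            continue  # only heavy-atom/hydrogen bonds are relevant
        h, heavy = (a, b) if is_hydrogen[a] else (b, a)
        delta = h_mass - new[h]
        new[h] = h_mass        # hydrogen becomes heavier
        new[heavy] -= delta    # bonded heavy atom gives up the same mass
    return new
```

Heavier hydrogens slow the fastest bond vibrations, which is what makes a longer integration time step, such as the 4 fs/step quoted under Simulation Output, feasible.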

    Force Field Parameters
    ----------------------
    CHARMM format parameter files:
    - par_all36m_prot.prm (CHARMM36m FF for proteins)
    - par_all36_cgenff_no_nbfix.prm (CGenFF v4.4 for graphene) The NBFIX parameters are commented out since they are only needed for aromatic halogens and we use only the CG2R61 type for graphene.
    - toppar_water_ions_prot_cgenff.str (CHARMM water and ions with NBFIX parameters needed for protein and CGenFF included and others commented out)

    Template NAMD Configuration Files
    ---------------------------------
    These contain the most commonly used simulation parameters. They are called by the other NAMD configuration files (which are in the namd/ subdirectory):
    - template_min.namd (minimization)
    - template_eq.namd (NPT equilibration with lower graphene fixed)
    - template_abf.namd (for adaptive biasing force)

    Minimization
    -------------
    - namd/min_*.0.namd

    Equilibration
    -------------
    - namd/eq_*.0.namd

    Adaptive biasing force calculations
    -----------------------------------
    - namd/eabfZRest7_graph_chp1404.0.namd
    - namd/eabfZRest7_graph_chp1404.1.namd (continuation of eabfZRest7_graph_chp1404.0.namd)

    Log Files
    ---------
    For each NAMD configuration file given in the last two sections, there is a log file with the same prefix, which gives the text output of NAMD. For instance, the output of namd/eabfZRest7_graph_chp1404.0.namd is eabfZRest7_graph_chp1404.0.log.
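The naming rule in this section can be written as a small helper. This is only a sketch of the convention stated above: the function name is hypothetical, and it returns just the file name because the README does not say which directory holds the logs.

```python
from pathlib import PurePosixPath


def log_name_for(config_path):
    """Map a NAMD configuration file path to the name of its log file.

    Only the final extension (.namd) is swapped for .log; the run index
    (.0, .1, ...) before it is preserved.
    """
    return PurePosixPath(config_path).with_suffix(".log").name


print(log_name_for("namd/eabfZRest7_graph_chp1404.0.namd"))
```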

    Simulation Output
    -----------------
    The simulation output files (which match the names of the NAMD configuration files) are in the output/ directory. Files with the extensions .coor, .vel, and .xsc are coordinates in NAMD binary format, velocities in NAMD binary format, and extended system information (including cell size) in text format. Files with the extension .dcd give the trajectory of the atomic coordinates over time (and also include system cell information). Due to storage limitations, large DCD files have been omitted or replaced with new DCD files having the prefix stride50_, which include only every 50th frame. The time between frames in these files is 50 * 50000 steps/frame * 4 fs/step = 10 ns. The system cell trajectories for the NPT runs are also included as output/eq_*.xst.
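The frame-spacing arithmetic quoted above can be checked with a short calculation. The function and constant names are illustrative; the three input values are the ones stated in this section.

```python
FS_PER_NS = 1_000_000  # femtoseconds per nanosecond


def ns_per_frame(stride, steps_per_frame, fs_per_step):
    """Time between saved frames, in ns, for a downsampled trajectory."""
    return stride * steps_per_frame * fs_per_step / FS_PER_NS


# stride50_ files: every 50th frame, 50000 steps per original frame, 4 fs per step
print(ns_per_frame(50, 50000, 4))  # 10.0
```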

    Scripts
    -------
    Files with the .sh extension can be found throughout. These usually provide the highest level control for submission of simulations and analysis. Look to these as a guide to what is happening. If there are scripts with step1_*.sh and step2_*.sh, they are intended to be run in order, with step1_*.sh first.


    CONTENTS
    ========

    The directory contents are as follows. The directories Sim_Figure-1 and Sim_Figure-8 include README.txt files that describe the files and naming conventions used throughout this data set.

    Sim_Figure-1: Simulations of N-acetylated C-amidated amino acids (Ac-X-NHMe) at the graphite–water interface.

    Sim_Figure-2: Simulations of different peptide designs (including acyclic, disulfide cyclized, and N-to-C cyclized) at the graphite–water interface.

    Sim_Figure-3: MM-GBSA calculations of different peptide sequences for a folded conformation and 5 misfolded/unfolded conformations.

    Sim_Figure-4: Simulation of four peptide molecules with the sequence cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) at the graphite–water interface at 370 K.

    Sim_Figure-5: Simulation of four peptide molecules with the sequence cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) at the graphite–water interface at 295 K.

    Sim_Figure-5_replica: Temperature replica exchange molecular dynamics simulations for the peptide cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) with 20 replicas for temperatures from 295 to 454 K.
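The README gives only the number of replicas and the temperature range, not the spacing between replicas. As an illustration, the sketch below builds a geometrically spaced ladder, which is a common choice for temperature replica exchange; the actual temperatures used in Sim_Figure-5_replica may differ and should be read from the configuration files.

```python
def geometric_ladder(t_min, t_max, n_replicas):
    """Return n_replicas temperatures from t_min to t_max with a constant ratio.

    Geometric spacing is an assumption here, not something stated in the README.
    """
    if n_replicas < 2:
        raise ValueError("need at least two replicas")
    ratio = t_max / t_min
    return [t_min * ratio ** (i / (n_replicas - 1)) for i in range(n_replicas)]


temps = geometric_ladder(295.0, 454.0, 20)
print([round(t, 1) for t in temps])
```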

    Sim_Figure-6: Simulation of the peptide molecule cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) in free solution (no graphite).

    Sim_Figure-7: Free energy calculations for folding, adsorption, and pairing for the peptide CHP1404 (sequence: cyc(GTGSGTG-GPGG-GCGTGTG-SGPG)). For folding, we calculate the PMF as a function of RMSD by replica-exchange umbrella sampling (in the subdirectory Folding_CHP1404_Graphene/). We make the same calculation in solution, which required 3 separate replica-exchange umbrella sampling calculations (in the subdirectory Folding_CHP1404_Solution/). Both PMF of RMSD calculations for the scrambled peptide are in Folding_scram1404/. For adsorption, calculation of the PMF for the orientational restraints and the calculation of the PMF along z (the distance between the graphene sheet and the center of mass of the peptide) are in Adsorption_CHP1404/ and Adsorption_scram1404/. The actual calculation of the free energy is done by a shell script ("doRestraintEnergyError.sh") in the 1_free_energy/ subsubdirectory. Processing of the PMFs must be done first in the 0_pmf/ subsubdirectory. Finally, files for free energy calculations of pair formation for CHP1404 are found in the Pair/ subdirectory.

    Sim_Figure-8: Simulation of four peptide molecules with the sequence cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) where the peptides are far above the graphene–water interface in the initial configuration.

    Sim_Figure-9: Two replicates of a simulation of nine peptide molecules with the sequence cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) at the graphite–water interface at 370 K.

    Sim_Figure-9_scrambled: Two replicates of a simulation of nine peptide molecules with the control sequence cyc(GGTPTTGGGGGGSGGPSGTGGC) at the graphite–water interface at 370 K.

    Sim_Figure-10: Adaptive biasing force calculation of the free energy of the folded peptide as a function of the angle between its long axis and the zigzag directions of the underlying graphene sheet.

     

    This material is based upon work supported by the US National Science Foundation under grant no. DMR-1945589. A majority of the computing for this project was performed on the Beocat Research Cluster at Kansas State University, which is funded in part by NSF grants CHE-1726332, CNS-1006860, EPS-1006860, and EPS-0919443. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation grant number ACI-1548562, through allocation BIO200030. 
  2. Aggregation of misfolded oligomeric amyloid-beta (Aβ) peptides on lipid membranes has been identified as a primary event in Alzheimer's pathogenesis. However, the structural and dynamical features of this membrane assisted Aβ aggregation have not been well characterized. The microscopic characterization of dynamic molecular-level interactions in peptide aggregation pathways has been challenging both computationally and experimentally. In this work, we explore differential patterns of membrane-induced Aβ16–22 (K–L–V–F–F–A–E) aggregation from the microscopic perspective of molecular interactions. Physics-based coarse-grained molecular dynamics (CG-MD) simulations were employed to investigate the effect of lipid headgroup charge – zwitterionic (1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine: POPC) and anionic (1-palmitoyl-2-oleoyl-sn-glycero-3-phospho-L-serine: POPS) – on Aβ16–22 peptide aggregation. Our analyses present an extensive overview of multiple pathways for peptide absorption and biomechanical forces governing peptide folding and aggregation. In agreement with experimental observations, anionic POPS molecules promote extended configurations in Aβ peptides that contribute towards faster emergence of ordered β-sheet-rich peptide assemblies compared to POPC, suggesting faster fibrillation. In addition, lower cumulative rates of peptide aggregation in POPS due to higher peptide–lipid interactions and slower lipid diffusion result in multiple distinct ordered peptide aggregates that can serve as nucleation seeds for subsequent Aβ aggregation. This study provides an in-silico assessment of experimentally observed aggregation patterns, presents new morphological insights and highlights the importance of lipid headgroup chemistry in modulating the peptide absorption and aggregation process.
  3. Protein mimics such as peptoids form self-assembled nanostructures whose shape and function are governed by the side chain chemistry and secondary structure. Experiments have shown that a peptoid sequence with a helical secondary structure assembles into microspheres that are stable under various conditions. The conformation and organization of the peptoids within the assemblies remains unknown and is elucidated in this study via a hybrid, bottom-up coarse-graining approach. The resultant coarse-grained (CG) model preserves the chemical and structural details that are critical for capturing the secondary structure of the peptoid. The CG model accurately captures the overall conformation and solvation of the peptoids in an aqueous solution. Furthermore, the model resolves the assembly of multiple peptoids into a hemispherical aggregate that is in qualitative agreement with the corresponding results from experiments. The mildly hydrophilic peptoid residues are placed along the curved interface of the aggregate. The composition of the residues on the exterior of the aggregate is determined by two conformations adopted by the peptoid chains. Hence, the CG model simultaneously captures sequence-specific features and the assembly of a large number of peptoids. This multiscale, multiresolution coarse-graining approach could help in predicting the organization and packing of other tunable oligomeric sequences of relevance to biomedicine and electronics. 
  4. Injectable hydrogels are attractive for therapeutic delivery because they can be locally administered through minimally-invasive routes. Charge-complementary peptide nanofibers provide hydrogels that are suitable for encapsulation of biotherapeutics, such as cells and proteins, because they assemble under physiological temperature, pH, and ionic strength. However, relationships between the sequences of charge-complementary peptides and the physical properties of the hydrogels that they form are not well understood. Here we show that hydrogel viscoelasticity, pore size, and pore structure depend on the pairing of charge-complementary “CATCH(+/−)” peptides. Oscillatory rheology demonstrated that co-assemblies of CATCH(4+/4−), CATCH(4+/6−), CATCH(6+/4−), and CATCH(6+/6−) formed viscoelastic gels that can recover after high-shear and high-strain disruption, although the extent of recovery depends on the peptide pairing. Cryogenic scanning electron microscopy demonstrated that hydrogel pore size and pore wall also depend on peptide pairing, and that these properties change to different extents after injection. In contrast, no obvious correlation was observed between nanofiber charge state, measured with ζ-potential, and hydrogel physical properties. CATCH(4+/6−) hydrogels injected into the subcutaneous space elicited weak, transient inflammation whereas CATCH(6+/4−) hydrogels induced stronger inflammation. No antibodies were raised against the CATCH(4+) or CATCH(6−) peptides following multiple challenges in vehicle or when co-administered with an adjuvant. These results demonstrate that CATCH(+/−) peptides form biocompatible injectable hydrogels with viscoelastic properties that can be tuned by varying peptide sequence, establishing their potential as carriers for localized delivery of therapeutic cargoes.
  5. Biophysical interactions between proteins and peptides are key determinants of molecular recognition specificity landscapes. However, an understanding of how molecular structure and residue-level energetics at protein−peptide interfaces shape these landscapes remains elusive. We combine information from yeast-based library screening, next-generation sequencing, and structure-based modeling in a supervised machine learning approach to report the comprehensive sequence−energetics−function mapping of the specificity landscape of the hepatitis C virus (HCV) NS3/4A protease, whose function—site-specific cleavages of the viral polyprotein—is a key determinant of viral fitness. We screened a library of substrates in which five residue positions were randomized and measured cleavability of ∼30,000 substrates (∼1% of the library) using yeast display and fluorescence-activated cell sorting followed by deep sequencing. Structure-based models of a subset of experimentally derived sequences were used in a supervised learning procedure to train a support vector machine to predict the cleavability of 3.2 million substrate variants by the HCV protease. The resulting landscape allows identification of previously unidentified HCV protease substrates, and graph-theoretic analyses reveal extensive clustering of cleavable and uncleavable motifs in sequence space. Specificity landscapes of known drug-resistant variants are similarly clustered. The described approach should enable the elucidation and redesign of specificity landscapes of a wide variety of proteases, including human-origin enzymes. Our results also suggest a possible role for residue-level energetics in shaping plateau-like functional landscapes predicted from viral quasispecies theory.
