

Title: Improved Modeling of Cation‐π and Anion‐Ring Interactions Using the Drude Polarizable Empirical Force Field for Proteins

Cation‐π interactions, noncovalent interactions between a π‐electron system and a positively charged ion, are regarded as strong and are ubiquitous in biological systems. Similarly, though less studied, anion‐ring interactions are present in proteins, along with in‐plane interactions of anions with aromatic rings. Because these interactions occur between a polarizing ion and a polarizable π system, the accuracy of their treatment in molecular dynamics (MD) simulations using additive force fields (FFs) may be limited. In the present work, to allow for a better description of ion‐π interactions in proteins in the Drude‐2013 protein polarizable FF, we systematically optimized the parameters for these interactions against model compound quantum mechanical (QM) interaction energies, using atom pair‐specific Lennard‐Jones parameters together with virtual particles placed at selected ring centroids to target the QM interaction energies and geometries. Subsequently, MD simulations were performed on a series of protein structures where ion‐π pairs occur to evaluate the optimized parameters in the context of the Drude‐2013 FF. The resulting FF leads to a significant improvement in reproducing the ion‐π pair distances observed in experimental protein structures, as well as smaller root‐mean‐square differences and fluctuations of the overall protein structures relative to experimental structures. Accordingly, the optimized Drude‐2013 protein polarizable FF is suggested for use in MD simulations of proteins where cation‐π and anion‐ring interactions are critical. © 2019 Wiley Periodicals, Inc.
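As a rough illustration of what atom pair‐specific Lennard‐Jones parameters mean in practice, the following minimal sketch contrasts the default combination rules (geometric mean of well depths, sum of half-radii, as used in CHARMM-style force fields) with a pair-specific override. The atom type names, numerical values, and function names are hypothetical and are not taken from the Drude‐2013 FF; well depths are treated as positive numbers for simplicity.

```python
import math

def lj_pair_params(type_i, type_j, per_atom, pair_overrides):
    """Return (eps, rmin) for a pair of atom types.

    per_atom maps type -> (well depth, Rmin/2); pair_overrides maps a
    sorted (type, type) tuple -> (eps, rmin) and takes precedence.
    """
    key = tuple(sorted((type_i, type_j)))
    if key in pair_overrides:
        return pair_overrides[key]
    eps_i, rmin_half_i = per_atom[type_i]
    eps_j, rmin_half_j = per_atom[type_j]
    # Default rules: geometric mean for eps, arithmetic sum of half-radii.
    return math.sqrt(eps_i * eps_j), rmin_half_i + rmin_half_j

def lj_energy(r, eps, rmin):
    """12-6 Lennard-Jones energy in the Rmin form: minimum of -eps at r = rmin."""
    x = (rmin / r) ** 6
    return eps * (x * x - 2.0 * x)

# Hypothetical types and values, for illustration only.
per_atom = {"ION": (0.05, 1.4), "CAR": (0.07, 2.0)}
pair_overrides = {("CAR", "ION"): (0.12, 3.2)}

default_eps, default_rmin = lj_pair_params("ION", "CAR", per_atom, {})
tuned_eps, tuned_rmin = lj_pair_params("ION", "CAR", per_atom, pair_overrides)
```

The override changes only the interaction between the two named atom types; every other pair still follows the combination rules, which is why pair-specific terms are a convenient way to target a single class of interaction.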

 
NSF-PAR ID:
10458962
Author(s) / Creator(s):
 ;  
Publisher / Repository:
Wiley Blackwell (John Wiley & Sons)
Date Published:
Journal Name:
Journal of Computational Chemistry
Volume:
41
Issue:
5
ISSN:
0192-8651
Page Range / eLocation ID:
p. 439-448
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Experimentally conducted reactions between CO2 and various substrates (i.e., ethylenediamine (EDA), ethanolamine (ETA), ethylene glycol (EG), mercaptoethanol (ME), and ethylene dithiol (EDT)) are considered in a computational study. The reactions were previously conducted under harsh conditions utilizing toxic metal catalysts. We computationally utilize Brønsted acidic ionic liquid (IL) [Et2NH2]HSO4 as a catalyst aiming to investigate and propose ‘greener’ pathways for future experimental studies. Computations show that EDA is the best to fixate CO2 among the tested substrates: the nucleophilic EDA attack on CO2 is calculated to have a very small energy barrier to overcome (TS1EDA, ΔG‡ = 1.4 kcal mol−1) and form I1EDA (carbamic acid adduct). The formed intermediate is converted to cyclic urea (PEDA, imidazolidin-2-one) via ring closure and dehydration of the concerted transition state (TS2EDA, ΔG‡ = 32.8 kcal mol−1). Solvation model analysis demonstrates that nonpolar solvents (hexane, THF) are better for fixing CO2 with EDA. Attaching electron-donating and -withdrawing groups to EDA does not reduce the energy barriers. Modifying the IL via changing the anion part (HSO4−) central S atom with 6A and 5A group elements (Se, P, and As) shows that a Se-based IL can be utilized for the same purpose. Molecular dynamics (MD) simulations reveal that the IL ion pairs can hold substrates and CO2 molecules via noncovalent interactions to ease nucleophilic attack on CO2.
  2. RNA molecules are highly dynamic and capable of adopting a wide range of complex, folded structures. The factors driving the folding and dynamics of these structures are dependent on a balance of base pairing, hydration, base stacking, ion interactions, and the conformational sampling of the 2′‐hydroxyl group in the ribose sugar. The representation of these features is a challenge for empirical force fields used in molecular dynamics simulations. Toward meeting this challenge, the inclusion of explicit electronic polarization is important in accurately modeling RNA structure. In this work, we present a polarizable force field for RNA based on the classical Drude oscillator model, which represents electronic degrees of freedom via negatively charged particles attached to their parent atoms by harmonic springs. Beginning with parametrization against quantum mechanical base stacking interaction energy and conformational energy data, we have extended the Drude‐2017 nucleic acid force field to include RNA. The conformational sampling of a range of RNA sequences was used to validate the force field, including canonical A‐form RNA duplexes, stem‐loops, and complex tertiary folds that bind multiple Mg2+ ions. Overall, the Drude‐2017 RNA force field reproduces important properties of these structures, including the conformational sampling of the 2′‐hydroxyl and key interactions with Mg2+ ions. © 2018 Wiley Periodicals, Inc.

     
  3. CHARMM‐GUI, http://www.charmm-gui.org, is a web‐based graphical user interface that prepares complex biomolecular systems for molecular simulations. CHARMM‐GUI creates input files for a number of programs including CHARMM, NAMD, GROMACS, AMBER, GENESIS, LAMMPS, Desmond, OpenMM, and CHARMM/OpenMM. Since its original development in 2006, CHARMM‐GUI has been widely adopted for various purposes and now contains a number of different modules designed to set up a broad range of simulations: (1) PDB Reader & Manipulator, Glycan Reader, and Ligand Reader & Modeler for reading and modifying molecules; (2) Quick MD Simulator, Membrane Builder, Nanodisc Builder, HMMM Builder, Monolayer Builder, Micelle Builder, and Hex Phase Builder for building all‐atom simulation systems in various environments; (3) PACE CG Builder and Martini Maker for building coarse‐grained simulation systems; (4) DEER Facilitator and MDFF/xMDFF Utilizer for experimentally guided simulations; (5) Implicit Solvent Modeler, PBEQ‐Solver, and GCMC/BD Ion Simulator for implicit solvent related calculations; (6) Ligand Binder for ligand solvation and binding free energy simulations; and (7) Drude Prepper for preparation of simulations with the CHARMM Drude polarizable force field. Recently, new modules have been integrated into CHARMM‐GUI, such as Glycolipid Modeler for generation of various glycolipid structures, and LPS Modeler for generation of lipopolysaccharide structures from various Gram‐negative bacteria. These new features together with existing modules are expected to facilitate advanced molecular modeling and simulation thereby leading to an improved understanding of the structure and dynamics of complex biomolecular systems. Here, we briefly review these capabilities and discuss potential future directions in the CHARMM‐GUI development project. © 2016 Wiley Periodicals, Inc.

     
  4. Phosphorylation and dephosphorylation of proteins by kinases and phosphatases are central to cellular responses and function. The structural effects of serine and threonine phosphorylation were examined in peptides and in proteins, by circular dichroism, NMR spectroscopy, bioinformatics analysis of the PDB, small-molecule X-ray crystallography, and computational investigations. Phosphorylation of both serine and threonine residues induces substantial conformational restriction in their physiologically more important dianionic forms. Threonine exhibits a particularly strong disorder-to-order transition upon phosphorylation, with dianionic phosphothreonine preferentially adopting a cyclic conformation with restricted φ (φ ~ –60°) stabilized by three noncovalent interactions: a strong intraresidue phosphate-amide hydrogen bond, an n→π* interaction between consecutive carbonyls, and an n→σ* interaction between the phosphate Oγ lone pair and the antibonding orbital of C–Hβ that restricts the χ2 side chain conformation. Proline is unique among the canonical amino acids for its covalent cyclization on the backbone. Phosphothreonine can mimic proline's backbone cyclization via noncovalent interactions. The preferred torsions of dianionic phosphothreonine are φ,ψ = polyproline II helix > α-helix (φ ~ –60°); χ1 = g–; χ2 ~ +115° (eclipsed C–H/O–P bonds). This structural signature is observed in diverse proteins, including in the activation loops of protein kinases and in protein-protein interactions. In total, these results suggest a structural basis for the differential use and evolution of threonine versus serine phosphorylation sites in proteins, with serine phosphorylation typically inducing smaller, rheostat-like changes, versus threonine phosphorylation promoting larger, step function-like switches, in proteins.
  5. This data set for the manuscript entitled "Design of Peptides that Fold and Self-Assemble on Graphite" includes all files needed to run and analyze the simulations described in this manuscript in the molecular dynamics software NAMD, as well as the output of the simulations. The files are organized into directories corresponding to the figures of the main text and supporting information. They include molecular model structure files (NAMD psf or Amber prmtop format), force field parameter files (in CHARMM format), initial atomic coordinates (pdb format), NAMD configuration files, Colvars configuration files, NAMD log files, and NAMD output including restart files (in binary NAMD format) and trajectories in dcd format (downsampled to 10 ns per frame). Analysis is controlled by shell scripts (Bash-compatible) that call VMD Tcl scripts or Python scripts. These scripts and their output are also included.

    Version: 2.0

    Changes versus version 1.0 are the addition of the free energy of folding, adsorption, and pairing calculations (Sim_Figure-7) and shifting of the figure numbers to accommodate this addition.


    Conventions Used in These Files
    ===============================

    Structure Files
    ----------------
    - graph_*.psf or sol_*.psf (original NAMD (XPLOR?) format psf file including atom details (type, charge, mass), as well as definitions of bonds, angles, dihedrals, and impropers for each dipeptide.)

    - graph_*.pdb or sol_*.pdb (initial coordinates before equilibration)
    - repart_*.psf (same as the above psf files, but the masses of non-water hydrogen atoms have been repartitioned by VMD script repartitionMass.tcl)
    - freeTop_*.pdb (same as the above pdb files, but the carbons of the lower graphene layer have been placed at a single z value and marked for restraints in NAMD)
    - amber_*.prmtop (combined topology and parameter files for Amber force field simulations)
    - repart_amber_*.prmtop (same as the above prmtop files, but the masses of non-water hydrogen atoms have been repartitioned by ParmEd)
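The idea behind the repartitioned structure files can be sketched as follows. This is a minimal illustration only: the target hydrogen mass of 3.024 Da, the mass threshold used to detect hydrogens, and the list-based data layout are assumptions, and the actual procedures in repartitionMass.tcl and ParmEd may differ.

```python
def repartition_hydrogen_mass(masses, bonds, is_water, h_target=3.024):
    """Shift mass from heavy atoms onto their bonded non-water hydrogens.

    masses   : list of atomic masses in Da
    bonds    : list of (i, j) index pairs
    is_water : list of booleans, True for atoms that belong to water
    h_target : assumed new hydrogen mass in Da (hypothetical default)
    """
    new_masses = list(masses)
    for i, j in bonds:
        for h, heavy in ((i, j), (j, i)):
            hydrogen = masses[h] < 1.1          # crude hydrogen test
            heavy_partner = masses[heavy] >= 1.1
            if hydrogen and heavy_partner and not is_water[h]:
                delta = h_target - masses[h]
                new_masses[h] += delta          # heavier hydrogen
                new_masses[heavy] -= delta      # lighter parent atom
    return new_masses

# Methyl-like fragment: one carbon bonded to three hydrogens.
methyl = repartition_hydrogen_mass(
    [12.011, 1.008, 1.008, 1.008],
    [(0, 1), (0, 2), (0, 3)],
    [False, False, False, False],
)
```

Because each increment on a hydrogen is subtracted from its parent atom, the total mass of every molecule is unchanged; only the fastest bond vibrations are slowed, which is what allows a longer time step.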

    Force Field Parameters
    ----------------------
    CHARMM format parameter files:
    - par_all36m_prot.prm (CHARMM36m FF for proteins)
    - par_all36_cgenff_no_nbfix.prm (CGenFF v4.4 for graphene) The NBFIX parameters are commented out since they are only needed for aromatic halogens and we use only the CG2R61 type for graphene.
    - toppar_water_ions_prot_cgenff.str (CHARMM water and ions with NBFIX parameters needed for protein and CGenFF included and others commented out)

    Template NAMD Configuration Files
    ---------------------------------
    These contain the most commonly used simulation parameters. They are called by the other NAMD configuration files (which are in the namd/ subdirectory):
    - template_min.namd (minimization)
    - template_eq.namd (NPT equilibration with lower graphene fixed)
    - template_abf.namd (for adaptive biasing force)

    Minimization
    -------------
    - namd/min_*.0.namd

    Equilibration
    -------------
    - namd/eq_*.0.namd

    Adaptive biasing force calculations
    -----------------------------------
    - namd/eabfZRest7_graph_chp1404.0.namd
    - namd/eabfZRest7_graph_chp1404.1.namd (continuation of eabfZRest7_graph_chp1404.0.namd)

    Log Files
    ---------
    For each NAMD configuration file given in the last two sections, there is a log file with the same prefix, which gives the text output of NAMD. For instance, the output of namd/eabfZRest7_graph_chp1404.0.namd is eabfZRest7_graph_chp1404.0.log.

    Simulation Output
    -----------------
    The simulation output files (which match the names of the NAMD configuration files) are in the output/ directory. Files with the extensions .coor, .vel, and .xsc are coordinates in NAMD binary format, velocities in NAMD binary format, and extended system information (including cell size) in text format. Files with the extension .dcd give the trajectory of the atomic coordinates over time (and also include system cell information). Due to storage limitations, large DCD files have been omitted or replaced with new DCD files having the prefix stride50_, which include only every 50th frame. The time between frames in these files is 50 * 50000 steps/frame * 4 fs/step = 10 ns. The system cell trajectory is also included for the NPT runs as output/eq_*.xst.
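The frame spacing quoted above can be checked with a few lines of Python. This is a minimal sketch; the function name is hypothetical, and the values are the ones stated in this README (stride of 50, 50000 steps per saved frame, 4 fs per step).

```python
def time_per_frame_ns(stride, steps_per_frame, timestep_fs):
    """Time between frames of a strided trajectory, in nanoseconds."""
    femtoseconds = stride * steps_per_frame * timestep_fs
    return femtoseconds / 1_000_000  # 1 ns = 1,000,000 fs

strided_ns = time_per_frame_ns(stride=50, steps_per_frame=50000, timestep_fs=4)
original_ns = time_per_frame_ns(stride=1, steps_per_frame=50000, timestep_fs=4)
```

Multiplying a frame index in a stride50_ file by the strided interval gives the simulation time of that frame, which is useful when comparing against the log files that report steps rather than frames.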

    Scripts
    -------
    Files with the .sh extension can be found throughout. These usually provide the highest level control for submission of simulations and analysis. Look to these as a guide to what is happening. If there are scripts with step1_*.sh and step2_*.sh, they are intended to be run in order, with step1_*.sh first.


    CONTENTS
    ========

    The directory contents are as follows. The directories Sim_Figure-1 and Sim_Figure-8 include README.txt files that describe the files and naming conventions used throughout this data set.

    Sim_Figure-1: Simulations of N-acetylated C-amidated amino acids (Ac-X-NHMe) at the graphite–water interface.

    Sim_Figure-2: Simulations of different peptide designs (including acyclic, disulfide cyclized, and N-to-C cyclized) at the graphite–water interface.

    Sim_Figure-3: MM-GBSA calculations of different peptide sequences for a folded conformation and 5 misfolded/unfolded conformations.

    Sim_Figure-4: Simulation of four peptide molecules with the sequence cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) at the graphite–water interface at 370 K.

    Sim_Figure-5: Simulation of four peptide molecules with the sequence cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) at the graphite–water interface at 295 K.

    Sim_Figure-5_replica: Temperature replica exchange molecular dynamics simulations for the peptide cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) with 20 replicas for temperatures from 295 to 454 K.

    Sim_Figure-6: Simulation of the peptide molecule cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) in free solution (no graphite).

    Sim_Figure-7: Free energy calculations for folding, adsorption, and pairing for the peptide CHP1404 (sequence: cyc(GTGSGTG-GPGG-GCGTGTG-SGPG)). For folding, we calculate the PMF as a function of RMSD by replica-exchange umbrella sampling (in the subdirectory Folding_CHP1404_Graphene/). We make the same calculation in solution, which required 3 separate replica-exchange umbrella sampling calculations (in the subdirectory Folding_CHP1404_Solution/). Both PMF of RMSD calculations for the scrambled peptide are in Folding_scram1404/. For adsorption, calculation of the PMF for the orientational restraints and the calculation of the PMF along z (the distance between the graphene sheet and the center of mass of the peptide) are in Adsorption_CHP1404/ and Adsorption_scram1404/. The actual calculation of the free energy is done by a shell script ("doRestraintEnergyError.sh") in the 1_free_energy/ subsubdirectory. Processing of the PMFs must be done first in the 0_pmf/ subsubdirectory. Finally, files for free energy calculations of pair formation for CHP1404 are found in the Pair/ subdirectory.

    Sim_Figure-8: Simulation of four peptide molecules with the sequence cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) where the peptides are far above the graphene–water interface in the initial configuration.

    Sim_Figure-9: Two replicates of a simulation of nine peptide molecules with the sequence cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) at the graphite–water interface at 370 K.

    Sim_Figure-9_scrambled: Two replicates of a simulation of nine peptide molecules with the control sequence cyc(GGTPTTGGGGGGSGGPSGTGGC) at the graphite–water interface at 370 K.

    Sim_Figure-10: Adaptive biasing for calculation of the free energy of the folded peptide as a function of the angle between its long axis and the zigzag directions of the underlying graphene sheet.

     

    This material is based upon work supported by the US National Science Foundation under grant no. DMR-1945589. A majority of the computing for this project was performed on the Beocat Research Cluster at Kansas State University, which is funded in part by NSF grants CHE-1726332, CNS-1006860, EPS-1006860, and EPS-0919443. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation grant number ACI-1548562, through allocation BIO200030. 