Title: Theory and simulations for RNA folding in mixtures of monovalent and divalent cations
RNA molecules cannot fold in the absence of counterions. Experiments are typically performed in the presence of monovalent and divalent cations. How to treat the impact of a solution containing a mixture of both ion types on RNA folding has remained a challenging problem for decades. By exploiting the large concentration difference between divalent and monovalent ions used in experiments, we develop a theory based on the reference interaction site model (RISM), which allows us to treat divalent cations explicitly while keeping the implicit screening effect due to monovalent ions. Our theory captures both the inner shell and outer shell coordination of divalent cations to phosphate groups, which we demonstrate is crucial for an accurate calculation of RNA folding thermodynamics. The RISM theory for ion–phosphate interactions when combined with simulations based on a transferable coarse-grained model allows us to predict accurately the folding of several RNA molecules in a mixture containing monovalent and divalent ions. The calculated folding free energies and ion-preferential coefficients for RNA molecules (pseudoknots, a fragment of the rRNA, and the aptamer domain of the adenine riboswitch) are in excellent agreement with experiments over a wide range of monovalent and divalent ion concentrations. Because the theory is general, it can be readily used to investigate ion and sequence effects on DNA properties.
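As a rough illustration of what implicit screening by monovalent ions means, the sketch below computes the Debye screening length of a 1:1 salt. It assumes a Debye-Hückel picture, which the abstract does not state explicitly; the function name `debye_length_nm` and the dielectric constant of 78.4 for water are illustrative assumptions, not values taken from the paper.

```python
import math

# Physical constants (SI units)
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
KB = 1.380649e-23            # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19   # elementary charge, C
N_A = 6.02214076e23          # Avogadro constant, 1/mol


def debye_length_nm(conc_molar, temperature=298.15, eps_r=78.4):
    """Debye length (nm) of a 1:1 salt at the given molar concentration.

    eps_r = 78.4 is an assumed dielectric constant for water near 25 C.
    """
    # For a 1:1 salt the ionic strength equals the concentration.
    ionic_strength = conc_molar * 1000.0  # mol/L -> mol/m^3
    kappa_sq = (2.0 * N_A * E_CHARGE**2 * ionic_strength
                / (EPS0 * eps_r * KB * temperature))
    return 1e9 / math.sqrt(kappa_sq)  # m -> nm


for c in (0.01, 0.1, 1.0):
    print(f"{c:5.2f} M -> {debye_length_nm(c):.2f} nm")
```

The screening length shrinks as the monovalent salt concentration rises, which is why electrostatic repulsion between phosphate groups is weaker at high salt.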
Award ID(s):
1900093
NSF-PAR ID:
10188687
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
116
Issue:
42
ISSN:
0027-8424
Page Range / eLocation ID:
21022 to 21030
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Folding of ribozymes into well-defined tertiary structures usually requires divalent cations. How Mg2+ ions direct the folding kinetics has been a long-standing unsolved problem because experiments cannot detect the positions and dynamics of ions. To address this problem, we used molecular simulations to dissect the folding kinetics of the Azoarcus ribozyme by monitoring the path each molecule takes to reach the folded state. We quantitatively establish that Mg2+ binding to specific sites, coupled with counter-ion release of monovalent cations, stimulates the formation of secondary and tertiary structures, leading to diverse pathways that include direct rapid folding and trapping in misfolded structures. In some molecules, key tertiary structural elements form when Mg2+ ions bind to specific RNA sites at the earliest stages of folding, leading to specific collapse and rapid folding. In others, the formation of non-native base pairs, whose rearrangement is needed to reach the folded state, is the rate-limiting step. Escape from energetic traps, driven by thermal fluctuations, occurs readily. In contrast, the transition to the native state from long-lived topologically trapped native-like metastable states is extremely slow. Specific collapse and formation of energetically or topologically frustrated states occur early in the assembly process.

     
  2. Electric double layers are crucial to energy storage and electrocatalytic device performance. While double layer formation originates in electrostatic interactions, electric double layer properties are governed by a balance of both electrostatic and entropic driving forces; favorable ion-surface electrostatic interactions attract counterions to charged surfaces to compensate, or "screen," potentials, but the confinement of these same ions from a bulk reservoir to the interface incurs an entropic penalty. Here, we use a dicationic imidazolium ionic liquid and its monovalent analogue to explore how cation valence and entropy influence double layer formation and electrochemical reactivity using CO2 electroreduction as a model reaction. We find that divalent and monovalent cations display similar CO2 reduction kinetics but differ vastly in steady-state reactivity due to rapid electrochemically induced precipitation of insulating dicationic (bi)carbonate films. Using in situ surface-enhanced Raman scattering spectroscopy, we find that potential-dependent reorientation occurs at similar potentials between the two ionic liquids, but the introduction of a covalent link in the divalent cation imparts a more ordered double layer structure that favors (bi)carbonate precipitation. In mixed monovalent-divalent electrolytes, we find that the divalent cations dominate interfacial properties by preferentially accumulating at surfaces even at very low relative concentrations. Our findings confirm that ion entropy plays a key role in modulating local electrochemical environments and highlight how double layer properties are very sensitive to the properties of counterions that pay the lowest entropic penalty to accumulate at interfaces. Overall, we illustrate that ion entropy provides a new knob to tune reaction microenvironments and unveil how entropy plays a major role in modulating electrochemical reactivity in mixed ion electrolytes. 
  3. This data set for the manuscript entitled "Design of Peptides that Fold and Self-Assemble on Graphite" includes all files needed to run and analyze the simulations described in the manuscript in the molecular dynamics software NAMD, as well as the output of the simulations. The files are organized into directories corresponding to the figures of the main text and supporting information. They include molecular model structure files (NAMD psf or Amber prmtop format), force field parameter files (in CHARMM format), initial atomic coordinates (pdb format), NAMD configuration files, Colvars configuration files, NAMD log files, and NAMD output including restart files (in binary NAMD format) and trajectories in dcd format (downsampled to 10 ns per frame). Analysis is controlled by shell scripts (Bash-compatible) that call VMD Tcl scripts or Python scripts. These scripts and their output are also included.

    Version: 2.0

    Changes versus version 1.0 are the addition of the free energy of folding, adsorption, and pairing calculations (Sim_Figure-7) and shifting of the figure numbers to accommodate this addition.


    Conventions Used in These Files
    ===============================

    Structure Files
    ----------------
    - graph_*.psf or sol_*.psf (original NAMD (XPLOR?) format psf file including atom details (type, charge, mass), as well as definitions of bonds, angles, dihedrals, and impropers for each dipeptide.)

    - graph_*.pdb or sol_*.pdb (initial coordinates before equilibration)
    - repart_*.psf (same as the above psf files, but the masses of non-water hydrogen atoms have been repartitioned by VMD script repartitionMass.tcl)
    - freeTop_*.pdb (same as the above pdb files, but the carbons of the lower graphene layer have been placed at a single z value and marked for restraints in NAMD)
    - amber_*.prmtop (combined topology and parameter files for Amber force field simulations)
    - repart_amber_*.prmtop (same as the above prmtop files, but the masses of non-water hydrogen atoms have been repartitioned by ParmEd)

    Force Field Parameters
    ----------------------
    CHARMM format parameter files:
    - par_all36m_prot.prm (CHARMM36m FF for proteins)
    - par_all36_cgenff_no_nbfix.prm (CGenFF v4.4 for graphene) The NBFIX parameters are commented out since they are only needed for aromatic halogens and we use only the CG2R61 type for graphene.
    - toppar_water_ions_prot_cgenff.str (CHARMM water and ions with NBFIX parameters needed for protein and CGenFF included and others commented out)

    Template NAMD Configuration Files
    ---------------------------------
    These contain the most commonly used simulation parameters. They are called by the other NAMD configuration files (which are in the namd/ subdirectory):
    - template_min.namd (minimization)
    - template_eq.namd (NPT equilibration with lower graphene fixed)
    - template_abf.namd (for adaptive biasing force)

    Minimization
    -------------
    - namd/min_*.0.namd

    Equilibration
    -------------
    - namd/eq_*.0.namd

    Adaptive biasing force calculations
    -----------------------------------
    - namd/eabfZRest7_graph_chp1404.0.namd
    - namd/eabfZRest7_graph_chp1404.1.namd (continuation of eabfZRest7_graph_chp1404.0.namd)

    Log Files
    ---------
    For each NAMD configuration file given in the last two sections, there is a log file with the same prefix, which gives the text output of NAMD. For instance, the output of namd/eabfZRest7_graph_chp1404.0.namd is eabfZRest7_graph_chp1404.0.log.

    Simulation Output
    -----------------
    The simulation output files (which match the names of the NAMD configuration files) are in the output/ directory. Files with the extensions .coor, .vel, and .xsc are coordinates in NAMD binary format, velocities in NAMD binary format, and extended system information (including cell size) in text format. Files with the extension .dcd give the trajectory of the atomic coordinates over time (and also include system cell information). Due to storage limitations, large DCD files have been omitted or replaced with new DCD files having the prefix stride50_, which include only every 50th frame. The time between frames in these files is 50 * 50000 steps/frame * 4 fs/step = 10 ns. The system cell trajectory for the NPT runs is also included in output/eq_*.xst.
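The frame-spacing arithmetic for the stride50_ trajectories can be checked with a few lines of Python. This is only a sketch: the function names are hypothetical, and the assumption that frame 0 corresponds to t = 0 is ours rather than something stated in the data set.

```python
# Values quoted in the "Simulation Output" description
STRIDE = 50                # every 50th frame of the original trajectory is kept
STEPS_PER_FRAME = 50_000   # MD steps between frames in the original DCD
TIMESTEP_FS = 4            # femtoseconds per MD step


def frame_interval_ns(stride=STRIDE, steps_per_frame=STEPS_PER_FRAME,
                      timestep_fs=TIMESTEP_FS):
    """Time between consecutive frames of a downsampled trajectory, in ns."""
    return stride * steps_per_frame * timestep_fs / 1e6  # 1 ns = 1e6 fs


def frame_time_ns(frame_index):
    """Time of a zero-based frame, assuming frame 0 sits at t = 0 (an assumption)."""
    return frame_index * frame_interval_ns()


print(frame_interval_ns())  # spacing between stride50_ frames, in ns
```

The same helper gives the spacing of the original, non-downsampled files by passing `stride=1`.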

    Scripts
    -------
    Files with the .sh extension can be found throughout. These usually provide the highest level control for submission of simulations and analysis. Look to these as a guide to what is happening. If there are scripts with step1_*.sh and step2_*.sh, they are intended to be run in order, with step1_*.sh first.


    CONTENTS
    ========

    The directory contents are as follows. The directories Sim_Figure-1 and Sim_Figure-8 include README.txt files that describe the files and naming conventions used throughout this data set.

    Sim_Figure-1: Simulations of N-acetylated C-amidated amino acids (Ac-X-NHMe) at the graphite–water interface.

    Sim_Figure-2: Simulations of different peptide designs (including acyclic, disulfide cyclized, and N-to-C cyclized) at the graphite–water interface.

    Sim_Figure-3: MM-GBSA calculations of different peptide sequences for a folded conformation and 5 misfolded/unfolded conformations.

    Sim_Figure-4: Simulation of four peptide molecules with the sequence cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) at the graphite–water interface at 370 K.

    Sim_Figure-5: Simulation of four peptide molecules with the sequence cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) at the graphite–water interface at 295 K.

    Sim_Figure-5_replica: Temperature replica exchange molecular dynamics simulations for the peptide cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) with 20 replicas for temperatures from 295 to 454 K.

    Sim_Figure-6: Simulation of the peptide molecule cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) in free solution (no graphite).

    Sim_Figure-7: Free energy calculations for folding, adsorption, and pairing for the peptide CHP1404 (sequence: cyc(GTGSGTG-GPGG-GCGTGTG-SGPG)). For folding, we calculate the PMF as a function of RMSD by replica-exchange umbrella sampling (in the subdirectory Folding_CHP1404_Graphene/). We make the same calculation in solution, which required 3 separate replica-exchange umbrella sampling calculations (in the subdirectory Folding_CHP1404_Solution/). Both PMF of RMSD calculations for the scrambled peptide are in Folding_scram1404/. For adsorption, calculation of the PMF for the orientational restraints and the calculation of the PMF along z (the distance between the graphene sheet and the center of mass of the peptide) are in Adsorption_CHP1404/ and Adsorption_scram1404/. The actual calculation of the free energy is done by a shell script ("doRestraintEnergyError.sh") in the 1_free_energy/ subsubdirectory. Processing of the PMFs must be done first in the 0_pmf/ subsubdirectory. Finally, files for free energy calculations of pair formation for CHP1404 are found in the Pair/ subdirectory.

    Sim_Figure-8: Simulation of four peptide molecules with the sequence cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) where the peptides are far above the graphene–water interface in the initial configuration.

    Sim_Figure-9: Two replicates of a simulation of nine peptide molecules with the sequence cyc(GTGSGTG-GPGG-GCGTGTG-SGPG) at the graphite–water interface at 370 K.

    Sim_Figure-9_scrambled: Two replicates of a simulation of nine peptide molecules with the control sequence cyc(GGTPTTGGGGGGSGGPSGTGGC) at the graphite–water interface at 370 K.

    Sim_Figure-10: Adaptive biasing force calculation of the free energy of the folded peptide as a function of the angle between its long axis and the zigzag directions of the underlying graphene sheet.

     

    This material is based upon work supported by the US National Science Foundation under grant no. DMR-1945589. A majority of the computing for this project was performed on the Beocat Research Cluster at Kansas State University, which is funded in part by NSF grants CHE-1726332, CNS-1006860, EPS-1006860, and EPS-0919443. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation grant number ACI-1548562, through allocation BIO200030. 
  4. Deep sea biology is known to thrive at pressures up to ≈1 kbar, which motivates fundamental biophysical studies of biomolecules under such extreme environments. In this work, the conformational equilibrium of the lysine riboswitch has been systematically investigated by single molecule FRET (smFRET) microscopy at pressures up to 1500 bar. The lysine riboswitch preferentially unfolds with increasing pressure, which signals an increase in free volume (ΔV⁰ > 0) upon folding of the biopolymer. Indeed, the effective lysine binding constant increases quasi-exponentially with pressure rise, which implies a significant weakening of the riboswitch–ligand interaction in a high-pressure environment. The effects of monovalent/divalent cations and osmolytes on folding are also explored to acquire additional insights into cellular mechanisms for adapting to high pressures. For example, we find that although Mg2+ greatly stabilizes folding of the lysine riboswitch (ΔΔG⁰ < 0), there is negligible impact on changes in free volume (ΔΔV⁰ ≈ 0) and thus any pressure induced denaturation effects. Conversely, osmolytes (commonly at high concentrations in deep sea marine species) such as trimethylamine N-oxide (TMAO) significantly reduce free volumes (ΔΔV⁰ < 0) and thereby diminish pressure-induced denaturation. We speculate that, besides stabilizing RNA structure, enhanced levels of TMAO in cells might increase the dynamic range for competent riboswitch folding by suppressing the pressure-induced denaturation response. This in turn could offer biological advantage for vertical migration of deep-sea species, with impacts on food searching in a resource limited environment.
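The link between a positive volume change on folding and pressure-induced unfolding can be sketched with a two-state model in which the folding equilibrium constant scales as exp(-P ΔV / RT). The code below is a minimal illustration under that assumption; the function name, the 50 mL/mol volume change, and the reference equilibrium constant of 1 are hypothetical placeholders, not measured values from the study.

```python
import math

R = 8.314462618  # gas constant, J/(mol K)


def folded_fraction(pressure_bar, delta_v_ml_per_mol,
                    k_fold_ref=1.0, temperature=298.15):
    """Two-state folded fraction at a pressure measured from the reference state.

    delta_v_ml_per_mol is the volume change on folding (positive means the
    folded state occupies more volume). k_fold_ref is the folding equilibrium
    constant at the reference pressure (hypothetical value by default).
    """
    p_pa = pressure_bar * 1e5           # bar -> Pa
    dv = delta_v_ml_per_mol * 1e-6      # mL/mol -> m^3/mol
    k_fold = k_fold_ref * math.exp(-p_pa * dv / (R * temperature))
    return k_fold / (1.0 + k_fold)


# Hypothetical volume change of +50 mL/mol: folding is disfavored as P rises.
for p in (0, 500, 1000, 1500):
    print(f"{p:5d} bar -> folded fraction {folded_fraction(p, 50.0):.3f}")
```

With a volume change of zero the folded fraction is independent of pressure, which is the sense in which an additive that shifts stability without changing the volume difference leaves the pressure response untouched.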
  5. Rational design of molecular chelating agents requires a detailed understanding of physicochemical ligand–metal interactions in the solvent phase. Computational quantum chemistry methods should be able to provide this, but computational reports have shown poor accuracy when determining absolute binding constants for many chelating molecules. To understand why, we compare and benchmark static- and dynamics-based computational procedures for a range of monovalent and divalent cations binding to a conventional cryptand molecule: 2.2.2-cryptand ([2.2.2]). The benchmarking comparison shows that dynamics simulations using standard OPLS-AA classical potentials can reasonably predict binding constants for monovalent cations, but these procedures fail for divalent cations. We also consider a computationally efficient static procedure using Kohn–Sham density functional theory (DFT) and cluster-continuum modeling that accounts for local microsolvation and pH effects. This approach accurately predicts binding energies for monovalent and divalent cations with an average error of 3.2 kcal mol⁻¹ compared to experiment. This static procedure thus should be useful for future molecular screening efforts, and high absolute errors in the literature may be due to inadequate modeling of local solvent and pH effects.