skip to main content


Title: OpenAWSEM with Open3SPN2: A fast, flexible, and accessible framework for large-scale coarse-grained biomolecular simulations
We present OpenAWSEM and Open3SPN2, new cross-compatible implementations of coarse-grained models for protein (AWSEM) and DNA (3SPN2) molecular dynamics simulations within the OpenMM framework. These new implementations retain the chemical accuracy and intrinsic efficiency of the original models while adding GPU acceleration and the ease of forcefield modification provided by OpenMM’s Custom Forces software framework. By utilizing GPUs, we achieve around a 30-fold speedup in protein and protein-DNA simulations over the existing LAMMPS-based implementations running on a single CPU core. We showcase the benefits of OpenMM’s Custom Forces framework by devising and implementing two new potentials that allow us to address important aspects of protein folding and structure prediction and by testing the ability of the combined OpenAWSEM and Open3SPN2 to model protein-DNA binding. The first potential is used to describe the changes in effective interactions that occur as a protein becomes partially buried in a membrane. We also introduced an interaction to describe proteins with multiple disulfide bonds. Using simple pairwise disulfide bonding terms results in unphysical clustering of cysteine residues, posing a problem when simulating the folding of proteins with many cysteines. We now can computationally reproduce Anfinsen’s early Nobel prize winning experiments by using OpenMM’s Custom Forces framework to introduce a multi-body disulfide bonding term that prevents unphysical clustering. Our protein-DNA simulations show that the binding landscape is funneled towards structures that are quite similar to those found using experiments. In summary, this paper provides a simulation tool for the molecular biophysics community that is both easy to use and sufficiently efficient to simulate large proteins and large protein-DNA systems that are central to many cellular processes. These codes should facilitate the interplay between molecular simulations and cellular studies, which have been hampered by the large mismatch between the time and length scales accessible to molecular simulations and those relevant to cell biology.  more » « less
Award ID(s):
2019745
PAR ID:
10233534
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ;
Editor(s):
Schneidman-Duhovny, Dina
Date Published:
Journal Name:
PLOS Computational Biology
Volume:
17
Issue:
2
ISSN:
1553-7358
Page Range / eLocation ID:
e1008308
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Structural, regulatory and enzymatic proteins interact with DNA to maintain a healthy and functional genome. Yet, our structural understanding of how proteins interact with DNA is limited. We present MELD-DNA, a novel computational approach to predict the structures of protein–DNA complexes. The method combines molecular dynamics simulations with general knowledge or experimental information through Bayesian inference. The physical model is sensitive to sequence-dependent properties and conformational changes required for binding, while information accelerates sampling of bound conformations. MELD-DNA can: (i) sample multiple binding modes; (ii) identify the preferred binding mode from the ensembles; and (iii) provide qualitative binding preferences between DNA sequences. We first assess performance on a dataset of 15 protein–DNA complexes and compare it with state-of-the-art methodologies. Furthermore, for three selected complexes, we show sequence dependence effects of binding in MELD predictions. We expect that the results presented herein, together with the freely available software, will impact structural biology (by complementing DNA structural databases) and molecular recognition (by bringing new insights into aspects governing protein–DNA interactions).

     
    more » « less
  2. Abstract

    The ability of cells and tissues to respond differentially to mechanical forces applied in distinct directions is mediated by the ability of load-bearing proteins to preferentially maintain physical linkages in certain directions. However, the molecular basis and biological consequences of directional force-sensitive binding remain unclear. Vinculin (Vcn) is a load-bearing linker protein that exhibits directional catch bonding due to interactions between the Vcn tail domain (Vt) and filamentous (F)-actin. We developed a computational approach to predict Vcn residues involved in directional catch bonding and produced a set of associated Vcn variants with unaltered Vt structure, actin binding, or phospholipid interactions. Incorporation of the variants did not affect Vcn activation but reduced Vcn loading and altered exchange dynamics, consistent with the loss of directional catch bonding. Expression of Vcn variants perturbed the coordination of subcellular structures and cell migration, establishing key cellular functions for Vcn directional catch bonding.

     
    more » « less
  3. Babitzke, Paul (Ed.)
    ABSTRACT Oxidative stress causes cellular damage, including DNA mutations, protein dysfunction, and loss of membrane integrity. Here, we discovered that a TrmB (transcription regulator of mal operon) family protein (Pfam PF01978) composed of a single winged-helix DNA binding domain (InterPro IPR002831) can function as thiol-based transcriptional regulator of oxidative stress response. Using the archaeon Haloferax volcanii as a model system, we demonstrate that the TrmB-like OxsR is important for recovery of cells from hypochlorite stress. OxsR is shown to bind specific regions of genomic DNA, particularly during hypochlorite stress. OxsR-bound intergenic regions were found proximal to oxidative stress operons, including genes associated with thiol relay and low molecular weight thiol biosynthesis. Further analysis of a subset of these sites revealed OxsR to function during hypochlorite stress as a transcriptional activator and repressor. OxsR was shown to require a conserved cysteine (C24) for function and to use a CG-rich motif upstream of conserved BRE/TATA box promoter elements for transcriptional activation. Protein modeling suggested the C24 is located at a homodimer interface formed by antiparallel α helices, and that oxidation of this cysteine would result in the formation of an intersubunit disulfide bond. This covalent linkage may promote stabilization of an OxsR homodimer with the enhanced DNA binding properties observed in the presence of hypochlorite stress. The phylogenetic distribution TrmB family proteins, like OxsR, that have a single winged-helix DNA binding domain and conserved cysteine residue suggests this type of redox signaling mechanism is widespread in Archaea. IMPORTANCE TrmB-like proteins, while not yet associated with redox stress, are found in bacteria and widespread in archaea. Here, we expand annotation of a large group of TrmB-like single winged-helix DNA binding domain proteins from diverse archaea to function as thiol-based transcriptional regulators of oxidative stress response. Using Haloferax volcanii as a model, we reveal that the TrmB-like OxsR functions during hypochlorite stress as a transcriptional activator and repressor of an extensive gene coexpression network associated with thiol relay and other related activities. A conserved cysteine residue of OxsR serves as the thiol-based sensor for this function and likely forms an intersubunit disulfide bond during hypochlorite stress that stabilizes a homodimeric configuration with enhanced DNA binding properties. A CG-rich DNA motif in the promoter region of a subset of sites identified to be OxsR-bound is required for regulation; however, not all sites have this motif, suggesting added complexity to the regulatory network. 
    more » « less
  4. Abstract

    Many proteins must interact with molecular chaperones to achieve their native state in the cell. Yet, how chaperone binding‐site characteristics affect the folding process is poorly understood. The ubiquitous Hsp70 chaperone system prevents client‐protein aggregation by holding unfolded conformations and by unfolding misfolded states. Hsp70 binding sites of client proteins comprise a nonpolar core surrounded by positively charged residues. However, a detailed analysis of Hsp70 binding sites on a proteome‐wide scale is still lacking. Further, it is not known whether proteins undergo some degree of folding while chaperone bound. Here, we begin to address the above questions by identifying Hsp70 binding sites in 2258Escherichia coli(E. coli) proteins. We find that most proteins bear at least one Hsp70 binding site and that the number of Hsp70 binding sites is directly proportional to protein size. Aggregation propensity upon release from the ribosome correlates with number of Hsp70 binding sites only in the case of large proteins. Interestingly, Hsp70 binding sites are more solvent‐exposed than other nonpolar sites, in protein native states. Our findings show that the majority ofE. coliproteins are systematically enabled to interact with Hsp70 even if this interaction only takes place during a fraction of the protein lifetime. In addition, our data suggest that some conformational sampling may take place within Hsp70‐bound states, due to the solvent exposure of some chaperone binding sites in native proteins. In all, we propose that Hsp70‐chaperone‐binding traits have evolved to favor Hsp70‐assisted protein folding devoid of aggregation.

     
    more » « less
  5. Colloidal particles with mobile binding molecules constitute a powerful platform for probing the physics of self-assembly. Binding molecules are free to diffuse and rearrange on the surface, giving rise to spontaneous control over the number of droplet–droplet bonds, i.e. , valence, as a function of the concentration of binders. This type of valence control has been realized experimentally by tuning the interaction strength between DNA-coated emulsion droplets. Optimizing for valence two yields droplet polymer chains, termed ‘colloidomers’, which have recently been used to probe the physics of folding. To understand the underlying self-assembly mechanisms, here we present a coarse-grained molecular dynamics (CGMD) model to study the self-assembly of this class of systems using explicit representations of mobile binding sites . We explore how valence of assembled structures can be tuned through kinetic control in the strong binding limit. More specifically, we optimize experimental control parameters to obtain the highest yield of long linear colloidomer chains. Subsequently tuning the dynamics of binding and unbinding via a temperature-dependent model allows us to observe a heptamer chain collapse into all possible rigid structures, in good agreement with recent folding experiments. Our CGMD platform and dynamic bonding model (implemented as an open-source custom plugin to HOOMD-Blue) reveal the molecular features governing the binding patch size and valence control, and opens the study of pathways in colloidomer folding. This model can therefore guide programmable design in experiments. 
    more » « less