skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Docking-based long timescale simulation of cell-size protein systems at atomic resolution
Computational methodologies are increasingly addressing modeling of the whole cell at the molecular level. Proteins and their interactions are the key component of cellular processes. Techniques for modeling protein interactions, thus far, have included protein docking and molecular simulation. The latter approaches account for the dynamics of the interactions but are relatively slow, if carried out at all-atom resolution, or are significantly coarse grained. Protein docking algorithms are far more efficient in sampling spatial coordinates. However, they do not account for the kinetics of the association (i.e., they do not involve the time coordinate). Our proof-of-concept study bridges the two modeling approaches, developing an approach that can reach unprecedented simulation timescales at all-atom resolution. The global intermolecular energy landscape of a large system of proteins was mapped by the pairwise fast Fourier transform docking and sampled in space and time by Monte Carlo simulations. The simulation protocol was parametrized on existing data and validated on a number of observations from experiments and molecular dynamics simulations. The simulation protocol performed consistently across very different systems of proteins at different protein concentrations. It recapitulated data on the previously observed protein diffusion rates and aggregation. The speed of calculation allows reaching second-long trajectories of protein systems that approach the size of the cells, at atomic resolution.  more » « less
Award ID(s):
1917263
PAR ID:
10481703
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
NAS
Date Published:
Journal Name:
Proceedings of the National Academy of Sciences
Volume:
119
Issue:
41
ISSN:
0027-8424
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The diffusion of proteins is significantly affected by macromolecular crowding. Molecular simulations accounting for protein interactions at atomic resolution are useful for characterizing the diffusion patterns in crowded environments. We present a comprehensive analysis of protein diffusion under different crowding conditions based on our recent docking-based approach simulating an intracellular crowded environment by sampling the intermolecular energy landscape using the Markov Chain Monte Carlo protocol. The procedure was extensively benchmarked, and the results are in very good agreement with the available experimental and theoretical data. The translational and rotational diffusion rates were determined for different types of proteins under crowding conditions in a broad range of concentrations. A protein system representing most abundant protein types in the E. coli cytoplasm was simulated, as well as large systems of other proteins of varying sizes in heterogeneous and self-crowding solutions. Dynamics of individual proteins was analyzed as a function of concentration and different diffusion rates in homogeneous and heterogeneous crowding. Smaller proteins diffused faster in heterogeneous crowding of larger molecules, compared to their diffusion in the self-crowded solution. Larger proteins displayed the opposite behavior, diffusing faster in the self-crowded solution. The results show the predictive power of our structure-based simulation approach for long timescales of cell-size systems at atomic resolution. 
    more » « less
  2. Fluctuations of protein three-dimensional structures and large-scale conformational transitions are crucial for the biological function of proteins and their complexes. Experimental studies of such phenomena remain very challenging and therefore molecular modeling can be a good alternative or a valuable supporting tool for the investigation of large molecular systems and long-time events. In this minireview, we present two alternative approaches to the coarse-grained (CG) modeling of dynamic properties of protein systems. We discuss two CG representations of polypeptide chains used for Monte Carlo dynamics simulations of protein local dynamics and conformational transitions, and highly simplified structure-based elastic network models of protein flexibility. In contrast to classical all-atom molecular dynamics, the modeling strategies discussed here allow the quite accurate modeling of much larger systems and longer-time dynamic phenomena. We briefly describe the main features of these models and outline some of their applications, including modeling of near-native structure fluctuations, sampling of large regions of the protein conformational space, or possible support for the structure prediction of large proteins and their complexes. 
    more » « less
  3. Nanodiscs are discoidal protein–lipid complexes that have wide applications in membrane protein studies. Modeling and simulation of nanodiscs are challenging due to the absence of structures of many membrane scaffold proteins (MSPs) that wrap around the membrane bilayer. We have developed CHARMM‐GUINanodisc Builder(http://www.charmm-gui.org/input/nanodisc) to facilitate the setup of nanodisc simulation systems by modeling the MSPs with defined size and known structural features. A total of 11 different nanodiscs with a diameter from 80 to 180 Å are made available in both the all‐atom CHARMM and two coarse‐grained (PACE and Martini) force fields. The usage of theNanodisc Builderis demonstrated with various simulation systems. The structures and dynamics of proteins and lipids in these systems were analyzed, showing similar behaviors to those from previous all‐atom and coarse‐grained nanodisc simulations. We expect theNanodisc Builderto be a convenient and reliable tool for modeling and simulation of nanodisc systems. © 2019 Wiley Periodicals, Inc. 
    more » « less
  4. Abstract A generalized understanding of protein dynamics is an unsolved scientific problem, the solution of which is critical to the interpretation of the structure-function relationships that govern essential biological processes. Here, we approach this problem by constructing coarse-grained molecular potentials based on artificial neural networks and grounded in statistical mechanics. For training, we build a unique dataset of unbiased all-atom molecular dynamics simulations of approximately 9 ms for twelve different proteins with multiple secondary structure arrangements. The coarse-grained models are capable of accelerating the dynamics by more than three orders of magnitude while preserving the thermodynamics of the systems. Coarse-grained simulations identify relevant structural states in the ensemble with comparable energetics to the all-atom systems. Furthermore, we show that a single coarse-grained potential can integrate all twelve proteins and can capture experimental structural features of mutated proteins. These results indicate that machine learning coarse-grained potentials could provide a feasible approach to simulate and understand protein dynamics. 
    more » « less
  5. Abstract Protein docking is essential for structural characterization of protein interactions. Besides providing the structure of protein complexes, modeling of proteins and their complexes is important for understanding the fundamental principles and specific aspects of protein interactions. The accuracy of protein modeling, in general, is still less than that of the experimental approaches. Thus, it is important to investigate the applicability of docking techniques to modeled proteins. We present new comprehensive benchmark sets of protein models for the development and validation of protein docking, as well as a systematic assessment of free and template‐based docking techniques on these sets. As opposed to previous studies, the benchmark sets reflect the real case modeling/docking scenario where the accuracy of the models is assessed by the modeling procedure, without reference to the native structure (which would be unknown in practical applications). We also expanded the analysis to include docking of protein pairs where proteins have different structural accuracy. The results show that, in general, the template‐based docking is less sensitive to the structural inaccuracies of the models than the free docking. The near‐native docking poses generated by the template‐based approach, typically, also have higher ranks than those produces by the free docking (although the free docking is indispensable in modeling the multiplicity of protein interactions in a crowded cellular environment). The results show that docking techniques are applicable to protein models in a broad range of modeling accuracy. The study provides clear guidelines for practical applications of docking to protein models. 
    more » « less