Bottom-up coarse-grained (CG) molecular dynamics models are parameterized using complex effective Hamiltonians. These models are typically optimized to approximate high dimensional data from atomistic simulations. However, human validation of these models is often limited to low dimensional statistics that do not necessarily differentiate between the CG model and said atomistic simulations. We propose that classification can be used to variationally estimate high dimensional error and that explainable machine learning can help convey this information to scientists. This approach is demonstrated using Shapley additive explanations and two CG protein models. This framework may also be valuable for ascertaining whether allosteric effects at the atomistic level are accurately propagated to a CG model.
more »
« less
Everything You Want to Know About Coarse‐Graining and Never Dared to Ask: Macromolecules as a Key Example
ABSTRACT Coarse‐graining (CG) is transforming the study of molecular systems, allowing researchers to explore by computer simulations larger and more complex structures than ever before. Continued advancements in CG techniques are making simulations more efficient, establishing this approach as a cornerstone for designing innovative materials and eco‐friendly alternatives to traditional plastics. Additionally, CG methods are becoming indispensable for unraveling the complexities and functional mechanisms of large‐scale macromolecular machines within cells. Yet, crafting an effective coarse‐grained model demands a nuanced understanding of its advantages and limitations. Faster simulations come at the cost of molecular detail and accuracy in some properties, so that it is essential to balance computational efficiency with the specific needs of the system one wants to simulate. By asking the right questions, researchers can select models that offer the desired benefits while managing trade‐offs. This article delves into the potential of different CG models and the compromises inherent in their adoption, highlighting their role in shaping the future of material science and biophysics.
more »
« less
- Award ID(s):
- 2154999
- PAR ID:
- 10584541
- Publisher / Repository:
- Wiley Blackwell (John Wiley & Sons)
- Date Published:
- Journal Name:
- WIREs Computational Molecular Science
- Volume:
- 15
- Issue:
- 2
- ISSN:
- 1759-0876
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
COARSE-GRAINED MOLECULAR DYNAMICS SIMULATIONS OF SOFT MATTER RELEVANT TO THE PHARMACEUTICAL INDUSTRYSoft materials are critical to the pharmaceutical industry for their role in formulations, delivery of active compounds or understanding relevant physiological processes. This chapter will focus on coarse-grained (CG) approaches and models that have been used in conjunction with the Molecular Dynamics (MD) simulation method to investigate soft materials of interest to various applications in pharmaceutical sciences. This chapter also discusses several examples of CG MD simulations used to scientifically probe molecules with different chemistries.more » « less
-
The integral equation coarse-graining (IECG) approach is a promising high-level coarse-graining (CG) method for polymer melts, with variable resolution from soft spheres to multi CG sites, which preserves the structural and thermodynamical consistencies with the related atomistic simulations. When compared to the atomistic description, the procedure of coarse-graining results in smoother free energy surfaces, longer-ranged potentials, a decrease in the number of interaction sites for a given polymer, and more. Because these changes have competing effects on the computational efficiency of the CG model, care needs to be taken when studying the effect of coarse-graining on the computational speed-up in CG molecular dynamics simulations. For instance, treatment of long-range CG interactions requires the selection of cutoff distances that include the attractive part of the effective CG potential and force. In particular, we show how the complex nature of the range and curvature of the effective CG potential, the selection of a suitable CG timestep, the choice of the cutoff distance, the molecular dynamics algorithms, and the smoothness of the CG free energy surface affect the efficiency of IECG simulations. By direct comparison with the atomistic simulations of relatively short chain polymer melts, we find that the overall computational efficiency is highest for the highest level of CG (soft spheres), with an overall improvement of the computational efficiency being about 10 6 –10 8 for various CG levels/resolutions. Therefore, the IECG method can have important applications in molecular dynamics simulations of polymeric systems. Finally, making use of the standard spatial decomposition algorithm, the parallel scalability of the IECG simulations for various levels of CG is presented. Optimal parallel scaling is observed for a reasonably large number of processors. Although this study is performed using the IECG approach, its results on the relation between the level of CG and the computational efficiency are general and apply to any properly-constructed CG model.more » « less
-
Abstract The molecular events that permit the spike glycoprotein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) to bind and enter cells are important to understand for both fundamental and therapeutic reasons. Spike proteins consist of S1 and S2 domains, which recognize angiotensin-converting enzyme 2 (ACE2) receptors and contain the viral fusion machinery, respectively. Ostensibly, the binding of spike trimers to ACE2 receptors promotes dissociation of the S1 domains and exposure of the fusion machinery, although the molecular details of this process have yet to be observed. We report the development of bottom-up coarse-grained (CG) models consistent with cryo-electron tomography data, and the use of CG molecular dynamics simulations to investigate viral binding and S2 core exposure. We show that spike trimers cooperatively bind to multiple ACE2 dimers at virion-cell interfaces in a manner distinct from binding between soluble proteins, which processively induces S1 dissociation. We also simulate possible variant behavior using perturbed CG models, and find that ACE2-induced S1 dissociation is primarily sensitive to conformational state populations and the extent of S1/S2 cleavage, rather than ACE2 binding affinity. These simulations reveal an important concerted interaction between spike trimers and ACE2 dimers that primes the virus for membrane fusion and entry.more » « less
-
Coarse-grained molecular dynamics (CGMD) simulations address lengthscales and timescales that are critical to many chemical and material applications. Nevertheless, contemporary CGMD modeling is relatively bespoke and there are no black-box CGMD methodologies available that could play a comparable role in discovery applications that density functional theory plays for electronic structure. This gap might be filled by machine learning (ML)-based CGMD potentials that simplify model development, but these methods are still in their early stages and have yet to demonstrate a significant advantage over existing physics-based CGMD methods. Here, we explore the potential of Δ-learning models to leverage the advantages of these two approaches. This is implemented by using ML-based potentials to learn the difference between the target CGMD variable and the predictions of physics-based potentials. The Δ-models are benchmarked against the baseline models in reproducing on-target and off-target atomistic properties as a function of CG resolution, mapping operator, and system topology. The Δ-models outperform the reference ML-only CGMD models in nearly all scenarios. In several cases, the ML-only models manage to minimize training errors while still producing qualitatively incorrect dynamics, which is corrected by the Δ-models. Given their negligible added cost, Δ-models provide essentially free gains over their ML-only counterparts. Nevertheless, an unexpected finding is that neither the Δ-learning models nor the ML-only models significantly outperform the elementary pairwise models in reproducing atomistic properties. This fundamental failure is attributed to the relatively large irreducible force errors associated with coarse-graining that produces little benefit from using more complex potentials.more » « less
An official website of the United States government
