Bottom-up coarse-grained (CG) molecular dynamics models are parameterized using complex effective Hamiltonians. These models are typically optimized to approximate high dimensional data from atomistic simulations. However, human validation of these models is often limited to low dimensional statistics that do not necessarily differentiate between the CG model and said atomistic simulations. We propose that classification can be used to variationally estimate high dimensional error and that explainable machine learning can help convey this information to scientists. This approach is demonstrated using Shapley additive explanations and two CG protein models. This framework may also be valuable for ascertaining whether allosteric effects at the atomistic level are accurately propagated to a CG model.
Polyelectrolyte solutions are of considerable scientific and practical importance. One of the most widely studied polymer is polystyrene sulfonate (PSS), which has a hydrophobic backbone with pendant charged groups. A polycation with similar chemical structure is poly(vinyl benzyltri methyl) ammonium (PVBTMA). In this work, we develop coarse-grained (CG) models for PSS and PVBTMA with explicit CG water and with sodium and chloride counterions, respectively. We benchmark the CG models via a comparison with atomistic simulations for single chains. We find that the choice of the topology and the partial charge distribution of the CG model, both play a crucial role in the ability of the CG model to reproduce results from atomistic simulations. There are dramatic consequences, e.g., collapse of polyions, with injudicious choices of the local charge distribution. The polyanions and polycations exhibit a similar conformational and dynamical behavior, suggesting that the sign of the polyion charge does not play a significant role.
more » « less- Award ID(s):
- 1856595
- NSF-PAR ID:
- 10363357
- Publisher / Repository:
- American Institute of Physics
- Date Published:
- Journal Name:
- The Journal of Chemical Physics
- Volume:
- 156
- Issue:
- 9
- ISSN:
- 0021-9606
- Page Range / eLocation ID:
- Article No. 094902
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract Efficient sampling of the conformational space is essential for quantitative simulations of proteins. The multiscale enhanced sampling (MSES) method accelerates atomistic sampling by coupling it to a coarse‐grained (CG) simulation. Bias from coupling to the CG model is removed using Hamiltonian replica exchange, such that one could benefit simultaneously from the high accuracy of atomistic models and fast dynamics of CG ones. Here, we extend MSES to allow independent control of the effective temperatures of atomistic and CG simulations, by directly scaling the atomistic and CG Hamiltonians. The new algorithm, named MSES with independent tempering (MSES‐IT), supports more sophisticated Hamiltonian and temperature replica exchange protocols to further improve the sampling efficiency. Using a small but nontrivial β‐hairpin, we show that setting the effective temperature of CG model in all conditions to its melting temperature maximizes structural transition rates at the CG level and promotes more efficient replica exchange and diffusion in the condition space. As the result, MSES‐IT drive faster reversible transitions at the atomic level and leads to significant improvement in generating converged conformational ensembles compared to the original MSES scheme.
-
ABSTRACT In an effort to accelerate simulations exploring deformation mechanisms in semicrystalline polymers, we have created structure‐based coarse‐grained (CG) models of polyethylene and evaluated the extent to which they can simultaneously represent its amorphous and crystalline phases. Two CG models were calibrated from target data sampled from atomistic simulations of supercooled oligomer melts that differ in how accurately they represent the distribution of bond lengths between CG sites. Both models yield semicrystalline morphology when simulations are performed at ambient conditions, and both accurately predict the glass transition and melt temperatures. A thorough evaluation of the models was then conducted to assess how well they represent various properties of the amorphous and crystalline phases. We found that the model that more faithfully reproduces the target bond length distribution poorly represents the crystalline phase, which results from its inability to reproduce correlations in the structural distributions. The second model, which utilizes a harmonic bond potential and thus reproduces the target bond length distribution less accurately, represents the structure and chain mobility within the crystalline phase more realistically. Furthermore, the latter model more faithfully reproduces the vastly different relaxation timescales of the phases, a critical feature for modeling deformation mechanisms in semicrystalline polymers. © 2019 Wiley Periodicals, Inc. J. Polym. Sci., Part B: Polym. Phys.
2019 ,57 , 331–342 -
Abstract Biotic specialization holds information about the assembly, evolution, and stability of biological communities. Partner availabilities can play an important role in enabling species interactions, where uneven partner availabilities can bias estimates of biotic specialization when using phylogenetic diversity indices. It is therefore important to account for partner availability when characterizing biotic specialization using phylogenies. We developed an index, phylogenetic structure of specialization (PSS), that avoids bias from uneven partner availabilities by uncoupling the null models for interaction frequency and phylogenetic distance. We incorporate the deviation between observed and random interaction frequencies as weights into the calculation of partner phylogenetic α‐diversity. To calculate the PSS index, we then compare observed partner phylogenetic α‐diversity to a null distribution generated by randomizing phylogenetic distances among the same number of partners. PSS quantifies the phylogenetic structure (i.e., clustered, overdispersed, or random) of the partners of a focal species. We show with simulations that the PSS index is not correlated with network properties, which allows comparisons across multiple systems. We also implemented PSS on empirical networks of host–parasite, avian seed‐dispersal, lichenized fungi–cyanobacteria, and hummingbird pollination interactions. Across these systems, a large proportion of taxa interact with phylogenetically random partners according to PSS, sometimes to a larger extent than detected with an existing method that does not account for partner availability. We also found that many taxa interact with phylogenetically clustered partners, while taxa with overdispersed partners were rare. We argue that species with phylogenetically overdispersed partners have often been misinterpreted as generalists when they should be considered specialists. Our results highlight the important role of randomness in shaping interaction networks, even in highly intimate symbioses, and provide a much‐needed quantitative framework to assess the role that evolutionary history and symbiotic specialization play in shaping patterns of biodiversity. PSS is available as an R package at
https://github.com/cjpardodelahoz/pss . -
The integral equation coarse-graining (IECG) approach is a promising high-level coarse-graining (CG) method for polymer melts, with variable resolution from soft spheres to multi CG sites, which preserves the structural and thermodynamical consistencies with the related atomistic simulations. When compared to the atomistic description, the procedure of coarse-graining results in smoother free energy surfaces, longer-ranged potentials, a decrease in the number of interaction sites for a given polymer, and more. Because these changes have competing effects on the computational efficiency of the CG model, care needs to be taken when studying the effect of coarse-graining on the computational speed-up in CG molecular dynamics simulations. For instance, treatment of long-range CG interactions requires the selection of cutoff distances that include the attractive part of the effective CG potential and force. In particular, we show how the complex nature of the range and curvature of the effective CG potential, the selection of a suitable CG timestep, the choice of the cutoff distance, the molecular dynamics algorithms, and the smoothness of the CG free energy surface affect the efficiency of IECG simulations. By direct comparison with the atomistic simulations of relatively short chain polymer melts, we find that the overall computational efficiency is highest for the highest level of CG (soft spheres), with an overall improvement of the computational efficiency being about 10 6 –10 8 for various CG levels/resolutions. Therefore, the IECG method can have important applications in molecular dynamics simulations of polymeric systems. Finally, making use of the standard spatial decomposition algorithm, the parallel scalability of the IECG simulations for various levels of CG is presented. Optimal parallel scaling is observed for a reasonably large number of processors. Although this study is performed using the IECG approach, its results on the relation between the level of CG and the computational efficiency are general and apply to any properly-constructed CG model.more » « less