
Title: Data-driven coarse-grained modeling of polymers in solution with structural and dynamic properties conserved
We present data-driven coarse-grained (CG) modeling for polymers in solution, which conserves the dynamic as well as structural properties of the underlying atomistic system. The CG modeling is built upon the framework of the generalized Langevin equation (GLE). The key is to determine each term in the GLE by directly linking it to atomistic data. In particular, we propose a two-stage Gaussian process-based Bayesian optimization method to infer the non-Markovian memory kernel from the data of the velocity autocorrelation function (VACF). Considering that the long-time behaviors of the VACF and memory kernel for polymer solutions can exhibit hydrodynamic scaling (algebraic decay with time), we further develop an active learning method to determine the emergence of hydrodynamic scaling, which can accelerate the inference process of the memory kernel. The proposed methods do not rely on how the mean force or CG potential in the GLE is constructed. Thus, we also compare two methods for constructing the CG potential: a deep learning method and the iterative Boltzmann inversion method. With the memory kernel and CG potential determined, the GLE is mapped onto an extended Markovian process to circumvent the expensive cost of directly solving the GLE. The accuracy and computational efficiency of the proposed CG modeling are assessed in a model star-polymer solution system at three representative concentrations. By comparing with the reference atomistic simulation results, we demonstrate that the proposed CG modeling can robustly and accurately reproduce the dynamic and structural properties of polymers in solution.
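As a rough illustration of the link between the memory kernel and the VACF that any such inference has to evaluate, the sketch below computes the VACF produced by a trial kernel. It assumes a force-free, one-dimensional GLE, for which the normalized VACF C(t) obeys M dC/dt = -∫_0^t K(t-s) C(s) ds with C(0) = 1. The function name `vacf_from_kernel`, the trapezoidal discretization, and the exponential test kernel are assumptions made for illustration; this is not the Bayesian optimization procedure of the paper.

```python
import numpy as np


def vacf_from_kernel(kernel, dt, mass=1.0):
    """Normalized VACF C(t) of a force-free GLE for a kernel on a uniform grid.

    Solves  mass * dC/dt = -int_0^t K(t - s) C(s) ds,  C(0) = 1,
    using the trapezoidal rule for both the convolution and the time step.
    """
    kernel = np.asarray(kernel, dtype=float)
    n = kernel.size
    c = np.empty(n)
    c[0] = 1.0

    def rate(i):
        # Trapezoidal estimate of -(1/mass) * int_0^{t_i} K(t_i - s) C(s) ds.
        full_sum = np.dot(kernel[i::-1], c[: i + 1])
        end_terms = 0.5 * (kernel[i] * c[0] + kernel[0] * c[i])
        return -dt * (full_sum - end_terms) / mass

    rate_prev = 0.0  # the memory integral vanishes at t = 0
    for i in range(1, n):
        c[i] = c[i - 1] + dt * rate_prev  # explicit predictor
        for _ in range(2):  # two corrector sweeps of the implicit trapezoid step
            c[i] = c[i - 1] + 0.5 * dt * (rate_prev + rate(i))
        rate_prev = rate(i)
    return c


# Hypothetical test kernel K(t) = a^2 exp(-t / tau); values chosen only for illustration.
a, tau, dt = 2.0, 1.0, 0.005
t = np.arange(2000) * dt
vacf = vacf_from_kernel(a**2 * np.exp(-t / tau), dt)
```

For this exponential kernel with a^2 > 1/(4 tau^2), the exact solution is C(t) = exp(-t/(2 tau)) [cos(w t) + sin(w t)/(2 tau w)] with w^2 = a^2 - 1/(4 tau^2), which gives a convenient check on the discretization.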
Journal Name: Soft Matter
Page Range or eLocation-ID: 8330 to 8344
Sponsoring Org: National Science Foundation
More Like this
  1. We present a bottom-up coarse-graining (CG) method to establish implicit-solvent CG modeling for polymers in solution, which conserves the dynamic properties of the reference microscopic system. In particular, tens to hundreds of bonded polymer atoms (or Lennard-Jones beads) are coarse-grained as one CG particle, and the solvent degrees of freedom are eliminated. The dynamics of the CG system is governed by the generalized Langevin equation (GLE) derived via the Mori-Zwanzig formalism, by which the CG variables can be directly and rigorously linked to the microscopic dynamics generated by molecular dynamics (MD) simulations. The solvent-mediated dynamics of polymers is modeled by the non-Markovian stochastic dynamics in GLE, where the memory kernel can be computed from the MD trajectories. To circumvent the difficulty in direct evaluation of the memory term and generation of colored noise, we exploit the equivalence between the non-Markovian dynamics and Markovian dynamics in an extended space. To this end, the CG system is supplemented with auxiliary variables that are coupled linearly to the momentum and among themselves, subject to uncorrelated Gaussian white noise. A high-order time-integration scheme is used to solve the extended dynamics to further accelerate the CG simulations. To assess, validate, and demonstrate the established implicit-solvent CG modeling, we have applied it to study four different types of polymers in solution. The dynamic properties of polymers characterized by the velocity autocorrelation function, diffusion coefficient, and mean square displacement as functions of time are evaluated in both CG and MD simulations. Results show that the extended dynamics with auxiliary variables can construct arbitrarily high-order CG models to reproduce dynamic properties of the reference microscopic system and to characterize long-time dynamics of polymers in solution.
  2. The present work concerns the transferability of coarse-grained (CG) modeling in reproducing the dynamic properties of the reference atomistic systems across a range of parameters. In particular, we focus on implicit-solvent CG modeling of polymer solutions. The CG model is based on the generalized Langevin equation, where the memory kernel plays the critical role in determining the dynamics on all time scales. Thus, we propose methods for transfer learning of memory kernels. The key ingredient of our methods is Gaussian process regression. By integration with the model order reduction via proper orthogonal decomposition and the active learning technique, the transfer learning can be practically efficient and requires minimal training data. Through two example polymer solution systems, we demonstrate the accuracy and efficiency of the proposed transfer learning methods in the construction of transferable memory kernels. The transferability allows for out-of-sample predictions, even in the extrapolated domain of parameters. Built on the transferable memory kernels, the CG models can reproduce the dynamic properties of polymers on all time scales at different thermodynamic conditions (such as temperature and solvent viscosity) and for different systems with varying concentrations and lengths of polymers.
  3. Modeling a high-dimensional Hamiltonian system in reduced dimensions with respect to coarse-grained (CG) variables can greatly reduce computational cost and enable efficient bottom-up prediction of main features of the system for many applications. However, it usually experiences significantly altered dynamics due to loss of degrees of freedom upon coarse-graining. To establish CG models that can faithfully preserve dynamics, previous efforts mainly focused on equilibrium systems. In contrast, various soft matter systems are known to be out of equilibrium. Therefore, the present work concerns non-equilibrium systems and enables accurate and efficient CG modeling that preserves non-equilibrium dynamics and is generally applicable to any non-equilibrium process and any observable of interest. To this end, the dynamic equation of a CG variable is built in the form of the non-stationary generalized Langevin equation (nsGLE), where the two-time memory kernel is determined from the data of the auto-correlation function of the observable of interest. By embedding the nsGLE in an extended dynamics framework, the nsGLE can be solved efficiently to predict the non-equilibrium dynamics of the CG variable. To prove and exploit the equivalence of the nsGLE and extended dynamics, the memory kernel is parameterized in a two-time exponential expansion. A data-driven hybrid optimization process is proposed for the parameterization, which integrates the differential-evolution method with the Levenberg–Marquardt algorithm to efficiently tackle a non-convex and high-dimensional optimization problem.
  4. The integral equation coarse-graining (IECG) approach is a promising high-level coarse-graining (CG) method for polymer melts, with variable resolution from soft spheres to multi CG sites, which preserves the structural and thermodynamical consistencies with the related atomistic simulations. When compared to the atomistic description, the procedure of coarse-graining results in smoother free energy surfaces, longer-ranged potentials, a decrease in the number of interaction sites for a given polymer, and more. Because these changes have competing effects on the computational efficiency of the CG model, care needs to be taken when studying the effect of coarse-graining on the computational speed-up in CG molecular dynamics simulations. For instance, treatment of long-range CG interactions requires the selection of cutoff distances that include the attractive part of the effective CG potential and force. In particular, we show how the complex nature of the range and curvature of the effective CG potential, the selection of a suitable CG timestep, the choice of the cutoff distance, the molecular dynamics algorithms, and the smoothness of the CG free energy surface affect the efficiency of IECG simulations. By direct comparison with the atomistic simulations of relatively short chain polymer melts, we find that the overall computational efficiency is highest for the highest level of CG (soft spheres), with an overall improvement of the computational efficiency being about 10^6–10^8 for various CG levels/resolutions. Therefore, the IECG method can have important applications in molecular dynamics simulations of polymeric systems. Finally, making use of the standard spatial decomposition algorithm, the parallel scalability of the IECG simulations for various levels of CG is presented. Optimal parallel scaling is observed for a reasonably large number of processors. Although this study is performed using the IECG approach, its results on the relation between the level of CG and the computational efficiency are general and apply to any properly-constructed CG model.
  5. Statement of Purpose: Hybrid nanoparticles in which a polymer is used to stabilize the secondary structure of an enzyme provide a means to preserve its activity in non-native environments. This approach is illustrated here with horseradish peroxidase (HRP), an important heme enzyme used in medical diagnostic, biosensing, and biotechnological applications. Polymer chaperones in these polymer-enzyme complex (PEC) nanoparticles can enhance the utility of enzymes in unfavorable environments. Structural analysis of the PECs is a crucial link in the machine-learning driven iterative optimization cycle of polymer synthesis and testing. Here, we discuss the utility of small-angle X-ray scattering (SAXS) and quartz crystal microbalance with dissipation (QCMD) for evaluating PECs. Materials and Methods: Six polymers were synthesized by automated photoinduced electron/energy transfer-reversible addition-fragmentation chain-transfer (PET-RAFT) polymerization directly in 96-well plates [1]. Multiple molar ratios of enzyme:polymer (1:1, 1:5, 1:10, and 1:50) were characterized. HRP was mixed with the polymer and heated to 65 °C for 1 hr to form PECs. Enzyme assay and circular dichroism measurements were performed along with SAXS and QCMD to understand polymer-protein interactions. SAXS data were obtained at NSLS-II beamline 16-ID. Results and Discussion: SAXS data were analyzed to determine the radius of gyration (Rg), Porod exponent, and pair distance distribution functions (P(r)) (Figure 1). Rg, which corresponds to the size of the PEC nanoparticles, is sensitive to the polydispersity of the solution and does not change significantly in the presence of the polymer GEP1. Notably, the maximal dimension does not change as significantly upon heating to denaturation in the case of the PEC as it does with HRP alone. The effect of denaturation induced by heating seems to depend on the molar ratio of the polymer to enzyme. The Porod exponent, which is related to roughness, decreased from about 4 to 3 upon complexation, indicating polymer binding to the enzyme’s surface. These findings were confirmed by modeling the structures of the HRP, the polymer, and the PEC using DAMMIF/DAMMIN and MONSA (ATSAS software). The changes observed in the structure could be correlated to the measured enzymatic activity. Figure 2 shows the evolution of the PEC when the polymer is deposited onto the enzyme immobilized on gold-coated QCM sensors. The plots show the changes in frequency (f) and dissipation (D) with time as HRP is first deposited, followed by the adsorption of the polymer. Large changes in f and D show that the polymer forms a complex with HRP. Such changes were not observed with negative controls, Pluronics and poly(ethylene glycol). Comparison of the data from free particles in solution with QCM data from immobilized enzymes shows that the conformation of the complexes in solution and surface-bound HRP could be different. This way, we were able to explore the various states of complex formation under different conditions with different polymers. Figure 1. P(r) plots for PEC vs. HRP before and after heating, illustrating the increased enzymatic stability due to polymer additives. Figure 2. QCMD data showing the interaction between the immobilized HRP and the polymer. 3rd and 5th harmonics are plotted (blue -f; red-D). Conclusion: SAXS and QCMD data show stabilization of the enzyme activity through inhibition of the unraveling of the secondary structure, as seen in size, surface roughness, pair distribution function, and percent helicity. Acknowledgment: This work was supported by NSF grant 2009942. References: [1] Tamasi, M, et al. Adv Intell Syst 2020, 2(2): 1900126.
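The extended-dynamics idea described in item 1 of the list above (auxiliary variables coupled linearly to the momentum and driven by white noise) can be sketched for the simplest case. The example below assumes a force-free, one-dimensional particle of unit mass and a single-exponential kernel K(t) = a^2 exp(-t/tau), which needs only one auxiliary variable z. The function name, the parameter names, and the first-order semi-implicit Euler step are assumptions made for illustration; this is not the high-order integrator or the multi-term kernels of the cited work.

```python
import numpy as np


def simulate_extended_gle(v0, a, tau, kT, dt, n_steps, rng=None):
    """Velocity trajectory of a unit-mass particle under the extended dynamics

        dv = a * z dt
        dz = -(a * v + z / tau) dt + sqrt(2 * kT / tau) dW,

    which corresponds to a GLE with kernel K(t) = a**2 * exp(-t / tau) and
    colored noise R(t) satisfying <R(t) R(0)> = kT * K(t).
    """
    rng = np.random.default_rng() if rng is None else rng
    v = np.empty(n_steps + 1)
    v[0] = v0
    # Draw the auxiliary variable from its equilibrium distribution N(0, kT).
    z = np.sqrt(kT) * rng.standard_normal()
    noise_amp = np.sqrt(2.0 * kT * dt / tau)
    for i in range(n_steps):
        # Update z with the current velocity, then v with the new z.
        z += -(a * v[i] + z / tau) * dt + noise_amp * rng.standard_normal()
        v[i + 1] = v[i] + a * z * dt
    return v


# Illustrative parameters (hypothetical values in reduced units).
velocities = simulate_extended_gle(v0=1.0, a=2.0, tau=1.0, kT=1.0,
                                   dt=1e-3, n_steps=5000,
                                   rng=np.random.default_rng(0))
```

Setting kT = 0 switches off the noise, and the trajectory then follows the deterministic relaxation implied by the kernel, which is a simple way to check the integrator against the known solution for an exponential kernel.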