skip to main content


Title: Machine learning for parameter auto-tuning in molecular dynamics simulations: Efficient dynamics of ions near polarizable nanoparticles
Simulating the dynamics of ions near polarizable nanoparticles (NPs) using coarse-grained models is extremely challenging due to the need to solve the Poisson equation at every simulation timestep. Recently, a molecular dynamics (MD) method based on a dynamical optimization framework bypassed this obstacle by representing the polarization charge density as virtual dynamic variables and evolving them in parallel with the physical dynamics of ions. We highlight the computational gains accessible with the integration of machine learning (ML) methods for parameter prediction in MD simulations by demonstrating how they were realized in MD simulations of ions near polarizable NPs. An artificial neural network–based regression model was integrated with MD simulation and predicted the optimal simulation timestep and optimization parameters characterizing the virtual system with 94.3% success. The ML-enabled auto-tuning of parameters generated accurate dynamics of ions for ≈ 10 million steps while improving the stability of the simulation by over an order of magnitude. The integration of ML-enhanced framework with hybrid Open Multi-Processing / Message Passing Interface (OpenMP/MPI) parallelization techniques reduced the computational time of simulating systems with thousands of ions and induced charges from thousands of hours to tens of hours, yielding a maximum speedup of ≈ 3 from ML-only acceleration and a maximum speedup of ≈ 600 from the combination of ML and parallel computing methods. Extraction of ionic structure in concentrated electrolytes near oil–water emulsions demonstrates the success of the method. The approach can be generalized to select optimal parameters in other MD applications and energy minimization problems.  more » « less
Award ID(s):
1720625
NSF-PAR ID:
10188240
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
The International Journal of High Performance Computing Applications
Volume:
34
Issue:
3
ISSN:
1094-3420
Page Range / eLocation ID:
357 to 374
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Bhatele, A. ; Hammond, J. ; Baboulin, M. ; Kruse, C. (Ed.)
    The reactive force field (ReaxFF) interatomic potential is a powerful tool for simulating the behavior of molecules in a wide range of chemical and physical systems at the atomic level. Unlike traditional classical force fields, ReaxFF employs dynamic bonding and polarizability to enable the study of reactive systems. Over the past couple decades, highly optimized parallel implementations have been developed for ReaxFF to efficiently utilize modern hardware such as multi-core processors and graphics processing units (GPUs). However, the complexity of the ReaxFF potential poses challenges in terms of portability to new architectures (AMD and Intel GPUs, RISC-V processors, etc.), and limits the ability of computational scientists to tailor its functional form to their target systems. In this regard, the convergence of cyber-infrastructure for high performance computing (HPC) and machine learning (ML) presents new opportunities for customization, programmer productivity and performance portability. In this paper, we explore the benefits and limitations of JAX, a modern ML library in Python representing a prime example of the convergence of HPC and ML software, for implementing ReaxFF. We demonstrate that by leveraging auto-differentiation, just-in-time compilation, and vectorization capabilities of JAX, one can attain a portable, performant, and easy to maintain ReaxFF software. Beyond enabling MD simulations, end-to-end differentiability of trajectories produced by ReaxFF implemented with JAX makes it possible to perform related tasks such as force field parameter optimization and meta-analysis without requiring any significant software developments. We also discuss scalability limitations using the current version of JAX for ReaxFF simulations. 
    more » « less
  2. Meier-Schellersheim, Martin (Ed.)
    We introduce a Stochastic Reaction-Diffusion-Dynamics Model (SRDDM) for simulations of cellular mechanochemical processes with high spatial and temporal resolution. The SRDDM is mapped into the CellDynaMo package, which couples the spatially inhomogeneous reaction-diffusion master equation to account for biochemical reactions and molecular transport within the Langevin Dynamics (LD) framework to describe dynamic mechanical processes. This computational infrastructure allows the simulation of hours of molecular machine dynamics in reasonable wall-clock time. We apply SRDDM to test performance of the Search-and-Capture of mitotic spindle assembly by simulating, in three spatial dimensions, dynamic instability of elastic microtubules anchored in two centrosomes, movement and deformations of geometrically realistic centromeres with flexible kinetochores and chromosome arms. Furthermore, the SRDDM describes the mechanics and kinetics of Ndc80 linkers mediating transient attachments of microtubules to the chromosomal kinetochores. The rates of these attachments and detachments depend upon phosphorylation states of the Ndc80 linkers, which are regulated in the model by explicitly accounting for the reactions of Aurora A and B kinase enzymes undergoing restricted diffusion. We find that there is an optimal rate of microtubule-kinetochore detachments which maximizes the accuracy of the chromosome connections, that adding chromosome arms to kinetochores improve the accuracy by slowing down chromosome movements, that Aurora A and kinetochore deformations have a small positive effect on the attachment accuracy, and that thermal fluctuations of the microtubules increase the rates of kinetochore capture and also improve the accuracy of spindle assembly. 
    more » « less
  3. Fluorescent light-up aptamers (FLAPs) are well-performed biosensors for cellular imaging and the detection of different targets of interest, including RNA, non-nucleic acid molecules, metal ions, and so on. They could be easily designed and emit a strong fluorescence signal once bound to specified fluorogens. Recently, one unique aptamer called Mango-II has been discovered to possess a strong affinity and excellent fluorescent properties with fluorogens TO1-Biotin and TO3-Biotin. To explore the binding mechanisms, computational simulations have been performed to obtain structural and thermodynamic information about FLAPs at atomic resolution. AMOEBA polarizable force field, with the capability of handling the highly charged and flexible RNA system, was utilized for the simulation of Mango-II with TO1-Biotin and TO3-Biotin in this work. The calculated binding free energy using published crystal structures is in excellent agreement with the experimental values. Given the challenges in modeling complex RNA dynamics, our work demonstrates that MD simulation with a polarizable force field is valuable for understanding aptamer-fluorogen binding and potentially designing new aptamers or fluorogens with better performance. 
    more » « less
  4. Abstract

    RNA dependent RNA polymerase (RdRp), is an essential in the RNA replication within the life cycle of the severely acute respiratory coronavirus-2 (SARS-CoV-2), causing the deadly respiratory induced sickness COVID-19. Remdesivir is a prodrug that has seen some success in inhibiting this enzyme, however there is still the pressing need for effective alternatives. In this study, we present the discovery of four non-nucleoside small molecules that bind favorably to SARS-CoV-2 RdRp over the active form of the popular drug remdesivir (RTP) and adenosine triphosphate (ATP) by utilizing high-throughput virtual screening (HTVS) against the vast ZINC compound database coupled with extensive molecular dynamics (MD) simulations. After post-trajectory analysis, we found that the simulations of complexes containing both ATP and RTP remained stable for the duration of their trajectories. Additionally, it was revealed that the phosphate tail of RTP was stabilized by both the positive amino acid pocket and magnesium ions near the entry channel of RdRp which includes residues K551, R553, R555 and K621. It was also found that residues D623, D760, and N691 further stabilized the ribose portion of RTP with U10 on the template RNA strand forming hydrogen pairs with the adenosine motif. Using these models of RdRp, we employed them to screen the ZINC database of ~ 17 million molecules. Using docking and drug properties scoring, we narrowed down our selection to fourteen candidates. These were subjected to 200 ns simulations each underwent free energy calculations. We identified four hit compounds from the ZINC database that have similar binding poses to RTP while possessing lower overall binding free energies, with ZINC097971592 having a binding free energy two times lower than RTP.

     
    more » « less
  5. Abstract

    Solidification phenomenon has been an integral part of the manufacturing processes of metals, where the quantification of stochastic variations and manufacturing uncertainties is critically important. Accurate molecular dynamics (MD) simulations of metal solidification and the resulting properties require excessive computational expenses for probabilistic stochastic analyses where thousands of random realizations are necessary. The adoption of inadequate model sizes and time scales in MD simulations leads to inaccuracies in each random realization, causing a large cumulative statistical error in the probabilistic results obtained through Monte Carlo (MC) simulations. In this work, we present a machine learning (ML) approach, as a data-driven surrogate to MD simulations, which only needs a few MD simulations. This efficient yet high-fidelity ML approach enables MC simulations for full-scale probabilistic characterization of solidified metal properties considering stochasticity in influencing factors like temperature and strain rate. Unlike conventional ML models, the proposed hybrid polynomial correlated function expansion here, being a Bayesian ML approach, is data efficient. Further, it can account for the effect of uncertainty in training data by exploiting mean and standard deviation of the MD simulations, which in principle addresses the issue of repeatability in stochastic simulations with low variance. Stochastic numerical results for solidified aluminum are presented here based on complete probabilistic uncertainty quantification of mechanical properties like Young’s modulus, yield strength and ultimate strength, illustrating that the proposed error-inclusive data-driven framework can reasonably predict the properties with a significant level of computational efficiency.

     
    more » « less