skip to main content


This content will become publicly available on May 10, 2024

Title: Allegro-Legato: scalable, fast, and robust neural-network quantum molecular dynamics via sharpness-aware minimization
Neural-network quantum molecular dynamics (NNQMD) simulations based on machine learning are revolutionizing atomistic simulations of materials by providing quantum-mechanical accuracy but orders-of-magnitude faster, illustrated by ACM Gordon Bell prize (2020) and finalist (2021). State-of-the-art (SOTA) NNQMD model founded on group theory featuring rotational equivari- ance and local descriptors has provided much higher accuracy and speed than those models, thus named Allegro (meaning fast). On massively parallel super- computers, however, it suffers a fidelity-scaling problem, where growing number of unphysical predictions of interatomic forces prohibits simulations involving larger numbers of atoms for longer times. Here, we solve this problem by com- bining the Allegro model with sharpness aware minimization (SAM) for enhanc- ing the robustness of model through improved smoothness of the loss landscape. The resulting Allegro-Legato (meaning fast and “smooth”) model was shown to elongate the time-to-failure tfailure, without sacrificing computational speed or accuracy. Specifically, Allegro-Legato exhibits much weaker dependence of time- to-failure on the problem size, t_failure = N^−0.14 (N is the number of atoms) compared to the SOTA Allegro model (t_failure ∝ N^−0.29), i.e., systematically delayed time-to-failure, thus allowing much larger and longer NNQMD simulations without failure. The model also exhibits excellent computational scalabil- ity and GPU acceleration on the Polaris supercomputer at Argonne Leadership Computing Facility. Such scalable, accurate, fast and robust NNQMD models will likely find broad applications in NNQMD simulations on emerging exaflop/s computers, with a specific example of accounting for nuclear quantum effects in the dynamics of ammonia to lay a foundation of the green ammonia technology for sustainability.  more » « less
Award ID(s):
2118061
NSF-PAR ID:
10423192
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ;
Date Published:
Journal Name:
ISC High Performance 2023, LNCS
Volume:
13948
Page Range / eLocation ID:
223-239
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Developing an accurate interatomic potential model is a prerequisite for achieving reliable results from classical molecular dynamics (CMD) simulations; however, most of the potentials are biased as specific simulation purposes or conditions are considered in the parameterization. For developing an unbiased potential, a finite‐temperature dynamics machine learning (FTD‐ML) approach is proposed, and its processes and feasibility are demonstrated using the Buckingham potential model and aluminum (Al) as an example. Compared with conventional machine learning approaches, FTD‐ML exhibits three distinguished features: 1) FTD‐ML intrinsically incorporates more extensive configurational and conditional space for enhancing the transferability of developed potentials; 2) FTD‐ML employs various properties calculated directly from CMD, for ML model training and prediction validation against experimental data instead of first‐principles data; 3) FTD‐ML is much more computationally cost effective than first‐principles simulations, especially when the system size increases over 103atoms as employed in this research for ensuring reliable training data. The Al Buckingham potential developed by the FTD‐ML approach exhibits good performance for general simulation purposes. Thus, the FTD‐ML approach is expected to contribute to a fast development of interatomic potential model suitable for various simulation purposes and conditions, without limitation of model type, while maintaining experimental‐level accuracy.

     
    more » « less
  2. Learning reservoir flow dynamics is of primary importance in creating robust predictive models for reservoir management including hydraulic fracturing processes. Physics-based models are to a certain extent exact, but they entail heavy computational infrastructure for simulating a wide variety of parameters and production scenarios. Reduced-order models offer computational advantages without compromising solution accuracy, especially if they can assimilate large volumes of production data without having to reconstruct the original model (data-driven models). Dynamic mode decomposition (DMD) entails the extraction of relevant spatial structure (modes) based on data (snapshots) that can be used to predict the behavior of reservoir fluid flow in porous media. In this paper, we will further enhance the application of the DMD, by introducing sparse DMD and local DMD. The former is particularly useful when there is a limited number of sparse measurements as in the case of reservoir simulation, and the latter can improve the accuracy of developed DMD models when the process dynamics show a moving boundary behavior like hydraulic fracturing. For demonstration purposes, we first show the methodology applied to (flow only) single- and two-phase reservoir models using the SPE10 benchmark. Both online and offline processes will be used for evaluation. We observe that we only require a few DMD modes, which are determined by the sparse DMD structure, to capture the behavior of the reservoir models. Then, we applied the local DMDc for creating a proxy for application in a hydraulic fracturing process. We also assessed the trade-offs between problem size and computational time for each reservoir model. The novelty of our method is the application of sparse DMD and local DMDc, which is a data-driven technique for fast and accurate simulations. 
    more » « less
  3. null (Ed.)
    To be able to compare many agricultural models, a general framework for model comparison when field data may limit direct comparison of models is proposed, developed, and also demonstrated. The framework first calibrates the benchmark model against the field data, and next it calibrates the test model against the data generated by the calibrated benchmark model. The framework is validated for the modeling of the soil nutrient nitrogen (N), a critical component in the overall agriculture system modeling effort. The nitrogen dynamics and related carbon (C) dynamics, as captured in advanced agricultural modeling such as RZWQM, are highly complex, involving numerous states (pools) and parameters. Calibrating many parameters requires more time and data to avoid underfitting. The execution time of a complex model is higher as well. A study of tradeoff among modeling complexities vs. speed-up, and the corresponding impact on modeling accuracy, is desirable. This paper surveys soil nitrogen models and lists those by their complexity in terms of the number of parameters, and C-N pools. This paper also examines a lean soil N and C dynamics model and compares it with an advanced model, RZWQM. Since nitrate and ammonia are not directly measured in this study, we first calibrate RZWQM using the available data from an experimental field in Greeley, CO, and next use the daily nitrate and ammonia data generated from RZWQM as ground truth, against which the lean model’s N dynamics parameters are calibrated. In both cases, the crop growth was removed to zero out the plant uptake, to compare only the soil N-dynamics. The comparison results showed good accuracy with a coefficient of determination (R2) match of 0.99 and 0.62 for nitrate and ammonia, respectively, while affording significant speed-up in simulation time. The lean model is also hosted in MyGeoHub cyberinfrastructure for universal online access. 
    more » « less
  4. Within the nuclear-electronic orbital (NEO) framework, the real-time NEO time-dependent density functional theory (RT-NEO-TDDFT) approach enables the simulation of coupled electronic-nuclear dynamics. In this approach, the electrons and quantum nuclei are propagated in time on the same footing. A relatively small time step is required to propagate the much faster electronic dynamics, thereby prohibiting the simulation of long-time nuclear quantum dynamics. Herein, the electronic Born–Oppenheimer (BO) approximation within the NEO framework is presented. In this approach, the electronic density is quenched to the ground state at each time step, and the real-time nuclear quantum dynamics is propagated on an instantaneous electronic ground state defined by both the classical nuclear geometry and the nonequilibrium quantum nuclear density. Because the electronic dynamics is no longer propagated, this approximation enables the use of an order-of-magnitude larger time step, thus greatly reducing the computational cost. Moreover, invoking the electronic BO approximation also fixes the unphysical asymmetric Rabi splitting observed in previous semiclassical RT-NEO-TDDFT simulations of vibrational polaritons even for small Rabi splitting, instead yielding a stable, symmetric Rabi splitting. For the intramolecular proton transfer in malonaldehyde, both RT-NEO-Ehrenfest dynamics and its BO counterpart can describe proton delocalization during the real-time nuclear quantum dynamics. Thus, the BO RT-NEO approach provides the foundation for a wide range of chemical and biological applications.

     
    more » « less
  5. For energy-assisted compression ignition (EACI) engine propulsion at high-altitude operating conditions using sustainable jet fuels with varying cetane numbers, it is essential to develop an efficient engine control system for robust and optimal operation. Control systems are typically trained using experimental data, which can be costly and time consuming to generate due to setup time of experiments, unforeseen delays/issues with manufacturing, mishaps/engine failures and the consequent repairs (which can take weeks), and errors in measurements. Computational fluid dynamics (CFD) simulations can overcome such burdens by complementing experiments with simulated data for control system training. Such simulations, however, can be computationally expensive. Existing data-driven machine learning (ML) models have shown promise for emulating the expensive CFD simulator, but encounter key limitations here due to the expensive nature of the training data and the range of differing combustion behaviors (e.g. misfires and partial/delayed ignition) observed at such broad operating conditions. We thus develop a novel physics-integrated emulator, called the Misfire-Integrated GP (MInt-GP), which integrates important auxiliary information on engine misfires within a Gaussian process surrogate model. With limited CFD training data, we show the MInt-GP model can yield reliable predictions of in-cylinder pressure evolution profiles and subsequent heat release profiles and engine CA50 predictions at a broad range of input conditions. We further demonstrate much better prediction capabilities of the MInt-GP at different combustion behaviors compared to existing data-driven ML models such as kriging and neural networks, while also observing up to 80 times computational speed-up over CFD, thus establishing its effectiveness as a tool to assist CFD for fast data generation in control system training.

     
    more » « less