Local atomic environment descriptors (LAEDs) are used in the materials science and chemistry communities, for example, for the development of machine learning interatomic potentials. Despite the fact that LAEDs have been extensively studied and benchmarked for various applications, global structure descriptors (GSDs), i.e., descriptors for entire molecules or crystal structures, have been mostly developed independently based on other approaches. Here, we propose a systematically improvable methodology for constructing a space of representations of GSDs from LAEDs by incorporating statistical information and information about chemical elements. We apply the method to construct GSDs of varying complexity for lithium thiophosphate structures that are of interest as solid electrolytes and use an information-theoretic approach to obtain an optimally compressed GSD. Finally, we report the performance of the compressed GSD for energy prediction tasks.
more »
« less
Continuous and optimally complete description of chemical environments using Spherical Bessel descriptors
Recently, machine learning potentials have been advanced as candidates to combine the high-accuracy of electronic structure methods with the speed of classical interatomic potentials. A crucial component of a machine learning potential is the description of local atomic environments by some set of descriptors. These should ideally be invariant to the symmetries of the physical system, twice-differentiable with respect to atomic positions (including when an atom leaves the environment), and complete to allow the atomic environment to be reconstructed up to symmetry. The stronger condition of optimal completeness requires that the condition for completeness be satisfied with the minimum possible number of descriptors. Evidence is provided that an updated version of the recently proposed Spherical Bessel (SB) descriptors satisfies the first two properties and a necessary condition for optimal completeness. The Smooth Overlap of Atomic Position (SOAP) descriptors and the Zernike descriptors are natural counterparts of the SB descriptors and are included for comparison. The standard construction of the SOAP descriptors is shown to not satisfy the condition for optimal completeness and, moreover, is found to be an order of magnitude slower to compute than that of the SB descriptors.
more »
« less
- Award ID(s):
- 1839370
- PAR ID:
- 10597346
- Publisher / Repository:
- American Institute of Physics
- Date Published:
- Journal Name:
- AIP Advances
- Volume:
- 10
- Issue:
- 1
- ISSN:
- 2158-3226
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract We present PyXtal_FF—a package based on Python programming language—for developing machine learning potentials (MLPs). The aim of PyXtal_FF is to promote the application of atomistic simulations through providing several choices of atom-centered descriptors and machine learning regressions in one platform. Based on the given choice of descriptors (including the atom-centered symmetry functions, embedded atom density, SO4 bispectrum, and smooth SO3 power spectrum), PyXtal_FF can train MLPs with either generalized linear regression or neural network models, by simultaneously minimizing the errors of energy/forces/stress tensors in comparison with the data fromab-initiosimulations. The trained MLP model from PyXtal_FF is interfaced with the Atomic Simulation Environment (ASE) package, which allows different types of light-weight simulations such as geometry optimization, molecular dynamics simulation, and physical properties prediction. Finally, we will illustrate the performance of PyXtal_FF by applying it to investigate several material systems, including the bulk SiO2, high entropy alloy NbMoTaW, and elemental Pt for general purposes. Full documentation of PyXtal_FF is available athttps://pyxtal-ff.readthedocs.io.more » « less
-
Abstract Interatomic potentials derived with Machine Learning algorithms such as Deep-Neural Networks (DNNs), achieve the accuracy of high-fidelity quantum mechanical (QM) methods in areas traditionally dominated by empirical force fields and allow performing massive simulations. Most DNN potentials were parametrized for neutral molecules or closed-shell ions due to architectural limitations. In this work, we propose an improved machine learning framework for simulating open-shell anions and cations. We introduce the AIMNet-NSE (Neural Spin Equilibration) architecture, which can predict molecular energies for an arbitrary combination of molecular charge and spin multiplicity with errors of about 2–3 kcal/mol and spin-charges with error errors ~0.01e for small and medium-sized organic molecules, compared to the reference QM simulations. The AIMNet-NSE model allows to fully bypass QM calculations and derive the ionization potential, electron affinity, and conceptual Density Functional Theory quantities like electronegativity, hardness, and condensed Fukui functions. We show that these descriptors, along with learned atomic representations, could be used to model chemical reactivity through an example of regioselectivity in electrophilic aromatic substitution reactions.more » « less
-
Electronic devices get smaller and smaller in every generation. In micro-/nano-electronic devices such as high electron mobility transistors, heat dissipation has become a crucial design consideration due to the ultrahigh heat flux that has a negative effect on devices' performance and their lifetime. Therefore, thermal transport performance enhancement is required to adapt to the device size reduction. β-Ga2O3 has recently gained significant scientific interest for future power devices because of its inherent material properties such as extremely wide bandgap, outstanding Baliga's figure of merit, large critical electric field, etc. This work aims to use a machine learning approach to search promising substrates or heat sinks for cooling β-Ga2O3, in terms of high interfacial thermal conductance (ITC), from large-scale potential structures taken from existing material databases. With the ITC dataset of 1633 various substrates for β-Ga2O3 calculated by full density functional theory, we trained our recently developed convolutional neural network (CNN) model that utilizes the fused orbital field matrix (OFM) and composition descriptors. Our model proved to be superior in performance to traditional machine learning algorithms such as random forest and gradient boosting. We then deployed the CNN model to predict the ITC of 32 716 structures in contact with β-Ga2O3. The CNN model predicted the top 20 cubic and noncubic substrates with ITC on the same level as density functional theory (DFT) results on β-Ga2O3/YN and β-Ga2O3/MgO interfaces, which has the highest ITC of 1224 and 1211 MW/m2K, respectively, among the DFT-ITC datasets. Phonon density of states, group velocity, and scattering effect on high heat flux transport and consequently increased ITC are also investigated. Moderate to high phonon density of states overlap, high group velocity, and low phonon scattering are required to achieve high ITC. We also found three Magpie descriptors with strong Pearson correlation with ITC, namely, mean atomic number, mean atomic weight, and mean ground state volume per atom. Calculations of such descriptors are computationally efficient, and therefore, these descriptors provide a new route for quickly screening potential substrates from large-scale material pools for high-performance interfacial thermal management of high-electron mobility transistor devices.more » « less
-
Complex-concentrated-alloys (CCAs) are of interest for a range of applications due to a host of desirable properties, including high-temperature strength and tolerance to radiation damage. Their multi-principal component nature results in a vast number of possible atomic environments with the associated variability in chemistry and structure. This atomic-level variability is central to the unique properties of these alloys but makes their modeling challenging. We combine atomistic simulations using many body potentials with machine learning to develop predictive models of various atomic properties of CrFeCoNiCu-based CCAs: relaxed vacancy formation energy, atomic-level cohesive energy, pressure, and volume. A fingerprint of the local atomic environments is obtained combining invariants associated with the local atomic geometry and periodic-table information of the atoms involved. Importantly, all descriptors are based on the unrelaxed atomic structure; thus, they are computationally inexpensive to compute. This enables the incorporation of these models into macroscopic simulations. The models show good accuracy and we explore their ability to extrapolate to compositions and elements not used during training.more » « less
An official website of the United States government
