skip to main content

Attention:

The NSF Public Access Repository (NSF-PAR) system and access will be unavailable from 11:00 PM ET on Friday, September 13 until 2:00 AM ET on Saturday, September 14 due to maintenance. We apologize for the inconvenience.


Title: Making thermodynamic models of mixtures predictive by machine learning: matrix completion of pair interactions
Predictive models of thermodynamic properties of mixtures are paramount in chemical engineering and chemistry. Classical thermodynamic models are successful in generalizing over (continuous) conditions like temperature and concentration. On the other hand, matrix completion methods (MCMs) from machine learning successfully generalize over (discrete) binary systems; these MCMs can make predictions without any data for a given binary system by implicitly learning commonalities across systems. In the present work, we combine the strengths from both worlds in a hybrid approach. The underlying idea is to predict the pair-interaction energies , as they are used in basically all physical models of liquid mixtures, by an MCM. As an example, we embed an MCM into UNIQUAC, a widely-used physical model for the Gibbs excess energy. We train the resulting hybrid model in a Bayesian machine-learning framework on experimental data for activity coefficients in binary systems of 1146 components from the Dortmund Data Bank. We thereby obtain, for the first time, a complete set of UNIQUAC parameters for all binary systems of these components, which allows us to predict, in principle, activity coefficients at arbitrary temperature and composition for any combination of these components, not only for binary but also for multicomponent systems. The hybrid model even outperforms the best available physical model for predicting activity coefficients, the modified UNIFAC (Dortmund) model.  more » « less
Award ID(s):
2047418 2007719 2003237 1928718
NSF-PAR ID:
10329936
Author(s) / Creator(s):
; ; ; ; ; ;
Date Published:
Journal Name:
Chemical Science
Volume:
13
Issue:
17
ISSN:
2041-6520
Page Range / eLocation ID:
4854 to 4862
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    We report virial coefficients up to sixth order in density forN2,O2,NH3, andCO2, covering temperatures from 50 to 1,000 K. The reported values include coefficients and their first three temperature derivatives, for the pure species as well as all of those needed to evaluate full composition dependence of mixtures formed from any or all of these compounds. The values are obtained by calculation of appropriate cluster integrals using Mayer sampling Monte Carlo, with intermolecular interactions described by the Transferable Potential for Phase Equilibria (TraPPE) force field. All coefficients are fit as a function of temperature, yielding a thermodynamic model with analytic dependence on temperature, density, and composition. The coefficients and properties computed from them are compared to experimental data where available.

     
    more » « less
  2. For the development and optimization of molten salt reactors, nuclear fuel cycles, and energy storage materials, the temperature-dependent molten salt properties must be known over a wide range of possible compositions. Yet, for several relevant chloride and fluoride salt systems, significant gaps in data still exist, which inhibit the development of key advanced energy technologies. [1] Filling all of these gaps with high-temperature experiments is inherently challenging, especially due to the corrosive, volatile, and hazardous nature of these salts. Meanwhile, carefully validated atomistic simulations (ab initio, classical or machine learning-based) are capable of predicting thermophysical properties but are highly computationally expensive [2-5], limiting our ability to screen over large temperature-compositional spaces. In this work, we propose to circumvent these limitations by using supervised machine learning (ML) models to learn from existing bulk density data and predict densities of new and unseen mixtures. 
    more » « less
  3. null (Ed.)
    We present a generic way to hybridize physical and data-driven methods for predicting physicochemical properties. The approach ‘distills’ the physical method's predictions into a prior model and combines it with sparse experimental data using Bayesian inference. We apply the new approach to predict activity coefficients at infinite dilution and obtain significant improvements compared to the physical and data-driven baselines and established ensemble methods from the machine learning literature. 
    more » « less
  4. Secondary organic aerosols contribute a large fraction to atmospheric aerosols. The phase states of secondary organic aerosols influence heterogeneous and multiphase chemistry in the atmosphere and thus climate. In previous studies we have used the dual tandem differential mobility analyzer technique to characterize the temperature- and humidity-dependent viscosity and glass transition temperature of suspended particles. However, the technique requires high particle number concentrations, is a complex setup, is expensive, and measurements are time consuming. Here we demonstrate a new simplified and more cost-effective method to obtain similar data. The technique was used to measure the temperature where the viscosity is ∼107 Pa s for submicron particles composed of binary and ternary mixtures of the sucrose/tartaric acid/citric acid system. Sucrose, tartaric acid and citric acid are taken as proxies for viscous organic aerosol components in the atmosphere. A subset of data were compared to measurements with the dual-tandem differential mobility analyzer method. Results show good agreement between the two techniques. The same mixed chemical systems were modeled using an updated version of the parametric phase diagram model described in Kasparoglu et al. (2021, https://doi.org/10.5194/acp-21-1127-2021) as well as the predictions with the viscosity module of the Aerosol Inorganic–Organic Mixtures Functional groups Activity Coefficients model (AIOMFAC-VISC). Results show that appropriately parameterized mixing rules are suitable to describe these mixtures. We anticipate that the new technique will accelerate discovery of aerosol phase transitions in aerosol research. 
    more » « less
  5. Fugacity is a fundamental thermodynamical property of gas and gas mixtures to determine their behavior and dynamics in complex systems. Fugacity can be deduced experimentally from the measurements of volume as a function of pressure at constant temperature or calculated iteratively using analytical equations of states (EOS). Experimental measurement is time-consuming, and analytical models based on EOS are computationally demanding, especially when an approximate but quick estimation is desired. In this work, machine learning (ML) is employed as a viable alternative to analytical EOSs for quick and accurate approximation of CO2 fugacity coefficients. Five different ML algorithms are used to estimate the fugacity coefficients of pure CO2 as a function of pressure (≤ 2000 bar) and temperature (≤ 1000 °C). A combination of experimental and pseudo-experimental (obtained from an analytical EOS) data of CO2 fugacity coefficients is used to train, validate, and test the models. The best results were found using the Extreme Gradient Boosting algorithm, which showed a mean square error of only 0.0002 in the validation data and an average deviation of only 1.3% in the test data (pure prediction). To quantify the effectiveness of the machine learning techniques, results from the best-performing model are compared with two state-of-the-art analytical models. The ML model with significantly less computational complexity showed similar accuracy to the analytical models. The estimated fugacity data are then used to compute the CO2 solubility in aqueous NaCl solution of different concentrations, and a maximum deviation of only 3.2% from the experimental data is observed. 
    more » « less