skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Making thermodynamic models of mixtures predictive by machine learning: matrix completion of pair interactions
Predictive models of thermodynamic properties of mixtures are paramount in chemical engineering and chemistry. Classical thermodynamic models are successful in generalizing over (continuous) conditions like temperature and concentration. On the other hand, matrix completion methods (MCMs) from machine learning successfully generalize over (discrete) binary systems; these MCMs can make predictions without any data for a given binary system by implicitly learning commonalities across systems. In the present work, we combine the strengths from both worlds in a hybrid approach. The underlying idea is to predict the pair-interaction energies , as they are used in basically all physical models of liquid mixtures, by an MCM. As an example, we embed an MCM into UNIQUAC, a widely-used physical model for the Gibbs excess energy. We train the resulting hybrid model in a Bayesian machine-learning framework on experimental data for activity coefficients in binary systems of 1146 components from the Dortmund Data Bank. We thereby obtain, for the first time, a complete set of UNIQUAC parameters for all binary systems of these components, which allows us to predict, in principle, activity coefficients at arbitrary temperature and composition for any combination of these components, not only for binary but also for multicomponent systems. The hybrid model even outperforms the best available physical model for predicting activity coefficients, the modified UNIFAC (Dortmund) model.  more » « less
Award ID(s):
2047418 2007719 2003237 1928718
PAR ID:
10329936
Author(s) / Creator(s):
; ; ; ; ; ;
Date Published:
Journal Name:
Chemical Science
Volume:
13
Issue:
17
ISSN:
2041-6520
Page Range / eLocation ID:
4854 to 4862
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Secondary organic aerosols contribute a large fraction to atmospheric aerosols. The phase states of secondary organic aerosols influence heterogeneous and multiphase chemistry in the atmosphere and thus climate. In previous studies we have used the dual tandem differential mobility analyzer technique to characterize the temperature- and humidity-dependent viscosity and glass transition temperature of suspended particles. However, the technique requires high particle number concentrations, is a complex setup, is expensive, and measurements are time consuming. Here we demonstrate a new simplified and more cost-effective method to obtain similar data. The technique was used to measure the temperature where the viscosity is ∼107 Pa s for submicron particles composed of binary and ternary mixtures of the sucrose/tartaric acid/citric acid system. Sucrose, tartaric acid and citric acid are taken as proxies for viscous organic aerosol components in the atmosphere. A subset of data were compared to measurements with the dual-tandem differential mobility analyzer method. Results show good agreement between the two techniques. The same mixed chemical systems were modeled using an updated version of the parametric phase diagram model described in Kasparoglu et al. (2021, https://doi.org/10.5194/acp-21-1127-2021) as well as the predictions with the viscosity module of the Aerosol Inorganic–Organic Mixtures Functional groups Activity Coefficients model (AIOMFAC-VISC). Results show that appropriately parameterized mixing rules are suitable to describe these mixtures. We anticipate that the new technique will accelerate discovery of aerosol phase transitions in aerosol research. 
    more » « less
  2. A comprehensive set of single-component and binary isotherms were collected for ethanol/water adsorption into the siliceous forms of 185 known zeolites using grand-canonical Monte Carlo simulations. Using these data, a systematic analysis of ideal/real adsorbed-solution theory (IAST/RAST) was conducted and activity coefficients were derived for ethanol/water mixtures adsorbed in different zeolites based on RAST. It was found that activity coefficients of ethanol are close to unity while activity coefficients of water are larger in most zeolites, indicating a positive excess free energy of the mixture. This observation can be attributed to water/ethanol interactions being less favorable than water/water interactions in the single-component adsorption of water at comparable loadings. The deviation from ideal behavior can be highly structure-dependent but no clear correlation with pore diameters was identified. Our analysis also demonstrates the following: (1) accurate unary isotherms in the low-loading regime are critical for obtaining physically sensible activity coefficients; (2) the global regression scheme to solve for activity model parameters performs better than fitting activity models to activity coefficients calculated locally at each binary state point; and (3) including the dependence on adsorption potential offers only a minor benefit for describing binary adsorption at the lowest fugacities. Finally, the Margules activity model was found incapable of capturing the non-ideal adsorption behavior over the entire range of fugacities and compositions in all zeolites, but for conditions typical of solution-phase adsorption, RAST predictions using zeolite-specific or even bulk Margules parameters provide an improved description compared to IAST. 
    more » « less
  3. For the development and optimization of molten salt reactors, nuclear fuel cycles, and energy storage materials, the temperature-dependent molten salt properties must be known over a wide range of possible compositions. Yet, for several relevant chloride and fluoride salt systems, significant gaps in data still exist, which inhibit the development of key advanced energy technologies. [1] Filling all of these gaps with high-temperature experiments is inherently challenging, especially due to the corrosive, volatile, and hazardous nature of these salts. Meanwhile, carefully validated atomistic simulations (ab initio, classical or machine learning-based) are capable of predicting thermophysical properties but are highly computationally expensive [2-5], limiting our ability to screen over large temperature-compositional spaces. In this work, we propose to circumvent these limitations by using supervised machine learning (ML) models to learn from existing bulk density data and predict densities of new and unseen mixtures. 
    more » « less
  4. null (Ed.)
    We present a generic way to hybridize physical and data-driven methods for predicting physicochemical properties. The approach ‘distills’ the physical method's predictions into a prior model and combines it with sparse experimental data using Bayesian inference. We apply the new approach to predict activity coefficients at infinite dilution and obtain significant improvements compared to the physical and data-driven baselines and established ensemble methods from the machine learning literature. 
    more » « less
  5. Midcircuit measurements (MCMs) are crucial ingredients in the development of fault-tolerant quantum computation. While there have been rapid experimental progresses in realizing MCMs, a systematic method for characterizing noisy MCMs is still under exploration. In this work, we develop a cycle benchmarking (CB)-type algorithm to characterize noisy MCMs. The key idea is to use a joint Fourier transform on the classical and quantum registers and then estimate parameters in the Fourier space, analogous to Pauli fidelities used in CB-type algorithms for characterizing the Pauli-noise channel of Clifford gates. Furthermore, we develop a theory of the noise learnability of MCMs, which determines what information can be learned about the noise model (in the presence of state preparation and terminating measurement noise) and what cannot, which shows that all learnable information can be learned using our algorithm. As an application, we show how to use the learned information to test the independence between measurement noise and state-preparation noise in an MCM. Finally, we conduct numerical simulations to illustrate the practical applicability of the algorithm. Similar to other CB-type algorithms, we expect the algorithm to provide a useful toolkit that is of experimental interest. Published by the American Physical Society2025 
    more » « less