skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

Attention:

The NSF Public Access Repository (PAR) system and access will be unavailable from 11:00 PM ET on Thursday, June 12 until 2:00 AM ET on Friday, June 13 due to maintenance. We apologize for the inconvenience.


This content will become publicly available on September 23, 2025

Title: Accelerating Multicomponent Phase-Coexistence Calculations with Physics-informed Neural Networks
Phase separation in multicomponent mixtures is of significant interest in both fundamental research and technology. Although the thermodynamic principles governing phase equilibria are straightforward, practical determination of equilibrium phases and constituent compositions for multicomponent systems is often laborious and computationally intensive. Here, we present a machine-learning workflow that simplifies and accelerates phase-coexistence calculations. We specifically analyze capabilities of neural networks to predict the number, composition, and relative abundance of equilibrium phases of systems described by Flory-Huggins theory. We find that incorporating physics-informed material constraints into the neural network architecture enhances the prediction of equilibrium compositions compared to standard neural networks with minor errors along the boundaries of the stable region. However, introducing additional physics-informed losses does not lead to significant further improvement. These errors can be virtually eliminated by using machine-learning predictions as a warm-start for a subsequent optimization routine. This work provides a promising pathway to efficiently characterize multicomponent phase coexistence.  more » « less
Award ID(s):
2118861 2237470
PAR ID:
10544578
Author(s) / Creator(s):
; ;
Publisher / Repository:
ChemRxiv
Date Published:
Format(s):
Medium: X
Institution:
Princeton University
Sponsoring Org:
National Science Foundation
More Like this
  1. Phase separation in multicomponent mixtures is of significant interest in both fundamental research and technology. Although the thermodynamic principles governing phase equilibria are straightforward, practical determination of equilibrium phases and constituent compositions for multicomponent systems is often laborious and computationally intensive. Here, we present a machine-learning workflow that simplifies and accelerates phase-coexistence calculations. We specifically analyze capabilities of neural networks to predict the number, composition, and relative abundance of equilibrium phases of systems described by Flory-Huggins theory. We find that incorporating physics-informed material constraints into the neural network architecture enhances the prediction of equilibrium compositions compared to standard neural networks with minor errors along the boundaries of the stable region. However, introducing additional physics-informed losses does not lead to significant further improvement. These errors can be virtually eliminated by using machine-learning predictions as a warm-start for a subsequent optimization routine. This work provides a promising pathway to efficiently characterize multicomponent phase coexistence. 
    more » « less
  2. This dataset holds 1036 ternary phase diagrams and how points on the diagram phase separate if they do. The data is provided as a serialized object using the `pickle' Python module. The data was compiled using Python version 3.8.  ReferencesThe specific applications and analyses of the data are described in 1.  Dhamankar, S.; Jiang, S.; Webb, M.A. "Accelerating Multicomponent Phase-Coexistence Calculations with Physics-informed Neural Networks" UsageTo access the data in the .pickle file, users can execute the following: # LOAD SIMULATION DATADATA_DIR = "your/custom/dir/" filename = os.path.join(DATA_DIR, f"data_clean.pickle")with open(filename, "rb") as handle:    (x, y_c, y_r, phase_idx, num_phase, max_phase) = pickle.load(handle) x: Input x = (χ_AB, χ_BC, χ_AC, v_A, v_B, v_C, φ_A, φ_B) ∈ ℝ^8. y_c: Output one-hot encoded classification vector y_c ∈ ℝ^3. y_r: Output equilibrium composition and abundance vector y_r = (φ_A^α, φ_B^α, φ_A^β, φ_B^β, φ_A^γ, φ_B^γ, w^α, w^β, w^γ) ∈ ℝ^9. phase_idx: A single integer indicating which unique phase system it belongs to. num_phase: A single integer indicates the number of equilibrium phases the input splits into. max_phase: A single integer indicates the maximum number of equilibrium phases the system splits into. Help, Suggestions, Corrections?If you need help, have suggestions, identify issues, or have corrections, please send your comments to Shengli Jiang at sj0161@princeton.edu GitHubAdditional data and code relevant for this study is additionally accessible at hthttps://github.com/webbtheosim/ml-ternary-phase 
    more » « less
  3. The Flory–Huggins theory describes the phase separation of solutions containing polymers. Although it finds widespread application from polymer physics to materials science to biology, the concentrations that coexist in separate phases at equilibrium have not been determined analytically, and numerical techniques are required that restrict the theory’s ease of application. In this work, we derive an implicit analytical solution to the Flory–Huggins theory of one polymer in a solvent by applying a procedure that we call the implicit substitution method. While the solutions are implicit and in the form of composite variables, they can be mapped explicitly to a phase diagram in composition space. We apply the same formalism to multicomponent polymeric systems, where we find analytical solutions for polydisperse mixtures of polymers of one type. Finally, while complete analytical solutions are not possible for arbitrary mixtures, we propose computationally efficient strategies to map out coexistence curves for systems with many components of different polymer types. 
    more » « less
  4. Seki, Shinnosuke; Stewart, Jaimie Marie (Ed.)
    Life is chemical intelligence. What is the source of intelligent behavior in molecular systems? Here we illustrate how, in contrast to the common belief that energy use in non-equilibrium reactions is essential, the detailed balance equilibrium properties of multicomponent liquid interactions are sufficient for sophisticated information processing. Our approach derives from the classical Boltzmann machine model for probabilistic neural networks, inheriting key principles such as representing probability distributions via quadratic energy functions, clamping input variables to infer conditional probability distributions, accommodating omnidirectional computation, and learning energy parameters via a wake phase / sleep phase algorithm that performs gradient descent on the relative entropy with respect to the target distribution. While the cubic lattice model of multicomponent liquids is standard, the behaviors exhibited by the trained molecules capture both previously-observed phenomena such as core-shell condensate architectures as well as novel phenomena such as an analog of Hopfield associative memories that perform recall by contact with a patterned surface. Our final example demonstrates equilibrium classification of MNIST digits. Experimental implementation using DNA nanostar liquids is conceptually straightforward. 
    more » « less
  5. This study employs physics-informed neural networks (PINNs) to reconstruct multiple flow fields in a transient natural convection system solely based on instantaneous temperature data at an arbitrary moment. Transient convection problems present reconstruction challenges due to the temporal variability of fields across different flow phases. In general, large reconstruction errors are observed during the incipient phase, while the quasi-steady phase exhibits relatively smaller errors, reduced by a factor of 2–4. We hypothesize that reconstruction errors vary across different flow phases due to the changing solution space of a PINN, inferred from the temporal gradients of the fields. Furthermore, we find that reconstruction errors tend to accumulate in regions where the spatial gradients are smaller than the order of 10−6, likely due to the vanishing gradient phenomenon. In convection phenomena, field variations often manifest across multiple scales in space. However, PINN-based reconstruction tends to preserve larger-scale variations, while smaller-scale variations become less pronounced due to the vanishing gradient problem. To mitigate the errors associated with vanishing gradients, we introduce a multi-scale approach that determines scaling constants for the PINN inputs and reformulates inputs across multiple scales. This approach improves the maximum and mean errors by 72.2% and 6.4%, respectively. Our research provides insight into the behavior of PINNs when applied to transient convection problems with large solution space and field variations across multiple scales. 
    more » « less