Enhanced thermophysical property prediction with uncertainty quantification using group contribution-Gaussian process regression

Agbodekhe, Barnabas P (ORCID: 0000-0001-6293-4296); Carlozo, Montana N (ORCID: 0000-0003-0979-5703); Abranches, Dinis O (ORCID: 0000-0003-0097-2072); Jones, Kyla D (ORCID: 0000-0003-0314-1477); Dowling, Alexander W (ORCID: 0000-0001-9383-7499); Maginn, Edward J (ORCID: 0000-0002-6309-1347)

doi:10.1039/D5ME00126A

This content will become publicly available on January 1, 2027

Group contribution (GC) models are powerful, simple, and popular methods for property prediction. However, the most accessible and computationally efficient GC methods, like the Joback and Reid (JR) GC models, often exhibit severe systematic bias. Furthermore, most GC methods do not have uncertainty estimates associated with their predictions. The present work develops a hybrid method for property prediction that integrates GC models with Gaussian process (GP) regression. Predictions from the JR GC method, along with the molecular weight, are used as input features to the GP models, which learn and correct the systematic biases in the GC predictions, resulting in highly accurate property predictions with reliable uncertainty estimates. The method was applied to six properties: normal boiling temperature (Tb), enthalpy of vaporization at Tb (ΔHvap), normal melting temperature (Tm), critical pressure (Pc), critical molar volume (Vc), and critical temperature (Tc). The CRC Handbook of Chemistry and Physics was used as the primary source of experimental data. The final collected experimental data ranged from 485 molecules for ΔHvap to 5640 for Tm. The proposed GCGP method significantly improved property prediction accuracy compared to the GC-only method. The coefficient of determination (R²) values of the testing set predictions are ≥0.85 for five out of six and ≥0.90 for four out of six properties modeled, and compare favorably with other methods in the literature. Tm was used to demonstrate one way the GCGP method can be tuned for even better predictive accuracy. The GCGP method provides reliable uncertainty estimates and computational efficiency for making new predictions. The GCGP method proved robust to variations in GP model architecture and kernel choice.
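
The hybrid idea in the abstract (feed a GC prediction and the molecular weight into a GP that learns the GC bias and reports an uncertainty) can be illustrated with a minimal sketch. This is not the authors' implementation. It uses only the Python standard library and synthetic data, and everything specific in it is an assumption made for illustration: the squared-exponential kernel, the fixed hyperparameters and noise level, the form of the synthetic bias, and the helper names `make_molecule`, `gp_fit`, and `gp_predict`.

```python
import math
import random

def rbf(a, b, length=1.0):
    """Squared-exponential kernel with unit signal variance (an assumed choice)."""
    d2 = sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    return math.exp(-0.5 * d2 / length ** 2)

def cholesky(A):
    """Lower-triangular L with L L^T = A, for a symmetric positive-definite A."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = A[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L

def solve_lower(L, b):
    """Forward substitution: solve L y = b."""
    y = []
    for i, bi in enumerate(b):
        y.append((bi - sum(L[i][k] * y[k] for k in range(i))) / L[i][i])
    return y

def solve_lower_transposed(L, y):
    """Back substitution: solve L^T x = y."""
    n = len(y)
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x

NOISE = 1e-2  # assumed observation-noise variance, in standardized target units

def gp_fit(X, y):
    """Factorize K + NOISE*I and precompute alpha = (K + NOISE*I)^-1 y."""
    K = [[rbf(a, b) + (NOISE if i == j else 0.0) for j, b in enumerate(X)]
         for i, a in enumerate(X)]
    L = cholesky(K)
    return L, solve_lower_transposed(L, solve_lower(L, y))

def gp_predict(X, L, alpha, x_new):
    """Posterior mean and standard deviation for one noisy observation at x_new."""
    k = [rbf(a, x_new) for a in X]
    mean = sum(ki * ai for ki, ai in zip(k, alpha))
    v = solve_lower(L, k)
    var = rbf(x_new, x_new) + NOISE - sum(vi * vi for vi in v)
    return mean, math.sqrt(max(var, 0.0))

def make_molecule(rng):
    """Synthetic stand-in for one molecule; none of these numbers come from the paper."""
    mw = rng.uniform(50.0, 300.0)                       # molecular weight, g/mol
    tb_true = 250.0 + 1.1 * mw + rng.gauss(0.0, 20.0)   # "experimental" Tb, K
    # GC-style estimate: tracks the true value but overpredicts for large molecules.
    tb_gc = tb_true + 0.0008 * mw ** 2 + rng.gauss(0.0, 3.0)
    return mw, tb_gc, tb_true

def mean_sd(values):
    m = sum(values) / len(values)
    return m, math.sqrt(sum((v - m) ** 2 for v in values) / len(values))

rng = random.Random(0)
train = [make_molecule(rng) for _ in range(80)]
test = [make_molecule(rng) for _ in range(40)]

# Standardize the two features and the target using training-set statistics only.
mw_m, mw_s = mean_sd([m[0] for m in train])
gc_m, gc_s = mean_sd([m[1] for m in train])
y_m, y_s = mean_sd([m[2] for m in train])

def features(mol):
    """GP inputs: standardized GC prediction and standardized molecular weight."""
    return [(mol[1] - gc_m) / gc_s, (mol[0] - mw_m) / mw_s]

X_train = [features(m) for m in train]
y_train = [(m[2] - y_m) / y_s for m in train]
L, alpha = gp_fit(X_train, y_train)

# Predict on held-out molecules and convert back to kelvin.
preds = []
for mol in test:
    mu, sd = gp_predict(X_train, L, alpha, features(mol))
    preds.append((mu * y_s + y_m, sd * y_s))

def rmse(pairs):
    return math.sqrt(sum((p - t) ** 2 for p, t in pairs) / len(pairs))

rmse_gc = rmse([(m[1], m[2]) for m in test])
rmse_gcgp = rmse([(p[0], m[2]) for p, m in zip(preds, test)])
print(f"GC-only RMSE: {rmse_gc:.1f} K, GC+GP RMSE: {rmse_gcgp:.1f} K")
```

Each entry of `preds` pairs a corrected prediction with a standard deviation, which is the kind of per-prediction uncertainty a GC-only model lacks. In practice the kernel hyperparameters would be fitted to data (for example by maximizing the marginal likelihood) rather than fixed as they are here.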

Award ID(s): 2330175

PAR ID: 10654702

Author(s) / Creator(s): Agbodekhe, Barnabas P; Carlozo, Montana N; Abranches, Dinis O; Jones, Kyla D; Dowling, Alexander W; Maginn, Edward J

Publisher / Repository: Royal Society of Chemistry

Date Published: 2026-01-01

Journal Name: Molecular Systems Design & Engineering

ISSN: 2058-9689

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Journal Article:
https://doi.org/10.1039/D5ME00126A
