skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Active Learning for Adsorption Simulations: Evaluation, Criteria Analysis, and Recommendations for Metal–Organic Frameworks
High-throughput molecular simulations and machine learning (ML) have been implemented to adequately screen a large number of metal−organic frameworks (MOFs) for applications involving adsorption. Grand canonical Monte Carlo (GCMC) simulations have proven effective in calculating the adsorption capacity at given pressures and temperatures, but they can require expensive computational resources. While they can be resource-efficient, ML models can require large datasets, creating a need for algorithms that can efficiently characterize adsorption; active learning (AL) can play a very important role in this regard. In this work, we make use of Gaussian process regression (GPR) to model pure component adsorption of nitrogen at 77 K from 10−5 to 1 bar, methane at 298 K from 10 −5 to 100 bar, carbon dioxide at 298 K from 10−5 to 100 bar, and hydrogen at 77 K from 10−5 to 100 bar on PCN-61, MgMOF-74, DUT-32, DUT-49, MOF-177, NU-800, UiO-66, ZIF-8, IRMOF-1, IRMOF-10, and IRMOF-16. The GPR model requires an initial training of the model with an initial dataset, the prior one, and, in this study of evaluating AL, we make use of three different prior selection schemes. Each prior scheme is updated with a sampling point resulting from the GP model uncertainties. This protocol continues until a maximum GPR relative error of 2% is attained. We make a recommendation on the best prior selection scheme for the total 44 adsorbate−adsorbent pairs primarily making use of the mean absolute error and the total amount of points required for convergence of the model. To further evaluate the AL framework, we apply the BET consistency criteria on the simulated and GP nitrogen isotherms and compare the resulting surface areas.  more » « less
Award ID(s):
2143346
PAR ID:
10490799
Author(s) / Creator(s):
; ;
Publisher / Repository:
American Chemical Society
Date Published:
Journal Name:
Industrial & Engineering Chemistry Research
Volume:
62
Issue:
33
ISSN:
0888-5885
Page Range / eLocation ID:
13009 to 13024
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The large number of possible structures of metal–organic frameworks (MOFs) and their limitless potential applications have motivated molecular modelers and researchers to develop methods and models to efficiently assess MOF performance. Some of the techniques include large-scale high-throughput molecular simulations and machine learning models. Despite those advances, the number of possible materials and the potential conditions that could be used still pose a formidable challenge for model development requiring large data sets. Therefore, there is a clear need for algorithms that can efficiently explore the spaces while balancing the number of simulations with prediction accuracy. Here, we present how active learning can sequentially select simulation conditions for gas adsorption, ultimately resulting in accurate adsorption predictions with an order of magnitude lower number of simulations. We model adsorption of pure components methane and carbon dioxide in Cu–BTC. We employ Gaussian process regression (GPR) and use the resulting uncertainties in the predictions to guide the next sampling point for molecular simulation. We outline the procedure and demonstrate how this model can emulate adsorption isotherms at 300 K from 10 −6 to 300 bar (methane)/100 bar (carbon dioxide). We also show how this procedure can be used for predicting adsorption on a temperature–pressure phase space for a temperature range of 100 to 300 K, and pressure range of 10 −6 to 300 bar (methane)/100 bar (carbon dioxide). 
    more » « less
  2. null (Ed.)
    A huge challenge facing scientists is the development of adsorbent materials that exhibit ultrahigh porosity but maintain balance between gravimetric and volumetric surface areas for the onboard storage of hydrogen and methane gas—alternatives to conventional fossil fuels. Here we report the simulation-motivated synthesis of ultraporous metal–organic frameworks (MOFs) based on metal trinuclear clusters, namely, NU-1501-M (M = Al or Fe). Relative to other ultraporous MOFs, NU-1501-Al exhibits concurrently a high gravimetric Brunauer−Emmett−Teller (BET) area of 7310 m 2 g −1 and a volumetric BET area of 2060 m 2 cm −3 while satisfying the four BET consistency criteria. The high porosity and surface area of this MOF yielded impressive gravimetric and volumetric storage performances for hydrogen and methane: NU-1501-Al surpasses the gravimetric methane storage U.S. Department of Energy target (0.5 g g −1 ) with an uptake of 0.66 g g −1 [262 cm 3 (standard temperature and pressure, STP) cm −3 ] at 100 bar/270 K and a 5- to 100-bar working capacity of 0.60 g g −1 [238 cm 3 (STP) cm −3 ] at 270 K; it also shows one of the best deliverable hydrogen capacities (14.0 weight %, 46.2 g liter −1 ) under a combined temperature and pressure swing (77 K/100 bar → 160 K/5 bar). 
    more » « less
  3. Abstract The application of machine learning (ML) techniques in materials science has revolutionized the pace and scope of materials research and design. In the case of metal–organic frameworks (MOFs), a promising class of materials due to their tunable properties and versatile applications in gas adsorption and separation, ML has helped survey the vast material space. This study explores the integration of reinforcement learning (RL), specifically Q‐learning, within an active learning (AL) context, combined with Gaussian processes (GPs) for predictive modeling of adsorption in MOFs. We demonstrate the effectiveness of the RL‐driven framework in guiding the selection of training data points and optimizing predictive model performance for methane and carbon dioxide adsorption, using two different reward metrics. Our results highlight the integration of RL as an AL method for adsorption predictions in MFs, and how it compares to a previously implemented AL scheme. 
    more » « less
  4. Abstract The para spin isomer of hydrogen gas possesses high nuclear spin order that can enhance the NMR signals of a variety of molecular species. Hydrogen is routinely enriched in the para spin state by lowering the gas temperature while flowing through a catalyst. Although parahydrogen enrichments approaching 100% are achievable near the H2liquefaction temperature of 20 K, many experimentalists operate at liquid nitrogen temperatures (77 K) due to the lower associated costs and overall simplicity of the parahydrogen generator. Parahydrogen that is generated at 77 K provides an enrichment value of ~51% of the para spin isomer; while useful, there are many applications that can benefit from low‐cost access to higher parahydrogen enrichments. Here, we introduce a method of improving parahydrogen enrichment values using a liquid nitrogen‐cooled generator that operates at temperatures less than 77 K. The boiling temperature of liquid nitrogen is lowered through internal evaporation into helium gas bubbles that are injected into the liquid. Changes to liquid nitrogen temperatures and parahydrogen enrichment values were monitored as a function of helium gas flow rate. The injected helium bubbles lowered the liquid nitrogen temperature to ~65.5 K, and parahydrogen enrichments of up to ~59% were achieved; this represents an ~16% improvement compared with the expected parahydrogen fraction at 77 K. This technique is simple to implement in standard liquid nitrogen‐cooled parahydrogen generators and may be of interest to a wide range of scientists that require a cost‐effective approach to improving parahydrogen enrichment values. 
    more » « less
  5. In recent decades, metal–organic frameworks (MOFs) have gained recognition for their potential in multicomponent gas separations. Though molecular simulations have revealed structure–property relationships of MOF–adsorbate systems, they can be computationally expensive and there is a need for surrogate models that can predict the adsorption data faster. In this work, an active learning (AL) protocol is introduced that can predict multicomponent gas adsorption in a MOF for a range of thermodynamic conditions. This methodology is applied to build a model for the adsorption of three different gas mixtures (CO2–CH4, Xe–Kr, and H2S–CO2) in the MOF Cu-BTC. A Gaussian process regression (GPR) model is used to fit the data as well to leverage its predicted uncertainty to drive the learning. The training data is generated using grand-canonical Monte Carlo (GCMC) simulations as points are iteratively added to the model to minimize the predicted uncertainty. Also, a criteria which captures the perceived performance of the GPs is introduced to terminate the AL process when the perceived accuracy threshold is met. The three systems are tested for a pressure–mole fraction (P–X), and a pressure–mole fraction–temperature (P–X–T) feature space. It is demonstrated that AL one only needs a fraction of the data from simulations to build a reliable surrogate model for predicting mixture adsorption. Further, the final GP fit from AL outperforms ideal adsorbed solution theory predictions. 
    more » « less