skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Metal–organic framework clustering through the lens of transfer learning
Metal–organic frameworks (MOFs) are promising materials with various applications, and machine learning (ML) techniques can enable their design and understanding of structure–property relationships. In this paper, we use machine learning (ML) to cluster the MOFs using two different approaches. For the first set of clusters, we decompose the data using the textural properties and cluster the resulting components. We separately cluster the MOF space with respect to their topology. The feature data from each of the clusters were then fed into separate neural networks (NNs) for direct learning on an adsorption task (methane or hydrogen). The resulting NNs were then used in transfer learning (TL) where only the last NN layer was retrained. The results show significant differences in TL performance based on which cluster is chosen for direct learning. We find TL performance depends on the Euclidean distance in the decomposed feature space between the clusters involved in the direct and TL. Similar results were found when TL was performed simultaneously across both types of clusters and adsorption tasks. We note that methane adsorption was a better source task than hydrogen adsorption. Overall, the approach was able to identify MOFs with the most transferable information, leading to valuable insights and a more comprehensive understanding of the MOF landscape. This highlights the method's potential to generate a deeper understanding of complex systems and provides an opportunity for its application in alternative datasets.  more » « less
Award ID(s):
2143346
PAR ID:
10490633
Author(s) / Creator(s):
;
Publisher / Repository:
Royal Society of Chemistry
Date Published:
Journal Name:
Molecular Systems Design & Engineering
Volume:
8
Issue:
8
ISSN:
2058-9689
Page Range / eLocation ID:
1049 to 1059
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The large number of possible structures of metal–organic frameworks (MOFs) and their limitless potential applications have motivated molecular modelers and researchers to develop methods and models to efficiently assess MOF performance. Some of the techniques include large-scale high-throughput molecular simulations and machine learning models. Despite those advances, the number of possible materials and the potential conditions that could be used still pose a formidable challenge for model development requiring large data sets. Therefore, there is a clear need for algorithms that can efficiently explore the spaces while balancing the number of simulations with prediction accuracy. Here, we present how active learning can sequentially select simulation conditions for gas adsorption, ultimately resulting in accurate adsorption predictions with an order of magnitude lower number of simulations. We model adsorption of pure components methane and carbon dioxide in Cu–BTC. We employ Gaussian process regression (GPR) and use the resulting uncertainties in the predictions to guide the next sampling point for molecular simulation. We outline the procedure and demonstrate how this model can emulate adsorption isotherms at 300 K from 10 −6 to 300 bar (methane)/100 bar (carbon dioxide). We also show how this procedure can be used for predicting adsorption on a temperature–pressure phase space for a temperature range of 100 to 300 K, and pressure range of 10 −6 to 300 bar (methane)/100 bar (carbon dioxide). 
    more » « less
  2. Abstract The application of machine learning (ML) techniques in materials science has revolutionized the pace and scope of materials research and design. In the case of metal–organic frameworks (MOFs), a promising class of materials due to their tunable properties and versatile applications in gas adsorption and separation, ML has helped survey the vast material space. This study explores the integration of reinforcement learning (RL), specifically Q‐learning, within an active learning (AL) context, combined with Gaussian processes (GPs) for predictive modeling of adsorption in MOFs. We demonstrate the effectiveness of the RL‐driven framework in guiding the selection of training data points and optimizing predictive model performance for methane and carbon dioxide adsorption, using two different reward metrics. Our results highlight the integration of RL as an AL method for adsorption predictions in MFs, and how it compares to a previously implemented AL scheme. 
    more » « less
  3. High-throughput molecular simulations and machine learning (ML) have been implemented to adequately screen a large number of metal−organic frameworks (MOFs) for applications involving adsorption. Grand canonical Monte Carlo (GCMC) simulations have proven effective in calculating the adsorption capacity at given pressures and temperatures, but they can require expensive computational resources. While they can be resource-efficient, ML models can require large datasets, creating a need for algorithms that can efficiently characterize adsorption; active learning (AL) can play a very important role in this regard. In this work, we make use of Gaussian process regression (GPR) to model pure component adsorption of nitrogen at 77 K from 10−5 to 1 bar, methane at 298 K from 10 −5 to 100 bar, carbon dioxide at 298 K from 10−5 to 100 bar, and hydrogen at 77 K from 10−5 to 100 bar on PCN-61, MgMOF-74, DUT-32, DUT-49, MOF-177, NU-800, UiO-66, ZIF-8, IRMOF-1, IRMOF-10, and IRMOF-16. The GPR model requires an initial training of the model with an initial dataset, the prior one, and, in this study of evaluating AL, we make use of three different prior selection schemes. Each prior scheme is updated with a sampling point resulting from the GP model uncertainties. This protocol continues until a maximum GPR relative error of 2% is attained. We make a recommendation on the best prior selection scheme for the total 44 adsorbate−adsorbent pairs primarily making use of the mean absolute error and the total amount of points required for convergence of the model. To further evaluate the AL framework, we apply the BET consistency criteria on the simulated and GP nitrogen isotherms and compare the resulting surface areas. 
    more » « less
  4. 2D layered metal-organic frameworks (MOFs) are a new class of multifunctional materials that can provide electrical conductivity on top of the conventional structural characteristics of MOFs, offering potential applications in electronics and optics. Here, for the first time, we employ Machine Learning (ML) techniques to predict the thermodynamic stability and electronic properties of layered electrically conductive (EC) MOFs, bypassing expensive ab initio calculations for the design and discovery of new materials. Proper feature engineering is a very important factor in utilizing ML models for such purposes. Here, we show that a combination of elemental features, using generic statistical reduction methods and crystal structure information curated from the recently introduced EC-MOF database, leads to a reasonable prediction of the thermodynamic and electronic properties of EC MOFs. We utilize these features in training a diverse range of ML classifiers and regressors. Evaluating the performance of these different models, we show that an ensemble learning approach in the form of stacking ML models can lead to higher accuracy and more reliability on the predictive power of ML to be employed in future MOF research. 
    more » « less
  5. Metal–organic frameworks (MOFs), with their unique porous structures and versatile functionality, have emerged as promising materials for the adsorption, separation, and storage of diverse molecular species. In this study, we investigate water adsorption in MOF-808, a prototypical MOF that shares the same secondary building unit (SBU) as UiO-66, and elucidate how differences in topology and connectivity between the two MOFs influence the adsorption mechanism. To this end, molecular dynamics simulations were performed to calculate several thermodynamic and dynamical properties of water in MOF-808 as a function of relative humidity (RH), from the initial adsorption step to full pore filling. At low RH, the μ3-OH groups of the SBUs form hydrogen bonds with the initial water molecules entering the pores, which triggers the filling of these pores before the μ3-OH groups in other pores become engaged in hydrogen bonding with water molecules. Our analyses indicate that the pores of MOF-808 become filled by water sequentially as the RH increases. A similar mechanism has been reported for water adsorption in UiO-66. Despite this similarity, our study highlights distinct thermodynamic properties and framework characteristics that influence the adsorption process differently in MOF-808 and UiO-66. 
    more » « less