<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Inverse design of metal–organic frameworks for C2H4/C2H6 separation</title></titleStmt>
			<publicationStmt>
				<publisher>npj Computational Materials</publisher>
				<date>12/01/2022</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10467925</idno>
					<idno type="doi">10.1038/s41524-022-00946-w</idno>
					<title level='j'>npj Computational Materials</title>
<idno>2057-3960</idno>
<biblScope unit="volume">8</biblScope>
<biblScope unit="issue">1</biblScope>					

					<author>Musen Zhou</author><author>Jianzhong Wu</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[<title>Abstract</title> <p>Efficient separation of C<sub>2</sub>H<sub>4</sub>/C<sub>2</sub>H<sub>6</sub>mixtures is of paramount importance in the petrochemical industry. Nanoporous materials, especially metal-organic frameworks (MOFs), may serve the purpose owing to their tailorable structures and pore geometries. In this work, we propose a computational framework for high-throughput screening and inverse design of high-performance MOFs for adsorption and membrane processes. High-throughput screening of the computational-ready, experimental (CoRE 2019) MOF database leads to materials with exceptionally high ethane-selective adsorption selectivity (LUDLAZ: 7.68) and ethene-selective membrane selectivity (EBINUA02: 2167.3). Moreover, the inverse design enables the exploration of broader chemical space and identification of MOF structures with even higher membrane selectivity and permeability. In addition, a relative membrane performance score (rMPS) has been formulated to evaluate the overall membrane performance relative to the Robeson boundary. The computational framework offers guidelines for the design of MOFs and is generically applicable to materials discovery for gas storage and separation.</p>]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The efficiency of C2H4/C2H6 separation is important for the petrochemical industry because high-purity C2H4 is used as the primary feedstock for the synthesis of diverse chemical products including plastics, polyesters and rubber materials. <ref type="bibr">1,</ref><ref type="bibr">2</ref> Conventional processes for C2H4/C2H6 separation are mostly based on high-pressure cryogenic distillation, which requires extensive energy input while suffers from low separation efficiency. To reduce the energy cost and increase the selectivity, it is desirable to develop alternative approaches such as adsorption or permeation processes based on nanoporous materials. <ref type="bibr">[3]</ref><ref type="bibr">[4]</ref><ref type="bibr">[5]</ref> Metal-organic frameworks (MOFs) are ideal candidates for efficient separation of C2H4/C2H6 because they have good mechanic stability, large specific surface area, and tailorable pore structure and geometry. <ref type="bibr">[6]</ref><ref type="bibr">[7]</ref><ref type="bibr">[8]</ref> In particular, such materials show promising performance for separating molecules with similar size and interaction energy, such as the mixtures of H2/D2 isotopes, of noble gases (Ar/Kr/Xe), and of xylene isomers. <ref type="bibr">[9]</ref><ref type="bibr">[10]</ref><ref type="bibr">[11]</ref><ref type="bibr">[12]</ref> For C2H4/C2H6 separation, promising MOF candidates have been identified by experiments. <ref type="bibr">3,</ref><ref type="bibr">4,</ref><ref type="bibr">13,</ref><ref type="bibr">14</ref> Whereas the possible variations of MOFs are virtually unlimited and the separation efficiency is sensitive to the atomic details, it is practically infeasible to explore the design space only through experimentation. Previously, computational methods have been used to identify best material candidates for separation process through high-throughput screening. 9, <ref type="bibr">15,</ref><ref type="bibr">16</ref> While the adsorption isotherms predicted by the computational methods are found in good agreement with experimental measurement, <ref type="bibr">17</ref> membrane processes are often considered more efficient to separate C2H4 from C2H6 leveraging the difference in both adsorption affinity and gas diffusivity. <ref type="bibr">18</ref> To the best of our knowledge, previous research on the computational screening of MOF database is mostly concerned with the separation of C2H4/C2H6 by adsorption. <ref type="bibr">17,</ref><ref type="bibr">[19]</ref><ref type="bibr">[20]</ref><ref type="bibr">[21]</ref><ref type="bibr">[22]</ref> From the computational perspective, the assessment of MOF materials for membrane separation is much more demanding because the evaluation of gas diffusivity in confined geometry is typically more time-consuming than that for gas adsorption. In particular, the strong confinement makes it computationally prohibitive to predict the diffusion coefficients of gas molecules in a large library of nanoporous materials using conventional methods such as molecular simulation (MD) simulation. <ref type="bibr">23,</ref><ref type="bibr">24</ref> Although computational methods (e.g., MD, grand canonical Monte Carlo simulation, and classical density function theory) have been well established for accurate prediction of gas adsorption and diffusivity, <ref type="bibr">[25]</ref><ref type="bibr">[26]</ref><ref type="bibr">[27]</ref><ref type="bibr">[28]</ref> the inverse design of nanoporous materials for separation processes remains a theoretical challenge from both computational and practical perspectives. While generative adversarial network (GAN) shows early success in the inverse design of zeolites for methane storage, <ref type="bibr">29</ref> its computational complexity increases significantly with the number of elements in the crystal structure. Besides, GAN easily breaks down and fails to converge for complicated crystalline materials such as MOFs because a large number of atomic types need to be considered. In contrast, variational autoencoder (VAE) can well accommodate the complex topology and molecular structure of the secondary building blocks (SBUs) by compressing the MOF structure into a text string and projecting it into the latent space. <ref type="bibr">30</ref> However, VAE requires accurate projection (viz., encode and decoder) between the crystal structure and a latent space, and the VAE training would become infeasible when a vast number of SBUs are considered for the MOF design. Alternatively, evolutionary algorithms, such as the genetic algorithm, are promising for the inverse design of MOFs because they can accommodate not only a large number of SBUs for MOF design, but also find the solution in a nonlinear space consisted of the material topology and SBUs. <ref type="bibr">31,</ref><ref type="bibr">32</ref> In previous work, <ref type="bibr">25,</ref><ref type="bibr">26</ref> we developed a computational procedure with GPU-acceleration that offers fast and accurate evaluation of sorption and diffusion properties of gas molecules in nanoporous materials. Empowered by the developed computational capability, here we perform high-throughput screening of the computational-ready experimental (CoRE 2019) MOF database (over 10k MOFs) for the separation of C2H4/C2H6 with adsorption and membrane processes.</p><p>Compared with the state-of-art materials from the literature, <ref type="bibr">4,</ref><ref type="bibr">33</ref> the best MOFs identified in this work have significantly higher separation selectivity. The highest ethane-selective adsorption selectivity in LUDLAZ is up to 7.68, and the highest ethene-selective membrane selectivity in EBINUA02 can reach 2167.3. Leveraging on the high-throughput capability, a genetic algorithm (GA) is incorporated into our computational workflow to achieve the inverse design of MOF membranes with both high membrane selectivity and permeability. The inverse design allows us to explore a broader chemical space in comparison with high-throughput screening and identify MOFs with even higher membrane selectivity and permeability. The structural analyses of MOFs with best separation performance offer useful guidelines for the experimental design of MOFs for adsorption and membrane separation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Results and Discussion</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Screening CoRE MOF 2019 Database</head><p>We first perform the high-throughput screening of the computational-ready experimental (CoRE) MOF 2019 database for separation of C2H4/C2H6 through adsorption and membrane processes. While high-throughput screening has been commonly used to find the best material candidates for gas adsorption, the procedure is more challenging for membrane separation because of the steep computational cost in evaluating the diffusion coefficients. Figure <ref type="figure">1</ref> shows the separation selectivity versus capacity for both adsorption and membrane separations. In Figure <ref type="figure">1(a)</ref> and Supplementary Figure <ref type="figure">1</ref>, MOFs with Henry's constants smaller than 1&#215;10 -5 mol&#8226;m -3 &#8226;Pa -1 and the largest cavity diameters smaller than the hard-sphere diameter of ethene have been removed.</p><p>Those structures can hardly accommodate ethane/ethene molecules thus are not further considered in this work. The hard-sphere diameters of ethane and ethene are calculated from the Barker-Henderson theory <ref type="bibr">34</ref> based on their Lennard-Jones (LJ) parameters and more details are provided in Supporting Information. As shown in Figure <ref type="figure">1</ref>(a), the maximum selectivity is less than 30 for ethene-selective MOFs suitable for adsorption separation. It decreases exponentially with the increase of the separation capacity (viz. adsorption amount) because highly confined pores are needed in order to achieve high ethene-selectivity. The structural properties of top ethene-selective and ethane-selective MOFs are shown in Supplementary Table <ref type="table">2</ref> and Supplementary Table <ref type="table">3</ref>, respectively. Such materials offer little pore volume to achieve high adsorption capacity. Table <ref type="table">1</ref> lists the properties of top 5 ethene-selective MOFs for the adsorption separation of C2H4/C2H6 at room temperature (300 K).</p><p>Figure 1. Selectivity vs. capacity for CoRE MOFs used in C2H4/C2H6 separation. (a) Adsorption selectivity. The green line is fitted with &#945; IM =0.2516e -0.7411log(K) . (b) Membrane selectivity. The red line denotes the Robeson boundary, and the color stands for the percentile of relative membrane performance score (rMPS): the red, gray and blue represent the highest, intermediate, and the lowest rMPS, respectively. The dashed line marks the membrane selectivity of 1. Although ethene-selective materials yield high selectivity in the separation of C2H4/C2H6 by gas adsorption, industrial applications desire ethane-selective processes because they can significantly reduce the energy cost. Supplementary Figure <ref type="figure">1</ref> shows that, consistent with the literature, <ref type="bibr">4,</ref><ref type="bibr">5,</ref><ref type="bibr">20</ref> the highest selectivity of ethane-selective MOFs identified in this work is much smaller than that of ethene-selective MOFs because the stronger adsorption of smaller molecules (e.g., ethene) yields a larger adsorption selectivity of C2H4/C2H6 via the ultra-small pores of promising MOFs. It is worth noting that, different from ethene-selective MOF for the separation of C2H4/C2H6 via adsorption, the selectivity of ethane-selective MOFs increases with the capacity and none of the CoRE MOFs have the ethane-selective adsorption selectivity above 8. have been reported before, <ref type="bibr">17,</ref><ref type="bibr">[19]</ref><ref type="bibr">[20]</ref><ref type="bibr">[21]</ref><ref type="bibr">[22]</ref> the materials identified in this work yield much higher selectivity for both ethene-selective and ethane-selective separations. Previously, the computational screening was carried either on a smaller structural database or a subset of large structural library (e.g., CoRE MOF 2019) that was restricted by certain structural and chemical criteria. While the application of these criteria would speed up the computation by reducing the number of materials to be evaluated, they ignore promising candidates due to the complex topology and structure. For the adsorption separation of C2H4/C2H6, the selectivity declines with the increase of loading amount. The reduction in adsorption selectivity can be attributed to the smaller difference between the adsorbate-adsorbate interactions in comparison to that between adsorbate and adsorbent interactions.</p><p>According to the structural analysis of the promising materials (results shown in Supplementary Figure <ref type="figure">2</ref> and Supplementary Figure <ref type="figure">3</ref>), the ethane-selective MOFs have less confined geometry in terms of the pore limit diameter (PLD), the largest cavity diameter (LCD) and pore size distribution in comparison with the ethene-selective MOFs. The increase in pore size leads to a much higher adsorption capacity for those MOFs with higher ethane-selectivity. It is worth mentioning that the highest adsorption selectivity of ethane-selective MOF [LUDLAZ:</p><p>&#945; IM (C2H6/C2H4)=7.68] identified in this work is about 60% higher than the best nanoporous material found by the previous computational screening <ref type="bibr">22</ref> , a hypothetical zeolite structure [&#945; IM (C2H6/C2H4)=4.86]. The adsorption selectivity is more than 70% higher than that of the stateof-art nanoporous material reported in the experimental literature <ref type="bibr">4</ref> , Fe2(O2)(dobdc) with C2H6/C2H4 adsorption selectivity of 4.4. Besides, LUDLAZ has a much larger Henry's constant [Kh(C2H6)=4.6056 cm 3 &#8226;g -1 &#8226;Pa <ref type="bibr">-1</ref> ] than Fe2(O2)(dobdc) [Kh(C2H6)=0.0147 cm 3 &#8226;g -1 &#8226;Pa -1 ], meaning much higher gravimetric adsorption capacity. We note in passing that LUDLAZ was originally synthesized by McKellar and coworkers to examine how ligand exchange affects the stability and compressibility of MOF materials. <ref type="bibr">35</ref> For the top 5 ethane-selective MOFs, their pore size distributions characterized by N2 adsorption are shown in Supplementary Figure <ref type="figure">3(b)</ref>. These materials have similar micropores between 5 &#197; and 7 &#197;, rendering higher selectivity of ethane over ethene in adsorption separation.</p><p>Compared with adsorption, gas separation via permeation through MOF membranes may achieve not only higher selectivity but also larger separation capacity. In addition, a membrane splits the feed stream into two purified sub-streams (viz., retentate stream and permeate stream) such that it does not require a recovery process even for ethene-selective operations. Therefore, the membrane process is often much less energy-intensive in comparison with adsorption. Figure <ref type="figure">1(b)</ref> shows the membrane selectivity versus membrane permeability in the units of barrer. The red line in Figure <ref type="figure">1</ref>(b) denotes the Robeson boundary, a semi-empirical upper limit summarized by Rungta et al. <ref type="bibr">2</ref> based on the state-of-art polymer membranes for specifically separating ethane and ethene. We see that many MOFs in CoRE MOF 2019 database surpass the Robeson boundary, indicating their superior performance compared with the polymer membranes. The highest membrane selectivity of C2H4/ C2H6 is 2167.3 in MOF -EBINUA02, which was synthesized by Tian et al. <ref type="bibr">36</ref> with 1D rhombic channel in the 3D diamond topology network. The selectivity is several orders of magnitude larger than the state-of-art membrane materials discovered by previous computational screening and experimental synthesis. <ref type="bibr">19,</ref><ref type="bibr">33</ref> Table 3 lists the top 5 MOFs with the highest membrane selectivity of C2H4/C2H6. Table 3. Top MOFs for C2H4/C2H6 separation with the highest membrane selectivity (k IM ) at 300 K.</p><p>In our previous work, <ref type="bibr">25,</ref><ref type="bibr">26</ref> we proposed the membrane performance score (MPS) to evaluate the overall performance of nanoporous materials by combining the membrane selectivity and permeability. Although MPS offers a direct comparison of nanoporous materials with different permeability and selectivity, it does not evaluate the membrane performance relative to the upper limit of the state-of-art polymer membranes (viz., the Robeson boundary). Here, we propose a modification of MPS, which is originally defined as fast/slow fast MPS k P = &#215; . The relative membrane performance score (rMPS) is defined as</p><p>where k represents the membrane selectivity, P stands for the gas permeability, and the fast component refers to the one with higher permeability in the binary mixture. Table <ref type="table">4</ref>. Top MOFs with the highest relative membrane performance score (rMPS) for the separation of C2H4/C2H6 at 300 K.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Structural Features of Promising MOF Membranes</head><p>As discussed above, the selectivity of MOF membranes is less compromised (and much higher)</p><p>high separation capacity in comparison with MOF adsorbents. To explore the synergetic effects between adsorption and diffusion, we have further examined the structural features of top MOFs with the highest membrane selectivity and rMPS. Figure 2. Distributions of the pore limit diameter (a) and the largest cavity diameter (b) for all CoRE MOFs and MOFs with top 5% ideal membrane selectivity and relative performance score (rMPS) for C2H4/C2H6 separation. Figure 2 shows the distributions of pore limit diameter (PLD) and the largest cavity diameter (LCD) for all MOFs in CoRE MOF 2019 database and MOFs with top 5% membrane selectivity and rMPS. Compared with the distributions of PLD and LCD for all CoRE MOFs, it is clear that MOFs with top 5% membrane selectivity (and rMPS) have significantly different structural features. The PLD and LCD distributions suggest that MOFs with top 5% membrane selectivity have much smaller pores than those with top 5% rMPS. However, their void fractions are rather similar, both in the range from 0.4 to 0.7 (shown in Supplementary Figure <ref type="figure">4</ref>). For MOFs with top 5% membrane selectivity, the PLD mostly distributes between 2.75 &#197; and 3.5 &#197;, where the narrow end is even slightly smaller than the LJ diameter of the methylene group in ethene. It is worth mentioning that nanoporous materials do not prohibit gas adsorption even when the PLD is slightly smaller than the LJ diameter of the gas molecules. <ref type="bibr">37</ref> As a matter of fact, the selectivity is maximized for MOFs with the PLD slightly smaller than the LJ diameter because the extremely narrow pore aperture magnifies the difference in the potential energy between C2H4 and C2H6 at the transition state. By contrast, MOFs with top 5% rMPS have a PLD distribution spanning from 3.75 &#197; to 4.75 &#197;. Compared with MOFs with top 5% membrane selectively, the slightly larger PLD for MOFs with top 5% rMPS leads to a smaller difference in the potential energy between ethane and ethene at the transition state, and therefore, much higher permeability with intermediate membrane selectivity. Similar to the PLD distribution, the LCD distribution for the MOFs with top 5% membrane selectivity is mostly localized at the smaller pore size than those with top 5% rMPS. For MOFs with top 5% membrane selectivity, the ultra-narrow pore apertures contribute to a larger membrane selectivity but a smaller diffusion coefficient and lower permeability. illustration. The energy landscape along the MEP for ethane and ethene in EBINUA02 (c) and LUDLAZ (d) at 300 K. the energy landscape for gas diffusion in MOFs with the highest membrane selectivity [EBINUA02: k IM (C2H4/C2H6)=2167.3 and rMPS=1012.4 barrer] and the highest rMPS [LUDLAZ: k IM (C2H6/C2H4)=11.1]. Supplementary Figure 5 presents the energy barrier along the MEP for EBINUA02 and LUDLAZ. For both EBINUA02 and LUDLAZ, only one direction along the lattice vector can accommodate the diffusion of ethane or ethene molecule. Figure 3(a) and (b) show that, despite the significant difference between EBINUA02 and LUDLAZ in the energy landscape along the MEP, their MEPs inside MOFs are almost identical. Both EBINUA02 and LUDLAZ yield near straight trajectories for the molecular center of mass on the MEP with a minimal change of the molecular orientation, suggesting that the high membrane selectivity is attributed to extremely narrow pores. It is worth noting that the local chemical environments are very similar along the MEP in EBINUA02 and LUDLAZ, and their different pore structures result in the distinct energy landscape along the MEP. Although EBINUA02 and LUDLAZ have a similar void fraction, their pore structures (e.g., PLD and LCD) are very different thus result in different separation mechanisms. According to the solution-diffusion theory, 18 the membrane selectivity can be improved by increasing the difference in adsorption, diffusion, or a combination of both quantities. EBINUA02 has a much smaller pore aperture (PLD: 2.91 &#197; and LCD: 3.96 &#197;) than LUDLAZ (PLD: 4.18 &#197; and LCD: 5.96 &#197;), which leads to the preferential diffusion and adsorption (viz. solubility in solution-diffusion theory) of C2H4 and the extremely high membrane selectivity of C2H4 over C2H6. Whereas in LUDLAZ, its</p><p>interaction with the gas molecules is attractive even at the transition state and the difference in energy barrier between C2H4 and C2H6 is almost negligible (Fig. <ref type="figure">3D</ref>). The relatively spacious pore structure in LUDLAZ results in a slightly faster diffusion of C2H6 than C2H4 because ethane experiences a stronger van der Waals attraction. As a result, the membrane selectivity of LUDLAZ is mostly contributed by the difference in adsorption (viz., Henry's constant) between C2H4 and C2H6. According to the above analysis of the top MOFs with high membrane selectivity and rMPS, a large energy barrier (viz., extremely narrow pore aperture) is not preferred for the design of ideal MOF membrane (with both high selectivity and permeability). The trend is intuitively understandable because a narrow pore aperture significantly reduces the gas diffusion coefficient and permeability, For the rational design of ideal MOF membranes for C2H4/C2H6 separation, the selectivity and permeability need to be harnessed by enhancing the difference in adsorption and diffusion, respectively.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">Inverse Design of MOF Membranes</head><p>In comparison with adsorption, the membrane process has major advantages in terms of both separation selectivity and capacity (viz. permeability). As a result, our inverse design is concerned only with MOF membranes. To find nanoporous materials ideal for C2H4/C2H6 separation (viz., high separation selectivity and capacity), we use a genetic algorithm (GA) with the fitness score of IM total 0.5 0.5</p><p>In Equation (2), the total fitness score, F total , is evenly weighted according to the member selectivity and permeability. The member selectivity fitness score is formulated as</p><p>where IM k stands for the (ethene-selective) membrane selectivity. Conversely, the permeability fitness score is defined by ( ) 100 log 100</p><p>1 100</p><p>In this work, the membrane selectivity of 5 and permeability of 100 barrers are used as threshold values because most CoRE MOFs have a permeability larger than 100 barrer but few have a membrane selectivity larger than 5. For materials with the permeability or selectivity lower than the corresponding threshold value, the exponential form allows for a smooth variation of the fitness score. Such a smooth form is important especially for materials with an extremely small value of permeability. Compared with improving permeability, it is much more difficult to improve the membrane selectivity. Therefore, for permeability and selectivity larger than the threshold values, the fitness score of selectivity is constructed in the quadratic form while the natural logarithm is used for the permeability. According to this formulation of the fitness score, a relatively small increase in selectivity would lead to a large increase in the fitness thus allow GA to generate MOFs with both high membrane selectivity and permeability instead of MOFs with solely high permeability. Here square boxes represent the secondary building blocks (SBUs) used for MOF construction. In analogy to the genes in the chromosome, the choice of SBUs directly determines the physiochemical properties and separation performance. Black, brown and green boxes represent the topology, node and edge, respectively.</p><p>Figure <ref type="figure">4</ref> shows how each MOF is represented as a 'chromosome', i.e., in terms of the MOF topology, node and edge. The schematic flowchart elucidates the computational steps in the inverse design of MOF membranes. Compared with CoRE MOF database or other existing MOF structural databases, a much larger chemical space can be explored for the inverse design. In this work, the secondary building block (SB) database consists of 1,687 topologies, 648 nodes and 219 edges.</p><p>As each MOF is defined by a topology, and up to two types of metal nodes and three types edges, enormous combinations are possible for the MOF construction. Although the number 7.4&#215;10 <ref type="bibr">15</ref> does not necessarily reflect the actual size of the design space because many combinations of the edges, nodes and topology cannot lead to the construction of synthesizable MOFs, it shows the extensiveness of the design space in enumeration of all possible combinations of the topology and building blocks. Not all chromosomes would lead to a successful MOF design owing to the unmatched coordination number and bonding distance in the topologies, nodes and edges. Invalid chromosomes were identified during the construction of MOF structures with PORMAKE based on the compatibility criteria for the building blocks and topology. <ref type="bibr">37</ref> The percentage of invalid chromosomes is about 70% on average among all generations of chromosomes sampled by GA.</p><p>In general, the successful rate increases with the GA evolution. For valid chromosomes, we obtain the MOF structure and cell parameters from PORMAKE that is able to accommodate the nodes and linkers in arbitrary topologies while avoid clashing atoms and distorting bonds. The structures generated by PORMAKE were optimized during the MOF construction. No further geometry optimization was attempted in this work. Figure <ref type="figure">5</ref> shows the membrane selectivity and permeability of the designed MOFs in comparison with CoRE MOFs. Here, we consider only materials in the region where the scale of permeability is similar to the experiment results. Supplementary Figure <ref type="figure">6</ref> shows the same figure with the full ranges of permeability and selectivity. Figure <ref type="figure">5</ref>(a) compares the properties of MOF structures generated by GA with those from the CoRE MOF library in terms of the membrane selectivity and permeability. Clearly, GA is able to identify MOF structures with the targeted properties, i.e., high selectivity and high capacity as shown in the area above the Robeson boundary.</p><p>Because no constraint was imposed in sampling the design space, the GA sampling does not always lead to the successful construction of MOF structures. For those chromosomes not generating valid MOFs, their fitness scores would be assigned to the lowest value. sampling. Similar plots are given in Supplementary Figure 6(b) and (c) but for the entire ranges of membrane selectivity and permeability. As the area under each curve represents the percentage of valid MOF structures in each generation, the successful evolution of GA is evident not only because it generates more valid MOF structures after each round of evolution but because the designed materials show noticeable improvement in both permeability and membrane selectivity.</p><p>Although many MOF structures do not surpass the Robeson boundary even in the final generation (gold stars), GA is able to create successful candidates with high membrane selectivity and permeability. Importantly, GA is computationally much more efficient than conventional approaches of material discovery such as high-throughput screening because it avoids enumeration of the entire design space.</p><p>The best MOF identified by the inverse design (yfk-N379) significantly improves the overall membrane separation performance in terms of both membrane selectivity and permeability. Table <ref type="table">5</ref> lists the detailed properties of yfk-N379. It should be noted that the best materials identified by GA may vary with the initial generation. In this work, we tested the efficiency of the inverse design with a different initial generation (shown in Supplementary Figure <ref type="figure">8</ref>). Whereas different sets of materials were sampled during the evolution, it seems that GA sampling is robust to meet the goal of the inverse design independent of the initial condition. Although the final MOF structures are not identical, they have similar performance in terms of both membrane selectivity and permeability.</p><p>Table <ref type="table">5</ref>. Properties of designed MOFs with both high membrane selectivity and permeability for the separation of C2H4/C2H6 at 300 K.</p><p>Figure <ref type="figure">6</ref> shows molecular orientation and energy landscape for the diffusion of gas molecules along the MEP. In yfk-N379, the metal node, (CO2-</p><p>connected with the yfk topology network to form a 1D channel for the diffusion of C2H4 and C2H6 molecules. The energy landscape along the MEP in yfk-N379 is similar to that in LUDLAZ where the intermolecular interaction along the MPE is all attractive. Also, like that in LUDLAZ, the difference in the energy barrier between C2H4 and C2H6 is relatively small. As discussed above, in order to design an ideal MOF membrane with both high membrane selectivity and permeability, the separation selectivity shall be harnessed by the difference in the adsorption (viz., solubility), and the high permeability should be obtained by fast diffusion. As shown in Figure <ref type="figure">6</ref>(b), the 1D channel in yfk-N379 offers strong attraction and relatively moderate energy barrier along the MEP, which results in the extremely fast diffusion of gas molecules. The distinct difference of adsorption properties (viz., Henry's constant) between C2H4 and C2H6 in yfk-N379 leads to an exceptionally high membrane selectivity compared to the CoRE MOFs. Although the membrane selectivity yfk-N379 is not much larger than those corresponding to the top 5 CoRE MOFs, its permeability (8420 barrer) is at least three orders of magnitudes higher. Conversely, the membrane selectivity of yfk-N379 is 30% higher than those of CoRE MOFs with similar permeability. The synergetic effects from adsorption and diffusion help yfk-N379 achieve both high membrane selectivity and permeability simultaneously, making it promising for industry applications. We have performed molecular dynamics (MD) simulation to further validate the diffusion coefficients of ethane and ethene in top 5 MOFs with the highest rMPS and in the best MOF candidate identified from the inverse design (yfk-N379). As shown in Supplementary Figure <ref type="figure">7</ref>, the diffusion coefficients predicted by the transition-state theory agree well with the MD results.</p><p>In comparison with MOF-5 tested in our earlier work, these MOFs have more complicated pore structure and topology, indicating the accuracy of MEPs calculated by the simplified string method.</p><p>Compared with high-throughput screening, the inverse design via GA is computationally much more efficient. For example, the ideal MOF candidate (both high membrane selectivity and permeability) for the membrane separation of C2H4/C2H6 can be found with only 8,000 attempts in a much larger chemical space. The computational workflow thus demonstrates that, with the efficient theoretical tools for high-throughput evaluation of materials performance, how the inverse design can significantly accelerate the material discovery, especially for the construction of reticular materials (e.g., MOFs and COFs) for the gas storage and separation.</p><p>In this work, we have integrated high-throughput screening and inverse design to find the best metal-organic frameworks (MOFs) for C2H4/C2H6 separation. Both adsorption and membrane processes have been considered in the high-throughput screening of computational-ready experimental (CoRE 2019) MOF database. For the adsorption separation, the separation selectivity of ethene-selective MOF decreases with the increase of separation capacity because highly etheneselective materials have extremely small pores with low adsorption capacity. While the selectivity of ethane-selective MOF increases with the adsorption capacity, the highest adsorption selectivity of ethane-selective MOF (LUDLAZ) [&#945; IM (C2H6/C2H4)=7.68] is smaller than that of etheneselective MOF (PIRYOF) [&#945; IM (C2H4/C2H6)=27.01]. Nevertheless, LUDLAZ is more than 70% higher than the state-of-art ethane-selective MOF identified by previous work.</p><p>Compared with that in the adsorption process, the selectivity of membrane process is less compromised by the increase of the separation capacity. Through high-throughput screening, we find that EBINUA02 yields the highest membrane selectivity [k IM (C2H4/C2H6)=2167.3]. To evaluate the overall membrane performance, we introduced a relative membrane performance score (rMPS) in terms of the selectivity and permeability with respect to the Robeson boundary.</p><p>For the separation of C2H4/C2H6, high rMPS favors MOFs with high permeability and intermediate membrane selectivity because high membrane selectivity requires large energy barrier along the minimum energy path (MEP) and leads to the slow diffusion. According to the structural analysis, MOFs with top 5% membrane selectivity have a much more confined diffusion path in terms of pore limit diameter (PLD) and largest cavity diameter (LCD) than those with top 5% rMPS, despite their similarity in the distribution of void fraction. The separation mechanism is quite different between EBINUA02 and the MOF (LUDLAZ) with the highest rMPS. While a small pore aperture (PLD: 2.91 &#197; and LCD: 3.96 &#197;) in EBINUA02 results in faster diffusion and stronger adsorption of C2H4 over C2H6, the less confined diffusion path in LUDLAZ (PLD: 4.18 &#197; and LCD: 5.96 &#197;) leads to negligible difference in the diffusion. In that case, the membrane selectivity is mostly contributed by its ethane-selective solubility.</p><p>The computational efficiency of the theoretical tools for predicting the sorption and diffusion properties of nanoporous materials enables the design MOF membranes with both high membrane selectivity and permeability using the genetic algorithm (GA). Compared with high-throughput screening, not only can GA explore the material design space with targeted properties, but it takes less attempts to identify the most promising candidates as well. The best MOF discovered by GA</p><p>) with the yfk topology. The designed material has both permeability and membrane selectivity significantly larger than the threshold values set in the fitness function. Besides, its overall membrane separation performance is better than all existing experimental MOF candidates. The computational workflow used in the work thus demonstrates the capability of inverse design to accelerate the discovery of nanoporous materials, especially reticular materials (such as MOFs and COFs) for gas storage and separation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">Molecular Models</head><p>In this work, ethane (C2H6) and ethene (C2H4) molecules are modeled as two united-atom groups separated by a fixed bond length. <ref type="bibr">38</ref> The detailed force field parameters can be found in Supplementary Table <ref type="table">1</ref>. These force-field parameters are able to reproduce the adsorption isotherms of ethane and ethene in nanoporous materials. <ref type="bibr">39</ref> They also predict reasonable diffusion coefficients in comparison with limited experimental data. For example, the diffusivity predicted in this work agrees well with that from experiment for C2H6 in MOF-5 [2.42&#215;10 -8 m 2 /s vs 1.8~2.1 &#215;10 -8 m 2 /s (exp)]. Metal-organic frameworks (MOFs) are considered to be rigid with the universal force field (UFF) for all nonbonded interactions. <ref type="bibr">40</ref> While the flexibility of MOFs may play an important role in determining the efficiency of gas separation, a reliable description of such effects is computationally prohibitive for high-throughput screening because it would require input from high-level quantum-mechanical calculations. <ref type="bibr">41</ref> Recently, a systematic examination of the impacts of MOF flexibility on molecular diffusivities indicates that the rigid-structure assumption yields reasonable diffusion coefficients for rigid molecules such as ethane and ethene investigated in the present work. <ref type="bibr">42</ref> As mentioned above, the diffusivity of C2H6 in MOF-5 predicted in this work agrees well with that from experiment.</p><p>The Lennard-Jones (LJ) 12-6 potential is truncated and shifted to zero at 12.9 &#197;, and the Lorentz-Berthelot mixing rule is used for the energy and size parameters between different kinds of atoms. The periodic boundary condition is applied to all cell axes. The unit cell is duplicated such that the length along each lattice axis is at least two times the cutoff distance. The structural properties, such as the pore limit diameter, the largest cavity diameter and the void fraction, are calculated with Zeo++. <ref type="bibr">43</ref> </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Adsorption Separation</head><p>Nanoporous materials have been widely used as adsorbent in industrial applications. In the low pressure region, the adsorption selectivity for an equimolar mixture of two gas species can be measured with the ratio of Henry's constants <ref type="bibr">34</ref> h,2 IM h,1</p><p>where Kh,i represents the Henry's constant of component i. For a gas molecule with a rigid conformation, the Henry's constant can be calculated via the integration of the external potential due to its interaction with the nanoporous material 44 ( )</p><p>where &#946; = 1/ (k B T ) , kB stands for the Boltzmann constant, T is the absolute temperature, V represents the system volume, &#981; ext is the external potential, i.e., the potential energy due to the interaction of a gas molecule with all atoms from the porous material, r represents the cartesian coordinates for the center of mass of the gas molecule, and &#969; stands for its Euler angles. For each MOF, the Henry's constants for C2H6 and C2H4 are numerically evaluated via midpoint rule with the step size of 1 &#197; and 45&#176; for spatial and rotational variables, respectively.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3">Membrane Separation</head><p>According to the solution-diffusion model, the membrane permeability is defined as the Henry's constant multiplied by the gas diffusion coefficient at infinite dilution. The membrane selectivity can thus be calculated from <ref type="bibr">44</ref> h,2 0,2 IM 2 h,1 0,1 1 </p><p>Along each direction, the diffusivity can be calculated independently according to the transitionstate theory (TST)</p><p>where &#945; stands for the direction of lattice vector, k represents the transmission rate (viz., the hopping rate of the gas molecule), and a &#945; stands for the hopping distance between neighboring unit cells along the direction of lattice vector &#945; . The hopping rate can be obtained from the minimum energy path (MEP) via the Bennett-Chandler formula <ref type="bibr">45,</ref><ref type="bibr">46</ref> ( )</p><p>where s is a normalized dimensionless variable along the MEP, and s * represents the transition state. The diffusion coefficient predicted by Equation ( <ref type="formula">7</ref>) depends heavily on the accuracy of the MEP.</p><p>The mathematical details and the accuracy of MEP calculations for predicting the diffusivity of rigid molecules, such as C2H4 and C2H6, can be found in our previous work. <ref type="bibr">25</ref> Here, we recapitulate only the key steps to identify the MEP via the simplified string method. First, we locate the starting point of the string (i.e., the diffusion path or MEP) by searching the position and orientation of a gas molecule that minimize the external potential at the entrance plane. Because of the periodic structure of MOFs, the starting and end points of the string have identical relative configurations within their own unit cells. Next, we construct an initial string by evenly placing a certain number of points (viz., images) between the starting and end points. Finally, MEP is obtained by iterative evolution of the images along the string according to the gradient of the full external potential. After each round of iteration, the string is renormalized to avoid images collectively falling into any local energy minima.</p><p>All diffusion coefficients reported in this work are predicted from the MEP obtained from the GPU-accelerated simplified string method. We have also validated the diffusion coefficients calculated with our method by carrying out molecular dynamics (MD) simulation for top 5 MOFs with highest rMPS and for the best inverse designed MOF. For all MD simulations, we used the LAMMPS package with the Nose-Hoover thermostat for controlling the temperature. The detailed settings follow the protocol suggested in the literature for infinite dilution. <ref type="bibr">47</ref> </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.4">Genetic Algorithm</head><p>For inverse design, we use the genetic algorithm (GA) to construct MOFs with desired properties. Because MOFs can be decomposed into the secondary building blocks (SBUs), each material may be considered as a 'chromosome' of different topologies, metal nodes and organic linkers. Because the supermajority of MOF topologies can accommodate no more than two types of metal nodes and three types of organic linkers, each chromosome consists of 6 genes, and each gene is represented by an integer which corresponds to a specific topology, node or linker. The chromosome representation allows for the efficient sampling of all possible combinations of topology and SBUs with GA. The population is set as 2,000 which makes the initial generation to have diverse combinations of topologies, nodes and linkers. Three evolutions are carried after the initial population and a total of 8,000 combinations of topologies, nodes and linkers is explored to find the optimal MOF structure with desired properties. Compared with the MOF structural database used in high-throughput screening, less MOF structures are used in the GA to benchmark its computational performance for the inverse design despite a much larger chemical space is considered. In the initial population, the 2,000 chromosomes are generated by the random selection of topologies, nodes and edges in the SBU database. <ref type="bibr">37</ref> In each generation, 10 MOFs are used to generate the offspring via single point crossover. The next generation of MOFs are selected by stochastic universal selection to avoid bias towards the SBUs with low fitness values while 30%</p><p>MOFs would have a random mutation on their genes. In this work, PyGAD library is used for the genetic algorithm. <ref type="bibr">48</ref> PORMAKE is used to construct MOFs when the chromosome values are assigned. 37       -3 &#8226;Pa -1 ) Kh(C2H6) (mol&#8226;m -3 &#8226;Pa -1 ) D0(C2H4) (m 2 &#8226;s -1 ) D0(C2H6) (m 2 &#8226;s -1 )</p><p>k IM (C2H4/C2H6) yfk-N379 1.682&#215;10 -2 9.828&#215;10 -4 1.677&#215;10 -9 1.467&#215;10 -9 20.8</p></div></body>
		</text>
</TEI>
