<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Global and Local Connectivities Describe Hydrogen Intercalation in Metal Oxides</title></titleStmt>
			<publicationStmt>
				<publisher>American Physical Society</publisher>
				<date>09/01/2023</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10479081</idno>
					<idno type="doi">10.1103/PhysRevLett.131.108001</idno>
					<title level='j'>Physical Review Letters</title>
<idno>0031-9007</idno>
<biblScope unit="volume">131</biblScope>
<biblScope unit="issue">10</biblScope>					

					<author>Evan V. Miu</author><author>James R. McKone</author><author>Giannis Mpourmpakis</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[We introduce global connectivity, Ω, which quantifies the detailed connectivity of a material and accurately captures reactivity behavior. We demonstrate that combining global and local connectivity describes how metal oxides interact with hydrogen. Blending density functional theory, graph theory, and machine learning we built a reactivity model which accurately predicts hydrogen intercalation potentials of different metal oxides experimentally measured in the lab. The use of global connectivity can accelerate materials design through the development of novel structure-property relationships.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Cation intercalation into metal oxide hosts is critical to many technologies, including energy storage <ref type="bibr">[1]</ref>, optoelectronics <ref type="bibr">[2]</ref>, and catalysis <ref type="bibr">[3]</ref>. Numerous experimental and computational techniques have been used to understand this phenomenon, including analytical electrochemistry <ref type="bibr">[4]</ref>,</p><p>x-ray techniques <ref type="bibr">[5]</ref>, infrared measurements <ref type="bibr">[6]</ref>, and density functional theory (DFT) calculations <ref type="bibr">[7]</ref>. These methods are regularly applied in concert <ref type="bibr">[8]</ref> and provide multifaceted views of how a material engages in intercalation. However, these approaches can be costly and/or time-consuming, inhibiting the development of application-specific intercalation materials. This limitation is not restricted to cation intercalation and is generally relevant to processes involving the formation or breakage of chemical bonds.</p><p>Structure-based descriptors <ref type="bibr">[9]</ref> have been employed to accelerate the characterization of bonding interactions in materials. These descriptors have been shown to correlate with difficult-to-obtain performance indicators, such as reaction rates and adsorption energies <ref type="bibr">[10]</ref>. The simplest example is the local coordination number (LCN), which is an integer giving the number of bonds that connect a specific atom to its neighbors <ref type="bibr">[11]</ref>. Notable extensions of the LCN are the generalized coordination number (for sites on metal nanoparticles) <ref type="bibr">[12]</ref> and the adjusted coordination number (for sites on metal oxides) <ref type="bibr">[13]</ref>. A further adaptation of the LCN is the orbitalwise coordination number <ref type="bibr">[14]</ref>, which incorporates additional information about electronic structure. Although each are powerful descriptors of chemical interactions, calculating these LCNs requires material-specific parametrization or the use of costly quantum chemical methods, limiting their ability to drive exploratory materials design.</p><p>In pursuit of increased generality and rapid prediction of bonding thermodynamics, more modern descriptors of structure venture beyond scalar LCNs to higher dimensional constructs that encode a given material as a graph <ref type="bibr">[15]</ref>. While LCNs describe the local environment in which a chemical reaction occurs, graph descriptors detail the full global structure of a material. The state of the art in machine learning applied to describing chemical bonding <ref type="bibr">[16,</ref><ref type="bibr">17]</ref> uses graph representations to accurately predict energetic behavior, suggesting that global connectivitynot just local coordination-could be used to describe and predict the thermodynamics of chemical bonding. It is therefore plausible that multiple scales of connectivity contribute to observable chemical behavior.</p><p>This Letter demonstrates how global and local connectivities can be used together to quantitatively predict the equilibrium energetics of hydrogen intercalation in metal oxides. We have previously investigated hydrogen intercalation in tungsten oxide using acid-base properties as descriptors <ref type="bibr">[18]</ref>; however, these properties must be determined via computationally demanding quantum chemical calculations. This work expands the set of materials under investigation and uses inexpensive, structure-based descriptors to determine the energetics of hydrogen intercalation in oxides. Here, we introduce a mathematical definition of global connectivity, &#937;. Importantly, &#937; is a single number (not a full graph or matrix) that characterizes the overall connectivity of a structure. It is therefore as interpretable as a LCN and can be used to differentiate between materials based on their distinct geometries. Equations ( <ref type="formula">1</ref>)-( <ref type="formula">4</ref>) summarize the definition of &#937; for a general atomic structure, and Fig. <ref type="figure">1</ref> illustrates the calculation of &#937; for two contrasting examples of fictitious metal oxides.</p><p>Equation (1) gives the symmetric normalized Laplacian matrix <ref type="bibr">[19]</ref> L, which catalogs a structure's atom-to-atom connections. L is an NxN matrix, with N being the total number of atoms contained in the molecule, nanoparticle, unit cell, etc. For the oxides in this work, N includes all metal, oxygen, and hydrogen atoms present in the unit cell. Therefore, N increases with higher levels of hydrogen insertion. In Eq. (1), u and v are atom indices and d i is the degree of atom i. Here, the degree is the LCN of an atom, i.e., the number of atoms which are directly bonded to it. Further discussion on L is included in Sec. S1 of the Supplemental Material <ref type="bibr">[20]</ref>. Notably, L can be constructed without the use of bond distances and angles. It can be determined for an arbitrary system without any geometry optimization or specific knowledge of spatial arrangement. This is demonstrated in Fig. <ref type="figure">1</ref>, which shows a more connected oxide in the top row and a less connected oxide in the bottom row. More connected systems are those that are more coordinatively saturated and densely bonded. The center column of Fig. <ref type="figure">1</ref> shows portions of the unique L representations of each oxide, computed according to Eq. ( <ref type="formula">1</ref>).</p><p>The spectrum of eigenvalues for a given system, &#955;, is determined from L with Eq. <ref type="bibr">(2)</ref>. In this eigenvalue decomposition, &#955; is a vector containing N entries and X is the corresponding set of N eigenfunctions. &#955; is invariant to the ordering of rows and columns in L, i.e., the sequence of atom labeling does not impact the resulting spectrum. When ordering from smallest to largest, the first entry, &#955; 0 , is always zero. All remaining entries are strictly positive and lie between zero and 2 because of the normalization of L <ref type="bibr">[19]</ref>.</p><p>&#937; is calculated from three quantities given by &#955;: the algebraic connectivity &#955; 1 <ref type="bibr">[36]</ref>, the spectral gap &#955; &#1013; <ref type="bibr">[37]</ref>, and the Laplacian-energy-like invariant, LEL <ref type="bibr">[38]</ref>. The invariance of &#955; asserts that these three values are also insensitive to atom numbering. As a result, they are known as graph invariants. Each holds unique information regarding the structure of the atomic system described by a particular L matrix.</p><p>&#955; 1 is the second entry of &#955; when the eigenvalues are ordered from smallest to largest. It has been shown to quantify the connectivity of networks represented by L, where smaller values are less connected and larger values are more connected <ref type="bibr">[36]</ref>. For example, the more connected oxide in Fig. <ref type="figure">1</ref> has a &#955; 1 of 0.134, while the less connected oxide has a &#955; 1 of 0. A weakness of &#955; 1 is that it cannot distinguish different systems if they individually contain nonbonded sections: they will each have &#955; 1 values of 0, even if their geometries are dramatically dissimilar. Including &#955; &#1013; in the definition of &#937; helps alleviate this limitation. &#955; &#1013; is the smallest nonzero eigenvalue of L <ref type="bibr">[37]</ref>. It quantifies how easy it is to traverse from one point in a graph to another. Smaller values indicate that a structure contains multiple isolated parts which are connected by a small number of bonds. Because &#955; &#1013; is defined to be nonzero, it can uniquely characterize layered systems that cannot be distinguished via &#955; 1 .</p><p>Finally, the LEL invariant is used to scale &#955; 1 and &#955; &#1013; to make fair comparisons between systems containing different numbers of atoms. It is computed as the sum of the square roots of all entries in &#955;, according to Eq. (3). The LEL invariant is used as a normalization factor to account for effects on global connectivities computed for systems of different sizes.</p><p>Equation ( <ref type="formula">4</ref>) defines the global connectivity, &#937;, of a chemical system using each of the three graph invariants FIG. <ref type="figure">1</ref>. Scheme for calculating &#937; for two example oxide systems. The top system is more connected, and the bottom system is less connected; atoms in the top system are, on average, bonded to more neighboring atoms than those in the bottom system. Portions of the L matrices for each system are shown in the center column. Each entry is computed according to Eq. ( <ref type="formula">1</ref>). Performing eigenvalue decompositions on L c and L l yields the full eigenvalue spectra, &#955; c and &#955; l . &#937; c and &#937; l are computed from these spectra according to Eq. ( <ref type="formula">4</ref>). More connected systems have larger &#937; values. discussed above. &#937; ranges from 0 to 1 and can be interpreted as the fraction of &#955; associated with a system's connectivity. This is illustrated in Fig. <ref type="figure">1</ref>, with the more connected oxide exhibiting a larger &#937; than the less connected oxide. With this definition of &#937;, we introduce the concept of global connectivity to explore the thermodynamics of hydrogen intercalation in metal oxides. First, we generated a dataset of hydrogen binding energies at various lattice oxygen sites in five bulk metal oxides. These binding energies were computed using DFT. Details are given in Sec. S2 of the Supplemental Material <ref type="bibr">[20]</ref>. The oxides we considered include V 2 O 5 , MoO 3 , WO 3 , VO 2 , and TiO 2 , as described in Sec. S3 of the Supplemental Material <ref type="bibr">[20]</ref>. Figure <ref type="figure">2</ref> shows how the DFT binding energies of hydrogen atoms in each oxide correlate with the inverse of &#937;. As can be seen, oxides with smaller &#937; values (less connected systems) exhibit more exothermic H-binding energies. These systems contain a large number of undercoordinated sites which are amenable to bonding with a guest hydrogen atom. Systems with higher &#937; values have sites that are more coordinatively saturated, resulting in more endothermic (or less exothermic) binding.</p><p>Although Fig. <ref type="figure">2</ref> suggests a relationship between hydrogen binding energies and &#937;, in some cases, single values of &#937; returned multiple values of hydrogen binding energies. This is a consequence of the global nature of &#937;: a given oxide has a unique &#937; value, so that independent site types within its structure cannot be differentiated. These multiple site types also present many possible intercalation configurations <ref type="bibr">[18]</ref>. The colored symbols in Fig. <ref type="figure">2</ref> show the energetics of the most stable binding configurations for each investigated oxide stoichiometry, and the black symbols show the unstable configurations. A line of best fit was drawn considering the most stable intercalated systems, since these are the ones with the highest probability of contributing to observed intercalation behavior.</p><p>Overall, &#937; captured the trend of increasing H binding strength across oxides (TiO 2 &lt; VO 2 &lt; WO 3 &lt; MoO 3 &lt; V 2 O 5 ), illustrating its ability to describe meaningful structural traits of each metal oxide. However, using &#937; alone as a descriptor for hydrogen binding energy gives an uncertainty &gt; 0.3 eV, which is too large to quantitatively predict the equilibrium energetics of hydrogen intercalation. The invariability of &#937; with atomic configuration also makes it impossible to obtain precise site-specific hydrogen binding energies from &#937; alone.</p><p>Hydrogen (proton-coupled electron) intercalation into metal oxides can be viewed as two events: (1) binding of a proton to a lattice oxygen and (2) the addition of an electron in the diffuse oxide conduction bands. The global nature of &#937; could capture information relevant to event 2. However, proton affinity (relevant to event 1) is dependent on local site coordination <ref type="bibr">[39]</ref>. Therefore, we hypothesized that augmenting &#937; with a local coordination descriptor could provide superior predictions of hydrogen insertion behavior in metal oxides. To construct an intercalation model capable of explaining site-specific energetics, we included local information through a scaled coordination number, CN s , defined by Eq. ( <ref type="formula">5</ref>):</p><p>The numerator is the number of bonds within two nearest neighbors of the site of interest, demonstrated by Fig. <ref type="figure">S2</ref> of the Supplemental Material <ref type="bibr">[20]</ref>. The denominator is the total number of bonds in the unit cell (according to the relevant oxide crystal structure). Hence, CN s is the fraction of bonds in the unit cell which are associated with a given site. This description of local coordination differentiates each site type and assists &#937; in describing hydrogen intercalation energies. Taken together, &#937; and CN s quantify structure at global and local scales.</p><p>Figure <ref type="figure">3</ref> summarizes the performance of a kernel ridge regression (KRR) model that was used to calculate hydrogen intercalation energies using only &#937; and CN s . This model was trained on DFT-calculated hydrogen binding energies in the investigated metal oxides. Full details on model construction are included in Sec. S5 of the Supplemental Material <ref type="bibr">[20]</ref>. Intercalation chemistry is typically analyzed through the use of convex energy hulls, such as those shown in Fig. <ref type="figure">3(a)</ref>. The x axis shows the amount of hydrogen contained in the oxide lattice, and the y axis is the formation energy of a given intercalated oxide. As can be seen, the KRR model reproduced the DFT convex energy hulls with a root mean square error (RMSE) of 0.129 eV and a mean absolute error (MAE) of 0.089 eV FIG. <ref type="figure">2</ref>. Correlation between hydrogen binding energies in extended metal oxides with the inverse of the global connectivity, &#937;. Less connected oxides containing a larger number of undercoordinated sites bind hydrogen atoms more strongly, and more connected systems with fully saturated atoms bind hydrogen more weakly. The larger colored symbols represent stable configurations at each stoichiometry, and the black symbols correspond to unstable configurations. A line of best fit is drawn through the set of stable configurations.</p><p>(in terms of H-binding energy on a site-specific basis). Accordingly, Fig. <ref type="figure">3</ref>(a) suggests that &#937; and CN s could be useful descriptors of hydrogen binding in these metal oxides.</p><p>While the convex hulls identify stable phases of intercalated oxides, a more practical thermodynamic quantity is the hydrogen intercalation potential, U H . These potentials can be calculated from the slopes of the convex energy hulls (see Supplemental Material <ref type="bibr">[20]</ref> Sec. S2). Figure <ref type="figure">3(b)</ref> compares experimental U H values to those calculated from the KRR hulls. Each horizontal grouping of data points is a collection of experimentally measured U H values for a single H-intercalation equilibrium potential. For example, the red circles are the TiO 2 =H 0.125 TiO 2 redox couple. The vertical lines on each data point are the error associated with the model prediction, taken as the MAE of the KRR model (AE0.089 eV). The filled markers are data extracted from published literature reports <ref type="bibr">[18,</ref><ref type="bibr">[40]</ref><ref type="bibr">[41]</ref><ref type="bibr">[42]</ref><ref type="bibr">[43]</ref><ref type="bibr">[44]</ref><ref type="bibr">[45]</ref><ref type="bibr">[46]</ref><ref type="bibr">[47]</ref><ref type="bibr">[48]</ref>, and the open markers are from measurements we performed. Methods used to obtain the experimental data included in Fig. <ref type="figure">3(b</ref>) are described in Secs. S6 and S7 of the Supplemental Material <ref type="bibr">[20]</ref>.</p><p>In general, the KRR model reproduced experimental potentials with an accuracy that would be expected from DFT calculated potentials, giving a RMSE of 0.116 eV. Furthermore, for oxides with data on more than one hydrogen intercalation equilibrium, the structure-based model correctly captured the phenomena of higher stoichiometries having more negative intercalation potentials. The performance of &#937; and CN s versus other metrics of connectivity is provided in Sec. S8 of the Supplemental Material <ref type="bibr">[20]</ref>. We reiterate that determining &#937; and CN s did not require any quantum chemical calculations or other expensive computations other than generating data for developing the model. These features therefore provide a rapid route for determining hydrogen intercalation potentials in metal oxides that is based entirely on the structure of the parent oxides. In principle, the same statistical models could be built using experimental data from one set of metal oxides to predict insertion potentials of another set.</p><p>To interrogate the physical relevance of &#937; to intercalation, we applied a convolutional neural network (CNN) to the same regression task addressed with the KRR model. The major difference in this approach was that the inputs were taken directly as the L matrices. During training, this allowed the CNN to autoembed representations of L which were significant to hydrogen intercalation. If the CNN embeddings correlate with &#937;, that would suggest &#937; is a meaningful metric of connectivity. The results of this analysis are shown in Fig. <ref type="figure">4</ref>, and full implementation details of the CNN are described in Sec. S9 of the Supplemental Material <ref type="bibr">[20]</ref>.</p><p>The axes in Fig. <ref type="figure">4</ref> are the first and second principal components of the CNN's final hidden layer. Each data point is an intercalated oxide, and the color represents each oxide's global connectivity. There is a clear gradient in color moving from left to right in Fig. <ref type="figure">4</ref>, suggesting that &#937; is strongly correlated with the feature space that was generated by the network. The hidden layer node values, and therefore their principal components, are interpretable as features that have been self-generated by the network during training. Hence, without explicitly considering Eqs. ( <ref type="formula">1</ref>)-( <ref type="formula">4</ref>), the CNN arrived at a hidden layer feature space that closely resembled &#937;. This interpretation of our model gives further evidence of system-scale, global connectivity as a physically meaningful descriptor of hydrogen intercalation thermodynamics in metal oxides.</p><p>In conclusion, this Letter introduced global connectivity &#937; as a reactivity descriptor in redox-active metal oxides. We first derived an expression for &#937; and illustrated its roots in spectral graph theory. Using DFT to calculate hydrogen intercalation energies in a set of five oxides, we showed that &#937; captured the general thermodynamic trends of intercalation. Then, we combined &#937; with a scaled local coordination, CN s , to build a regression model that depended only on structural information that is readily obtained from experiment. This structure-based model leveraged both global and local connectivities to accurately capture the trends and values of hydrogen intercalation potentials across all investigated oxides when compared against electrochemical experiments performed in the lab. To further support &#937; as a meaningful descriptor of oxide redox reactivity, we applied a convolutional neural network to convert Laplacian matrices directly into hydrogen intercalation energies. We observed a strong correlation between the principal component space of the final hidden layer of the network and &#937;, suggesting that the CNN embedded the linear algebra that links L and &#937;. Future work can further develop the concept of global connectivity by incorporating more node-and edge-level information into L. These information-rich Laplacians could then be compared to more sophisticated network models, such as crystal graph convolutional neural networks <ref type="bibr">[15]</ref>. Overall, this Letter introduces &#937; as a useful metric for global connectivity and demonstrates how simplified representations of global and local structure can be used together to predict hydrogen binding in metal oxides. </p></div></body>
		</text>
</TEI>
