<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Statistical analysis of C–H activation by oxo complexes supports diverse thermodynamic control over reactivity</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>03/25/2021</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10227152</idno>
					<idno type="doi">10.1039/D0SC06058E</idno>
					<title level='j'>Chemical Science</title>
<idno>2041-6520</idno>
<biblScope unit="volume">12</biblScope>
<biblScope unit="issue">11</biblScope>					

					<author>Joseph E. Schneider</author><author>McKenna K. Goetz</author><author>John S. Anderson</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Transition metal oxo species are key intermediates for the activation of strong C–H bonds. As such, there has been interest in understanding which structural or electronic parameters of metal oxo complexes determine their reactivity. Factors such as ground state thermodynamics, spin state, steric environment, oxygen radical character, and asynchronicity have all been cited as key contributors, yet there is no consensus on when each of these parameters is significant or the relative magnitude of their effects. Herein, we present a thorough statistical analysis of parameters that have been proposed to influence transition metal oxo mediated C–H activation. We used density functional theory (DFT) to compute parameters for transition metal oxo complexes and analyzed their ability to explain and predict an extensive data set of experimentally determined reaction barriers. We found that, in general, only thermodynamic parameters play a statistically significant role. Notably, however, there are independent and significant contributions from the oxidation potential and basicity of the oxo complexes which suggest a more complicated thermodynamic picture than what has been shown previously.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Introduction</head><p>The activation of C-H bonds through proton-coupled electron transfer (PCET) underpins a wide range of biological and synthetic processes. The applications of this reaction include drug metabolism by cytochrome P450 enzymes as well as synthetic methods for the preparation of fine chemicals. <ref type="bibr">[1]</ref><ref type="bibr">[2]</ref><ref type="bibr">[3]</ref> In many cases PCET reactions are mediated by transition metal oxo intermediates generated within either protein-based or synthetic ligand scaffolds. In these reactions, both a proton and an electron are transferred to an oxo complex resulting in the net removal of a hydrogen atom from the organic substrate. The generality of this reaction combined with the ubiquity of C-H bonds in synthesis has led to considerable interest in determining what properties govern the PCET reactivity of transition metal oxo species.</p><p>A large body of work supports that the free energy of reaction (&#916;GPCET) is central to transition metal oxo mediated C-H activation and also offers a great deal of explanatory and predictive power. <ref type="bibr">[4]</ref><ref type="bibr">[5]</ref><ref type="bibr">[6]</ref><ref type="bibr">[7]</ref> Recently however, additional properties have been cited as important although it is not clear if any have a widespread effect on reactivity. Individual cases support the influence of O-centered spin density, <ref type="bibr">8</ref> spin state, <ref type="bibr">[9]</ref><ref type="bibr">[10]</ref><ref type="bibr">[11]</ref> steric environment, <ref type="bibr">[12]</ref><ref type="bibr">[13]</ref><ref type="bibr">[14]</ref> the free energies of proton and electron transfer (&#916;GPT and &#916;GET), <ref type="bibr">[15]</ref><ref type="bibr">[16]</ref><ref type="bibr">[17]</ref><ref type="bibr">[18]</ref><ref type="bibr">[19]</ref><ref type="bibr">[20]</ref> or the asynchronicity (&#951;) of the reaction, <ref type="bibr">[21]</ref><ref type="bibr">[22]</ref><ref type="bibr">[23]</ref><ref type="bibr">[24]</ref> but there is a lack of consensus regarding their generality and relative importance (Scheme 1). <ref type="bibr">8,</ref><ref type="bibr">[25]</ref><ref type="bibr">[26]</ref><ref type="bibr">[27]</ref> Very few studies have explored these parameters outside of a narrow range of complexes, <ref type="bibr">4,</ref><ref type="bibr">6,</ref><ref type="bibr">10,</ref><ref type="bibr">20,</ref><ref type="bibr">21,</ref><ref type="bibr">28</ref> and none have statistically examined the significance of parameters other than &#916;GPCET on the reactivity of a broad set of metal oxo complexes.</p><p>We previously found an atypical dependence on &#916;GPT in the concerted C-H activation reactivity of a terminal Co III oxo complex which contrasts with the expected rate dependence on &#916;GPCET. <ref type="bibr">15</ref> Given the disparity of this result with the literature, we sought to understand the interplay of characteristics affecting a broad range of transition metal oxo mediated PCET reactions using multivariable linear free energy relationships (LFERs). These models can be used to relate experimentally determined data, such as reaction rates, to multiple predictor variables simultaneously. LFER models have recently been used as versatile tools to optimize organic methodology, predict reaction barrier heights, and investigate underlying mechanisms. <ref type="bibr">[29]</ref><ref type="bibr">[30]</ref><ref type="bibr">[31]</ref><ref type="bibr">[32]</ref><ref type="bibr">[33]</ref> We have applied this analysis to examine trends in rates of PCET mediated C-H activation for a broad dataset of previously reported metal oxo complexes. This analysis enables a statistical examination of several hypotheses regarding what parameters of metal oxo species determine their PCET reactivity. Unsurprisingly, we observe that &#916;GPCET is the most important factor. However, we also observe a significant role for &#916;GPT and &#916;GET beyond and independent of their contribution to &#916;GPCET. Furthermore, the other parameters investigated do not have broad significance. These results suggest that thermodynamic factors are generally the dominant contributors to transition metal oxo C-H activation reactivity, but also demonstrate that thermodynamic parameters beyond the commonly invoked &#916;GPCET are influential.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results and Discussion</head><p>We began our analysis by compiling an extensive data set of second order rate constants (k2) for the oxidation of 9,10dihydroanthracene (DHA) by thirty well-characterized metal oxo complexes. <ref type="bibr">12,</ref><ref type="bibr">13,</ref><ref type="bibr">15,</ref><ref type="bibr">17,</ref><ref type="bibr">[34]</ref><ref type="bibr">[35]</ref><ref type="bibr">[36]</ref><ref type="bibr">[37]</ref><ref type="bibr">[38]</ref><ref type="bibr">[39]</ref><ref type="bibr">[40]</ref><ref type="bibr">[41]</ref><ref type="bibr">[42]</ref><ref type="bibr">[43]</ref><ref type="bibr">[44]</ref><ref type="bibr">[45]</ref><ref type="bibr">[46]</ref><ref type="bibr">[47]</ref><ref type="bibr">[48]</ref><ref type="bibr">[49]</ref><ref type="bibr">[50]</ref><ref type="bibr">[51]</ref> This diverse data set comprises complexes of five different transition metals, tetragonal and trigonal geometries, spin multiplicities from 1 to 5, charges from -1 to +3, and d-electron counts from 0 to 6. We manually divided the data set into a training set of seventeen metal oxo complexes and a test set of thirteen metal oxo complexes such that each set had a diverse mix of species. For our analysis, we calculated the values of parameters that have been hypothesized as important to metal oxo mediated PCET reactivity for each metal oxo complex. The investigated parameters include the steric environment (quantified by percent buried volume, %BV), <ref type="bibr">52</ref> the spin density on the oxygen atom (via intrinsic bond order (IBO) analysis), <ref type="bibr">53,</ref><ref type="bibr">54</ref> the energetic cost of accessing a higher spin state if one lies closer to the product spin multiplicity than the reactant ground spin state (spin excitation), the thermodynamic free energies &#916;GPCET, &#916;GPT, and &#916;GET, and the magnitude of the asynchronicity parameter (|&#951;|). <ref type="bibr">21</ref> The detailed approaches used to determine the values for these parameters are provided in the methods section. Importantly, none of these parameters require transition state optimizations which are relatively difficult and less reliable than ground state calculations. <ref type="bibr">55,</ref><ref type="bibr">56</ref> While this precludes a direct estimation of transition state effects, we expect to indirectly capture some of them. For instance, parameters such as &#916;GPCET and |&#951;| have been shown to correlate with tunneling. <ref type="bibr">23,</ref><ref type="bibr">57</ref> We examined the effect of each of these parameters on experimental reaction barriers by building multivariable free energy models via ordinary least squares regression of the barrier heights against the parameters. Each model consists of a set of coefficients (with variable units such that the product with the respective parameter gives units of kcal/mol) and an intercept (with units kcal/mol). These models were used to generate predicted reaction barriers for each data point, which could be compared with experimental reaction barriers to assess the utility of the model. Because &#916;GPCET has strong theoretical and experimental support for affecting reaction barrier heights, <ref type="bibr">[4]</ref><ref type="bibr">[5]</ref><ref type="bibr">[6]</ref> we analyzed each parameter in combination with &#916;GPCET and compared the resulting model to regression against &#916;GPCET alone.</p><p>We evaluated each regression based on R 2 , leave-one-out (LOO) R 2 (sometimes referred to as Q 2 ), and a statistical Ftest. <ref type="bibr">58</ref>-61 R 2 is a goodness of fit measure which quantifies the amount of variation explained by a model. The predictive ability of a model is gauged with LOO R 2 , in which each data point is left out and predicted by the remaining data points and the goodness of fit is then reevaluated. Critically, unlike regular R 2 this metric does not necessarily improve with an increase in parameters; overfitted models with too many parameters perform poorly with LOO R 2 . For each R 2 , a value close to 1 indicates a good fit. Finally, we report the p-value from an F-test on each model, which shows the probability the observed correlation arises from statistical noise. The lower this p-value is, the more significant a given parameter. Additionally, the calculation of p-values considers the number of parameters added to a model, so, as with LOO R 2 , an F-test is not biased in favor of adding more parameters.</p><p>A summary of our findings is presented in Table <ref type="table">1</ref>. In line with previous reports, we find a strong correlation between the experimental reaction barriers and &#916;GPCET. This parameter alone explains 70% of the variation in reaction barriers within the training set (R 2 = 0.70) and has high predictive ability (LOO R 2 = 0.60). Interestingly, most other parameters do not significantly improve the model. While we do observe a small correlation with %BV steric metrics, the magnitude of the effect is too small to be statistically significant. Compared to the &#916;GPCET only model, spin-based parameters and |&#951;| barely improve R 2 and perform similarly or worse in LOO cross-validation. While it is difficult to rule out the importance of these parameters in individual cases, an F-test indicates they do not have a statistically significant effect across our entire data set.</p><p>In contrast, addition of &#916;GPT and &#916;GET does significantly improve the fit. For this {&#916;GPCET, &#916;GPT, &#916;GET} model, R 2 increases from 0.70 to 0.86 and LOO R 2 increases from 0.60 to 0.71, indicating both better explanation of the available data and better predictive ability. An F-test gives p &lt; 0.01 which suggests the observed effect is statistically significant. The equation from this fit is &#916;G &#8225; = 0.31 &#916;GPCET + 0.07 &#916;GPT + 0.12 &#916;GET -0.26 (all coefficients unitless; free energies and intercept in kcal/mol). Typically, &#916;GPCET is a negative value while &#916;GPT and &#916;GET are positive values. Thus, the positive sign of the &#916;GPCET coefficient indicates that a more exergonic reaction will have a lower barrier while increases in &#916;GPT and &#916;GET will raise the barrier. The larger coefficient of &#916;GPCET indicates the reaction barrier is most sensitive to this free energy. Satisfyingly, the &#916;GPCET coefficient agrees with experimental data: for metal oxo complexes that have a demonstrated trend of log(kobs) vs. substrate bond dissociation free energy (BDFE), the average slope of &#916;G &#8225; vs. substrate BDFE is ~0.3 (see Table <ref type="table">S1</ref>), very similar to the 0.31 observed in our analysis. The intercept of -Please do not adjust margins Please do not adjust margins 0.26 contains contributions to the average error not accounted for by the three free energies.</p><p>The significance of &#916;GPT and &#916;GET is intriguing because the literature discussion of these values has often been framed in terms of how they contribute to &#916;GPCET rather than in terms of their intrinsic contribution to reaction barrier heights. <ref type="bibr">[16]</ref><ref type="bibr">[17]</ref><ref type="bibr">[18]</ref><ref type="bibr">[19]</ref><ref type="bibr">27</ref> However, &#916;GPT and &#916;GET as defined here are the energies to form the intermediates involved in stepwise reactivity -the protonated metal oxo with the deprotonated substrate, or the reduced metal oxo with the oxidized substrate (Scheme 1). Critically, &#916;GPT and &#916;GET do not form a full thermodynamic cycle with &#916;GPCET and thus are fundamentally distinct. This fact is statistically supported by poor correlations between &#916;GPCET and &#916;GPT and between &#916;GPCET and &#916;GET (-0.12 and 0.31, respectively, see Regression S6). Finally, we find that &#916;GPT and &#916;GET have importance independent of a contribution to &#916;GPCET as clearly demonstrated by the LOO R 2 s and F-tests. All of our analyses therefore suggest that the combination of &#916;GPT and &#916;GET is an independent and significant contributor to C-H activation barrier heights.</p><p>While the observation of a dependence on &#916;GPT and &#916;GET that arises from our linear regressions is principally empirical, it is consistent with prior theoretical models in the literature. The physical underpinning of this dependency on &#916;GPT and &#916;GET is likely due to mixing of proton transfer and electron transfer intermediates into the concerted transition state despite these intermediates never being fully realized. <ref type="bibr">[62]</ref><ref type="bibr">[63]</ref><ref type="bibr">[64]</ref> Within transition state theory, this can be envisioned by a More-O'Ferrall-Jencks plot in which the transition state lies not on a one-dimensional line connecting reactant and product but on a two-dimensional plane containing reactant, product, and intermediates. <ref type="bibr">65,</ref><ref type="bibr">66</ref> In this case, the intermediates arise from proton transfer and electron transfer, and when either &#916;GPT or &#916;GET lowers in energy the transition state character can adopt structural and electronic components of these intermediates resulting in a lower barrier height. While the use of these classical structureenergy relationships to analyse PCET reactions has been questioned recently, <ref type="bibr">26,</ref><ref type="bibr">27</ref> proton transfer and electron transfer states and their energies also have roles in nonadiabatic rate theories of PCET which treat proton transfer in a quantum mechanical fashion. <ref type="bibr">67,</ref><ref type="bibr">68</ref> Therefore, the use of &#916;GPT and &#916;GET to predict barrier heights of PCET is consistent with prior theoretical foundations.</p><p>Assigning a direct role for &#916;GPT and &#916;GET is in line with recent computational studies of PCET transition states which invoke off-&#916;GPCET diagonal thermodynamic terms from Scheme 1, such as asynchronicity (&#951;), as key contributors to DFT derived reaction barriers. <ref type="bibr">[21]</ref><ref type="bibr">[22]</ref><ref type="bibr">[23]</ref><ref type="bibr">[24]</ref> Asynchronicity is derived not from the sum of &#916;GPT and &#916;GET, but rather their difference. Conversely, we instead find that the sum of &#916;GPT and &#916;GET have a more significant effect than |&#951;|. The reason for this discrepancy is unclear, but a possible explanation is that experimental noise prevents us from observing a comparatively more subtle trend between |&#951;| and the experimental reaction barrier heights. Furthermore, the well-controlled nature of the series of complexes previously investigated for asynchronicity may have too little variation in (&#916;GPT + &#916;GET) to manifest similarly to the effects we observe here.</p><p>Another way in which our data may not be amenable to investigating the effect of |&#951;| is the variable reorganization energy of the metal oxo complexes examined here. |&#951;| is specifically framed as an adjustment to the Marcus reorganization energy; 21 therefore |&#951;|'s effect may only be clear when reorganization is properly accounted for. While it is clear that the reorganization energy is important to PCET reactivity, there is no established way to compute it without computationally expensive transition state geometries. <ref type="bibr">11,</ref><ref type="bibr">[67]</ref><ref type="bibr">[68]</ref><ref type="bibr">[69]</ref> We have made multiple attempts to derive reorganization or deformation parameters using the optimized metal oxo and metal hydroxide geometries and frequencies, but none of these parameters have statistically significant contributions to predicted reaction barriers with or without |&#951;| (see ESI). Therefore, a combination of noise in the experimental data and our inability to compute a reliable reorganization parameter could preclude us from observing an effect of |&#951;| on the barrier heights. Nonetheless, previous studies as well as this current work offer increasing support that off-&#916;GPCET diagonal thermodynamic terms such as &#916;GPT and &#916;GET have important effects on reactivity independent of &#916;GPCET. Please do not adjust margins</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Please do not adjust margins</head><p>The all-thermodynamic model we find here provides insights and possible alternative explanations for previously reported trends in PCET reactivity. In one study, <ref type="bibr">12</ref> steric and spin state effects were invoked to explain the comparatively high reactivity of the S = 2 complex [Fe IV (O)(TMG2dien)(CH3CN)] 2+ . A higher rate of C-H activation as compared to S = 2 [Fe IV (O)(TMG3tren)] 2+ was ascribed to reduced steric hinderance in the TMG2dien complex, <ref type="bibr">13</ref> and the higher rate of C-H activation as compared to the S = 1 complexes [Fe IV (O)(N4Py)] 2+ and [Fe IV (O)(TMC)(CH3CN)] 2+ was ascribed to the S = 2 spin state in the TMG2dien complex. <ref type="bibr">36,</ref><ref type="bibr">44</ref> However, it was noted that the even faster reactivity of [Fe IV (O)(Me3NTB,CH3CN)] 2+ , which is S = 1 and has a similar %BV profile to [Fe IV (O)(TMG2dien)(CH3CN)] 2+ , <ref type="bibr">34</ref> is not easily explained by either hypothesis. Our analysis suggests that the thermodynamic properties of these complexes may provide an alternative explanation in these comparisons (see Table <ref type="table">S9</ref>). The Me3NTB complex has by far the most exergonic reaction with DHA (&#916;GPCET = -16 kcal/mol), followed by the TMG2dien complex (&#916;GPCET = -9 kcal/mol), followed by the complexes of TMG3tren, TMC, and N4Py (&#916;GPCET = -7, -6, and -6 kcal/mol, respectively). Thus, thermodynamic parameters would predict the Me3NTB complex to have the lowest reaction barrier and fastest rate of reaction, with the TMG2dien complex being the next most reactive, and the remaining complexes the least reactive as is observed experimentally.</p><p>In another study, it was observed that the rates of PCET reactions performed by [Fe IV (O)(TMC)(X)] n+ decrease with more strongly donating axial ligands X. <ref type="bibr">36</ref> Variation in &#916;GPCET was ruled out as a cause of this trend, as it was calculated to be similar for all complexes investigated. It was suggested that the accessibility of a high-spin state may explain this variation in the rates, as the energy of the quintet excited state decreased with stronger X ligands. However, our calculations indicate that while stronger axial donors increase &#916;GET, &#916;GPT decreases more substantially (see Table <ref type="table">S9</ref>). In our model, these changes result in a net decrease in the reaction barrier, suggesting that despite a similar &#916;GPCET, the reactivity trend could be explained by thermodynamic effects. These analyses do not rule out that spin state or steric effects may be important in the previous studies, but suggest that thermodynamics may also play an important role.</p><p>The fit of the training data to {&#916;GPCET, &#916;GPT, &#916;GET} and this model's performance on the test set is depicted graphically in Figure <ref type="figure">1</ref>. It is clear that the reaction barriers for most metal oxo complexes in the test set are well predicted. Nonetheless, several metal oxo complexes (given unique symbols in Figure <ref type="figure">1</ref>) deserve further discussion.</p><p>The {&#916;GPCET, &#916;GPT, &#916;GET} model behaves the most poorly in predicting reaction barriers for the Fe IV oxo and Co IV oxo complexes of the ligand 13-TMC. <ref type="bibr">49,</ref><ref type="bibr">70</ref> Essentially no barrier is predicted for these reactions, which is not observed experimentally. This is due to a large negative calculated &#916;GPCET in both cases; in fact, these complexes are outliers even in the &#916;GPCET only fit (see Regression S1). The cause of this discrepancy is not entirely clear. However, it appears to be systemic to the particular ligand scaffold rather than the identity of the metal center, which suggests this discrepancy could arise from ambiguity in the primary coordination sphere of these complexes. No structural characterization is reported for the Fe IV complex, and while a short Co-O bond is identified by EXAFS for the Co IV complex, it is difficult to conclusively determine the primary coordination sphere. Any discrepancy in coordination sphere would render our calculated parameters incorrect, potentially explaining their inability to predict the experimental reaction barriers.</p><p>The reaction barrier is overestimated for all Ru oxo complexes, and for three of them by more than two kcal/mol. As Ru is the only second row transition metal in our data set, we suspect this overestimation is due to a consistent difference between first and second row transition metals rather than Ru examples not following the same trends. For instance, it is possible that the Ru oxo complexes have relatively low structural reorganization energy or that relativistic effects influence the coefficients. It may also simply be a change in the systemic DFT error upon going to the second row. Regardless, regression of barriers from the kinetics of an individual Ru oxo complex reacting with several different substrates reveals there is a trend with &#916;GPCET, &#916;GPT, and &#916;GET with similar coefficients to those obtained from the more general model with multiple different oxo complexes (see ESI). This supports that the same trends in free energies are at play in the Ru oxo complexes.</p><p>Interestingly, the {&#916;GPCET, &#916;GPT, &#916;GET} model only moderately underestimates the reaction barrier (by ~2 kcal/mol) for a terminal Co III oxo complex which has unusual trends in its reactivity with various substrates. <ref type="bibr">15</ref> Unlike most metal oxo complexes, the reactivity of this complex does not have a clear trend with &#916;GPCET; its kinetics are instead dominated by &#916;GPT. Therefore, its adherence to trends in {&#916;GPCET, &#916;GPT, &#916;GET} as seen for the broad set of metal oxo complexes deserves further investigation. We regressed the experimental reaction barriers for the reactivity of this complex Please do not adjust margins Please do not adjust margins with several substrates against only &#916;GPT as well as against {&#916;GPT, &#916;GPCET} (Figure <ref type="figure">2</ref>). We find that the inclusion of &#916;GPCET significantly improves the model, increasing R 2 from 0.94 to 0.97 and LOO R 2 from 0.93 to 0.95 and having an F-test p-value of 0.02 (see ESI). However, the relative weighting of the contribution from &#916;GPCET is quite different than for the broader set of complexes.</p><p>In the broader set we observe that &#916;GPCET has a larger effect on the reaction barriers than either &#916;GPT or &#916;GET, which is reflected in the larger coefficient for the &#916;GPCET term than for the &#916;GPT and &#916;GET terms in the fit equation (Figure <ref type="figure">1</ref>). In contrast, &#916;GPT has a greater effect than &#916;GPCET on the reaction barriers for the Co III oxo complex, again reflected in the magnitude of their coefficients (Figure <ref type="figure">2</ref>). Furthermore, the addition of &#916;GET significantly improves the model for the broader set of metal oxo complexes (Table <ref type="table">1</ref>) but is insignificant for the series of substrates reacting with the Co III oxo complex (p-value &gt; 0.05, see Regressions S42 and S43). Overall, this Co III oxo complex is not so dissimilar from the broader set of metal oxo complexes in that the same thermodynamic free energies explain the reactivity of both. However, this individual case demonstrates a different weighting of parameters than that observed in the broad set.</p><p>Our analysis of the Co III reactivity rests on the assumption that the coefficients of the model do not change appreciably from substrate to substrate. To test this assumption, we extended our analysis of the larger set of metal oxo complexes to include reactivity with 1,4-cyclohexadiene (CHD), fluorene, and xanthene in addition to DHA. We refit the model with reported data for the reactions between each substrate and all metal oxo complexes (excluding the previously discussed Ru and 13-TMC oxo complexes). As with our regressions for DHA alone, the inclusion of &#916;GPT and &#916;GET notably improves the fit (Table <ref type="table">1</ref>, Figure <ref type="figure">3</ref>). Other parameters offer comparably little improvement to the fit and do not perform well by LOO cross validation. The equation for this model is &#916;G &#8225; = 0.23 &#916;GPCET + 0.04 &#916;GPT + 0.10 &#916;GET + 2.10, which is satisfyingly similar to the equation of the fit to DHA data alone, supporting the assumption that the coefficients of the model are not appreciably affected by the identity of the substrate.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Conclusions</head><p>Overall, this thorough analysis of the reported C-H activation reactivity of transition metal oxo complexes demonstrates that &#916;GPCET, &#916;GPT, and &#916;GET have a statistically significant correlation with the reaction barrier. Interestingly, no other parameters examined here, including steric environment and spin-based parameters, provide a significant improvement to a &#916;GPCET only model. This is in contrast to previous literature reports which implicate such factors in explaining metal oxo mediated PCET. The {&#916;GPCET, &#916;GPT, &#916;GET} model predicts all but five of the reaction barrier heights for reactivity with DHA within 2 kcal/mol, and predicts most of these barrier heights within 1 kcal/mol. We find that the asynchronous reactivity of a Co III oxo complex is also well-predicted, although a fit to just this complex alone reveals changes in the relative importance of &#916;GPCET, &#916;GPT, and &#916;GET. The structural and electronic bases for such variation, and whether concomitant changes in selectivity can be leveraged, are exciting avenues for future research.</p><p>While the relative importance of these thermodynamic parameters can vary between specific cases, this study on a broad set of metal oxo complexes suggests that thermodynamic parameters provide the most general contribution to reaction barriers. Furthermore, while a strong dependence on &#916;GPCET is observed, as is expected based on literature precedent, significant and independent contributions from &#916;GPT and &#916;GET are observed. This conclusion adds to the growing body of  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Methods</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Tabulation of Experimental Kinetics</head><p>In this study we used thirty reported k2 values of metal oxo species reacting with DHA. <ref type="bibr">12,</ref><ref type="bibr">13,</ref><ref type="bibr">15,</ref><ref type="bibr">17,</ref><ref type="bibr">[34]</ref><ref type="bibr">[35]</ref><ref type="bibr">[36]</ref><ref type="bibr">[37]</ref><ref type="bibr">[38]</ref><ref type="bibr">[39]</ref><ref type="bibr">[40]</ref><ref type="bibr">[41]</ref><ref type="bibr">[42]</ref><ref type="bibr">[43]</ref><ref type="bibr">[44]</ref><ref type="bibr">[45]</ref><ref type="bibr">[46]</ref><ref type="bibr">[47]</ref><ref type="bibr">[48]</ref><ref type="bibr">[49]</ref><ref type="bibr">[50]</ref><ref type="bibr">[51]</ref> For each of these oxo complexes, we tabulated various descriptors (metal, valency, dcount, coordination number, etc.), experimental parameters (M-O bond length and vibrational frequency, BDFE, pKa, etc.), and the reported kinetics for reactions with various substrates (k2, and, if reported, the statistical correction to this k2, experimental &#916;H &#8225; and &#916;S &#8225; , and the KIE) along with the conditions these data were reported in (temperature and solvent). A full tabulation is found in the SI data folder. We excluded a few metal oxo species from our analysis despite having reported kinetic data for reactivity with DHA. The reasons for these exclusions were varied: several did not have a well-defined primary coordination sphere, <ref type="bibr">[71]</ref><ref type="bibr">[72]</ref><ref type="bibr">[73]</ref><ref type="bibr">[74]</ref> we were unable to calculate the reduced form of Mn III oxo complexes without deprotonation of the hydrogen-bonding ligands, <ref type="bibr">17,</ref><ref type="bibr">20</ref> one V V oxo has too much experimental uncertainty in its k2 value, <ref type="bibr">75</ref> corrolazine complexes were too large to calculate their vibrational frequencies using our methods, <ref type="bibr">16,</ref><ref type="bibr">76</ref> vibrational frequencies did not converge for the reduced form of two Ru VI dioxo complexes, <ref type="bibr">50</ref> we did not include third row complexes or complexes with ligand radicals, <ref type="bibr">37,</ref><ref type="bibr">[77]</ref><ref type="bibr">[78]</ref><ref type="bibr">[79]</ref> and in one case saturation was reported at higher concentrations of <ref type="bibr">80</ref> We also found several reports of metal oxo mediated C-H activation of substrates besides DHA <ref type="bibr">14,</ref><ref type="bibr">39,</ref><ref type="bibr">70,</ref><ref type="bibr">[81]</ref><ref type="bibr">[82]</ref><ref type="bibr">[83]</ref><ref type="bibr">[84]</ref><ref type="bibr">[85]</ref><ref type="bibr">[86]</ref><ref type="bibr">[87]</ref><ref type="bibr">[88]</ref><ref type="bibr">[89]</ref><ref type="bibr">[90]</ref><ref type="bibr">[91]</ref><ref type="bibr">[92]</ref><ref type="bibr">[93]</ref><ref type="bibr">[94]</ref> and useful reviews. <ref type="bibr">95,</ref><ref type="bibr">96</ref> All rate constants utilized here were reported as k2 values with the exception of several rate constants used in the Co III oxo reactivity analysis. <ref type="bibr">15</ref> In this case, for substrates which did not have a reported k2, the pseudo-first order rate constant kobs at 0.0125 M of substrate was divided by 0.0125 M to obtain an approximate k2. We used all substrates with reported kinetic data in this analysis except for 1,1,3,3-tetraphenylpropene. This substrate reacts unusually slowly, which we believe to be due to large steric hindrance of the reacting C-H bond. The remaining substrates were sterically similar enough that there is no steric effect on their kinetics (see Regression S41).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Determination of Experimental Barrier Heights</head><p>Before determining barrier heights from experimental k2 values, we first multiplied each k2 by any reported stochiometric and statistical adjustments so as to start from consistently unadjusted k2 rate constants (experimental k2 rate constants are often reported with statistical corrections to facilitate comparisons between substrates, either for the stoichiometry of the substrate's reactivity or for the number of benzylic C-H bonds). We assume that where no adjustment is noted in a paper, none has been made. The barriers of PCET reactivity were then determined from the unadjusted experimental k2 values by solving the Eyring equation <ref type="bibr">97</ref> and subtracting approximate expressions for the free energy of metal oxosubstrate association:</p><p>where &#8462; is Planck's constant, &#119899; &#119862;&#119867; is 2 for DHA and CHD and 1 for fluorene and xanthene, &#119899; &#119874; is the number of oxo ligands in the metal oxo complex, &#119877;&#119879; is the thermal energy, &#120583; is the reduced mass of the metal oxo and the substrate, and &#119862; &#8728; is 1 M. Our &#119899; &#119862;&#119867; adjusts for the typical stoichiometry of each substrate's oxidation (DHA and CHD tend to lose two H-atoms, fluorene and xanthene one H-atom); we do not adjust for the number of reactive H-atoms, as ring puckering of the substrates means that not all reactive C-H bonds are equivalent. For instance, DHA has four benzylic C-H bonds, with two lying equatorial to the central ring and two lying axial. It is unlikely that the equatorial and axial positions are equally reactive, and it is entirely possible that reactivity predominantly occurs at only one of the positions. It is therefore not necessarily true that DHA is four times as reactive as an otherwise similar substrate with only one benzylic C-H bond.</p><p>The second and third terms in Equation 1 are an approximation for the free energy of association of the metal oxo and the substrate. <ref type="bibr">97</ref> This adjustment allows us to compare kinetic data collected at different temperatures. As C-H bonds are poor hydrogen bond donors, we assume that the cost of association is purely entropic (or at least that enthalpic components vary minimally between different metal oxo complexes and substrates) and further assume this entropy cost is solely the loss of translational entropy. This neglects the loss of rotational entropy and the gain of low frequency metal oxosubstrate vibrational modes, but these effects will partially cancel. Regression with &#916;GPCET and &#119877;&#119879; does not fit DHA reaction barrier heights significantly better than a fit to &#916;GPCET alone (see Regression S10), indicating that this adjustment satisfactorily accounts for the temperature dependence of the reaction barrier.</p><p>We do not take into account hydrogen bonding between the metal oxo complexes and protic solvents as we were unable to derive a suitably accurate correction. However, in the SI we demonstrate that our best attempt to do so does not change the main conclusions herein (see Table <ref type="table">S11</ref>). <ref type="bibr">28,</ref><ref type="bibr">[98]</ref><ref type="bibr">[99]</ref><ref type="bibr">[100]</ref> </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Calculation of Parameters</head><p>For each of these oxo complexes we calculated the values of parameters that have been proposed to influence metal oxo mediated PCET reactivity. Specifically, the parameters investigated were %BV steric metrics, spin density on the oxygen atom, available spin states, thermodynamic free energies &#916;GPCET, &#916;GPT, and &#916;GET, and the magnitude of the asynchronicity parameter (|&#951;|). Geometry optimization and frequency calculations were performed in ORCA using the def2 basis sets of Weigend and Ahlrichs and the O3LYP functional. <ref type="bibr">101- 106</ref> For the wider set of metal oxo complexes, all transition metals were given the def2-TZVPP basis set, all metal-bonded atoms and the transferring hydrogen atom the def2-TZVP basis Please do not adjust margins Please do not adjust margins set, and the remaining atoms the def2-SV(P) basis set. Substrates were calculated entirely with the def2-TZVP basis set. For calculations regarding the Co III oxo and various substrates, Co, N, O, the carbene carbons of the ligand, and the carbon undergoing C-H activation was given def2-TZVPP; all other atoms were given def2-SVP. For both sets of calculations solvent effects were included as a polarizable continuum (CPCM with the dielectric constant of acetonitrile for the broader set of metal oxo complexes; COSMO with the dielectric constant of THF for the Co III oxo with substrates). This is primarily to mitigate the effect of self-interaction error; <ref type="bibr">107</ref> we assume that the solvent dielectric has little effect on the rate of PCET, as reported solvent effects on similar reactions are typically limited to hydrogen bonding. <ref type="bibr">28,</ref><ref type="bibr">100,</ref><ref type="bibr">108</ref> The def2 ECP was used for ruthenium. <ref type="bibr">109</ref> The resolution of identity approximation was used for coulomb integrals and the chain of spheres approximation for exchange integrals (with def2/J as the auxiliary basis). No change was made to ORCA's default grid settings. Free energies were derived from the electronic energies and vibrational calculations using the quasi-harmonic oscillator formulation of Grimme and coworkers. <ref type="bibr">110</ref> Unfortunately, several of our optimized structures have small imaginary frequencies (See ESI), which we believe is due to numerical noise of CPCM solvation. Occasionally these frequencies lie below -100 cm -1 but in each of these cases the mode is isolated to a soft dihedral motion, e.g. methyl rotation on an acetonitrile ligand. We used the absolute value of these frequencies calculating the thermodynamic enthalpy and entropy values, believing that to be a better approximation for these modes than either nonexistence or a frequency of 0 cm -1 . We were unable to reoptimize these structures to remove the imaginary frequencies.</p><p>In many cases, the correct ground state multiplicity of a species was not immediately clear. In such cases we confirmed our initial assignment by running ten geometry optimization cycles on alternate spin states and confirming these alternate assignments were several kcal/mol uphill of the assigned spin state. In a few cases were the energy was within 5 kcal/mol and the optimization was not close to convergence, we fully optimized the alternate spin state. Whenever two spin states had nearly the same energy, we chose the higher spin state as the ground state due to the typically higher entropy of high spin states.</p><p>To quantify the steric environment around each metal oxo center or substrate reactive C-H bond, we calculated percent buried volume (%BV) steric metrics using the online SambVCA web application. <ref type="bibr">52</ref> We centered the calculation on the oxygen atom (for oxos) or the transferring hydrogen (for substrates), defined the negative z-axis as going through the metal center (for oxos) or the reacting carbon center (for substrates), and defined the xz plane as containing another atom bonded to the metal or carbon (the first such atom in the .xyz file). We had the center oxygen or hydrogen atom deleted from the calculation, included hydrogen atoms in the calculation, and left all other settings to their default value (using Bondi radii scaled by 1.17, a sphere radius of 3.5 Angstroms, and a mesh setting of 0.10 Angstroms). The application returns a total percent buried volume, as well as that for individual quadrants of the sphere. For metal oxo complexes, we used the total percent buried volume (%BV Tot) and the standard deviation of these four quadrants (%BV Dev) in our regressions in order to capture both overall steric bulk and how evenly distributed this bulk is around the metal oxo moiety. For substrates, we solely used %BV Tot. See the ESI for a further discussion of steric parameters and their effect on reaction barrier heights.</p><p>To evaluate the effect of spin and spin state on reactivity, we used two parameters that have been discussed in the literature: spin density on the oxo ligand and the energy to excite to a higher spin state. <ref type="bibr">8,</ref><ref type="bibr">10</ref> Atomic spin populations were determined via IBO analysis using the freely available IBO Viewer software. <ref type="bibr">53,</ref><ref type="bibr">54</ref> We recorded the spin density on the metal and on oxygen for each metal oxo complex as well as how much spin both atoms gain upon PCET reduction; we also tabulated similar values for the IBO charges. In the regression analysis we solely used the spin density on the oxo ligand. The "Spin Excitation Energy" is the vertical energy from the ground spin state of the initial oxo complex to the lowest lying excited spin state that is within one spin multiplicity of the resulting metal hydroxide ground spin state. If the ground spin state is already one spin multiplicity greater or lower than the product hydroxide spin state, then the spin excitation energy is taken to be zero. For instance, in the case of a triplet Fe IV oxo reacting to give a sextet Fe III hydroxide the spin excitation energy is the energy of the quintet Fe IV oxo relative to the triplet Fe IV oxo at the ground state optimized geometry. This is the scenario for most Fe IV oxos in the data set. But in the case of the two nonheme Fe IV quintet oxos, <ref type="bibr">12,</ref><ref type="bibr">13</ref> the spin excitation energy is zero because the ground spin state is already within one spin multiplicity of the sextet hydroxide product. Essentially, the spin excitation energy is the energy needed to reach a spin surface on which reduction to the metal hydroxide's ground spin state is spin allowed. While this simple metric ignores the nuances of two state reactivity theory (such as the spin inversion probability) it is relatively simple to compute and has precedent as a quantitative measure of PCET reactivity. <ref type="bibr">10,</ref><ref type="bibr">36</ref> For each metal oxo-substrate combination assessed here, we tabulated the free energies of proton coupled electron transfer (&#916;GPCET, Equation <ref type="formula">2</ref>), proton transfer (&#916;GPT, Equation 3), electron transfer (&#916;GET, Equation <ref type="formula">4</ref>), and the asynchronicity as defined by Srnec and coworkers (&#951;, Equation <ref type="formula">5</ref>): <ref type="bibr">21</ref> &#916;&#119866; &#119875;&#119862;&#119864;&#119879; = &#119866; &#119872;-&#119874;&#119867; + &#119866; &#119862;&#8901; -&#119866; &#119872;=&#119874; -&#119866; &#119862;-&#119867; (2)</p><p>where &#119866; &#119872;=&#119874; is the calculated free energy of the oxo species, &#119866; &#119862;-&#119867; is the calculated free energy of the substrate, and all other free energies are defined analogously. We also tabulated the absolute value of the asynchronicity (|&#951;|), the average of &#916;GPT and &#916;GET (&#916;GCT Average), and the analogous electronic energies (same notation, with G replaced with E). This journal is &#169; The Royal Society of Chemistry 20xx</p><p>Please do not adjust margins Please do not adjust margins</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Statistical Analysis</head><p>All statistical analysis was performed in Python using the Numpy, Scipy, Pandas, Sklearn, and Matplotlib packages. <ref type="bibr">[111]</ref><ref type="bibr">[112]</ref><ref type="bibr">[113]</ref><ref type="bibr">[114]</ref><ref type="bibr">[115]</ref> A script ran a prescribed set of regression models and reported statistics on each model. All regressions were performed with ordinary least squares. Prior to fitting any regression, we separated the data into a test set and a training set of metal oxo species. While we show both test and training sets for each regression in the ESI, we initially did not plot the test set or calculate statistics with it. We solely used the training set in the earlier stages of our analysis, where we determined which parameters improved a fit to &#916;GPCET only and which did not. We then examined if the good fits to the training set extrapolated well to the test set. We had to make a few changes to the initial division of the training and test sets, however. Initially, the Co IV oxo was included in the training set, <ref type="bibr">49</ref> but that was interfering with the fit to &#916;GPCET. We moved it to the test set, and to provide insight into this poor fit we calculated the Fe IV oxo complex of 13-TMC and added it to the test set. <ref type="bibr">70</ref> We also saw, visually, that the inclusion of a Ru IV oxo in the training set was interfering with the addition of other parameters and moved all Ru oxo complexes to the test set. <ref type="bibr">38,</ref><ref type="bibr">43,</ref><ref type="bibr">45,</ref><ref type="bibr">50,</ref><ref type="bibr">51</ref> The simplest metrics reported from these models are the mean square error (MSE) and the goodness of fit R 2 . <ref type="bibr">[58]</ref><ref type="bibr">[59]</ref><ref type="bibr">[60]</ref><ref type="bibr">[61]</ref> These both give an indication of how well a model fits the available data but are prone to overfitting; more complicated models can only improve these metrics, regardless of whether or not the model is actually better.</p><p>We also evaluated each model with cross validation (CV) metrics, which can become worse upon overfitting. In K-fold cross validation, the training data is further subdivided into K subsets, and each subset is predicted by the K-1 remaining subsets. <ref type="bibr">59,</ref><ref type="bibr">61</ref> When K is the number of data points, i.e. each data point being predicted by the rest of the data points, this is known as leave-one-out (LOO) cross validation. These predicted data points can be used to calculate the MSE and R 2 as above. The MSE from LOO cross validation is an approximately unbiased estimate of the expected error of a test set; however, it has high variability from training set to training set because each prediction uses nearly every point in a given training set. By repeatedly subdividing into larger groups and averaging the resultant K-fold MSEs, one obtains a pessimistic but less variable estimate of the expected test error. As we see similar trends for both LOO and 5-fold CV, we only report LOO R 2 in the main text but show all metrics in the ESI.</p><p>Another way to determine the significance of the model is to use a statistical F-test. <ref type="bibr">58,</ref><ref type="bibr">60</ref> This allows one to compare an unrestricted model with a more restricted one (fewer parameters used as regressors, or no parameters regressed, or restrictions placed on the relationship between coefficients, etc.). In the language of hypothesis testing, the null hypothesis is that the unrestricted model offers no improvement on the restricted model and the alternate hypothesis is that there is an improvement. When both models are fit to the data, the unrestricted model will have less total squared error than the restricted model. Assuming said error of each data point is normally distributed (or that there is enough data such that the error is approximately normally distributed), that the average error is zero, and that the model is properly formulated, it is possible to determine the probability that this reduction in total squared error is spurious. This probability is known as the pvalue. The test relies on a well-defined number of degrees of freedom in both the restricted and unrestricted model to draw out what the statistical distribution of total squared error ought to be.</p><p>For regressions on multiple substrates at once, the unequal weighting of different metal oxo complexes (depending on how many substrates are reported for them) renders these statistical metrics unreliable. <ref type="bibr">61</ref> We ameliorate this issue for LOO cross validation by leaving out all reaction barriers for a given metal oxo complex together rather than one at a time. That is, we leave one metal oxo complex out and predict its reaction barrier heights based on all other metal oxos' reaction barrier heights rather than leave one reaction barrier height out and predict this barrier based off all other barriers. We accordingly only report LOO CV metrics for this set of regressions.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0"><p>This journal is &#169; The Royal Society of Chemistry 20xx J. Name., 2013, 00, 1-3 | 7</p></note>
		</body>
		</text>
</TEI>
