skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.
Attention:The NSF Public Access Repository (NSF-PAR) system and access will be unavailable from 7:00 AM ET to 7:30 AM ET on Friday, April 24 due to maintenance. We apologize for the inconvenience.


Title: A Dataset of Plausible Proton Transfer Steps for Arrow-Pushing Mechanisms
Abstract Proton transfers are fundamental steps in polar reaction mechanisms. We generated a large dataset of over 51 million kinetically plausible proton transfer steps between heteroatoms from about 8,000 acids and conjugate bases with experimental aqueous pKas, spanning pKavalues from −15 to +37. Rate factors were estimated at 25 °C using a simplified Eigen equation with pKas but without statistical factors. Steps with estimated rate constants ≥ 103M−1s−1were included in the final dataset. Additionally, 5,043 proton transfer steps from carbon acids to heteroatom bases were estimated using the Eigen-Bernasconi equation based on reported intrinsic rate constants and Brønsted β values. Carbon proton transfers with rate constants ≥ 103 M−1s−1were added to the final dataset. Each entry was encoded in SMIRKS format with electron-flow specification for machine learning compatibility. Diversity of structure was prioritized over diversity of conditions; calculated rate constants are expected to be accurate in aqueous environments. This approach and dataset should prove valuable for training models to predict stepwise mechanistic pathways.  more » « less
Award ID(s):
1955811
PAR ID:
10670684
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
Nature
Date Published:
Journal Name:
Scientific Data
Volume:
13
Issue:
1
ISSN:
2052-4463
Page Range / eLocation ID:
172
Format(s):
Medium: X
Associated Dataset(s):
View Associated Dataset(s) >>
Sponsoring Org:
National Science Foundation
More Like this
  1. A zipped file containing:51M_Heteroatom.csv - 51M proton transfer steps from heteroatom acids to heteroatom bases, with SMIRKS, calculated log k1, and pKas,5KCarbonPT.csv - 5K proton transfer steps from carbon acids to heteroatom bases, with SMIRKS, calculated log k1, and pKas, Brønsted β values, statistical factors (qB, pB, qC, pC) intrinsic rate constants (ko),49ExperimentalCarbonPT.csv - 49 proton transfers from heteroatom acids to carbon bases in SMIRKS format, with experimentally measured log k1, and literature references.51M_heteroatom_raw – a subfolder with two files containing lists of 7.6K heteroatomic acids and bases in SMILES format with the acidic and basic atoms labeled, with pKas, literature references: Acid.csv, ConBase.csv100_Heteroatom.csv - A representative sample set of 100 out of the 51M proton transfer steps100K_Heteroatom.csv - A representative sample set of 100,000 out of the 51M proton transfer stepscarbon_acid_raw – a subfolder containing a list of intrinsic rate constants for carbon acids in SMILES format, with statistical factors (Carbon_Acids.csv) and a subfolder named Bases containing seven lists of heteroatom base classes (ArO-.csv, R2NH.csv, R3N.csv, “RCO2- and ArCO2-.csv”, RNH2.csv, RO-.csv and RS-.csv). Lists of heteroatom bases are in SMILES format, sectioned by class and with statistical factors, selected from the Heteroatom set 
    more » « less
  2. Abstract We have been interested in the development of rubisco‐based biomimetic systems for reversible CO2 capture from air. Our design of the chemical CO2 capture and release (CCR) system is informed by the understanding of the binding of the activator CO2(ACO2) in rubisco (ribulose‐1,5‐bisphosphate carboxylase/oxygenase). The active site consists of the tetrapeptide sequence Lys‐Asp‐Asp‐Glu (or KDDE) and the Lys sidechain amine is responsible for the CO2 capture reaction. We are studying the structural chemistry and the thermodynamics of CO2 capture based on the tetrapeptide CH3CO−KDDE−NH2 (“KDDE”) in aqueous solution to develop rubisco mimetic CCR systems. Here, we report the results of 1H NMR and 13C NMR analyses of CO2 capture by butylamine and by KDDE. The carbamylation of butylamine was studied to develop the NMR method and with the protocol established, we were able to quantify the oligopeptide carbamylation at much lower concentration. We performed a pH profile in the multi-equilibrium system and measured amine species and carbamic acid/carbamate species by the integration of 1H NMR signals as a function of pH in the range 8 ≤ pH ≤ 11. The determination of ΔG1(R) for the reaction R−NH2 + CO2 R−NH−COOH requires the solution of a multi‐equilibrium equation system, which accounts for the dissociation constants K2 and K3 controlling carbonate and bicarbonate concentrations, the acid dissociation constant K4 of the conjugated acid of the amine, and the acid dissociation constant K5 of the alkylcarbamic acid. We show how the multi‐equilibrium equation system can be solved with the measurements of the daughter/parent ratio X, the knowledge of the pH values, and the initial concentrations [HCO3]0 and [R‐NH2]0. For the reaction energies of the carbamylations of butylamine and KDDE, our best values are ΔG1(Bu) = −1.57 kcal/mol and ΔG1(KDDE) = −1.17 kcal/mol. Both CO2capture reactions are modestly exergonic and thereby ensure reversibility in an energy‐efficient manner. These results validate the hypothesis that KDDE‐type oligopeptides may serve as reversible CCR systems in aqueous solution and guide designs for their improvement. 
    more » « less
  3. Abstract. Triplet excited states of organic matter are formed when colored organicmatter (i.e., brown carbon) absorbs light. While these “triplets” can beimportant photooxidants in atmospheric drops and particles (e.g., theyrapidly oxidize phenols), very little is known about their reactivity towardmany classes of organic compounds in the atmosphere. Here we measure thebimolecular rate constants of the triplet excited state of benzophenone(3BP), a model species, with 17 water-solubleC3–C6 alkenes that have either been found in theatmosphere or are reasonable surrogates for identified species. Measured rateconstants ( k ALK + 3 BP ) vary by a factor of 30 and are in therange of (0.24–7.5) ×109 M−1 s−1. Biogenic alkenesfound in the atmosphere – e.g., cis-3-hexen-1-ol, cis-3-hexenyl acetate, andmethyl jasmonate – react rapidly, with rate constants above 1×109 M−1 s−1. Rate constants depend on alkene characteristicssuch as the location of the double bond, stereochemistry, and alkylsubstitution on the double bond. There is a reasonable correlation between k ALK + 3 BP and the calculated one-electron oxidation potential(OP) of the alkenes (R2=0.58); in contrast, rate constants are notcorrelated with bond dissociation enthalpies, bond dissociation freeenergies, or computed energy barriers for hydrogen abstraction. Using the OPrelationship, we estimate aqueous rate constants for a number of unsaturatedisoprene and limonene oxidation products with 3BP: values are inthe range of (0.080–1.7) ×109 M−1 s−1, withgenerally faster values for limonene products. Rate constants with lessreactive triplets, which are probably more environmentally relevant, arelikely roughly 25 times slower. Using our predicted rate constants, alongwith values for other reactions from the literature, we conclude thattriplets are probably minor oxidants for isoprene- and limonene-relatedcompounds in cloudy or foggy atmospheres, except in cases in which the tripletsare very reactive. 
    more » « less
  4. Proton-conducting ceramic cells with a Ni-BaCe0.8Zr0.1Y0.1O3−δ|| BaCe0.7Zr0.2Y0.1O3−δ|| Fe/BaCe0.7Zr0.2Y0.1O3−δ-backbone structure were studied in a dual-chamber setup to evaluate the effects of temperature, gas flow, voltage, and electrolyte thickness on electrochemical ammonia synthesis. Three gas feed configurations were tested to differentiate surface reaction mechanisms. Thermodynamic calculations were conducted to evaluate the experimental results in the context of equilibrium limitations. The results showed that proton transport through the electrolyte plays a minor role in the formation of ammonia, while production rates correlated more strongly with applied voltage than with current density. Ammonia synthesis appeared significantly limited by thermodynamics, making an increased hydrogen partial pressure at the cathode essential. A feed configuration supplying only nitrogen to the cathode was ineffective, whereas introducing hydrogen increased the ammonia formation rate from 1.42 × 10−10mol cm−2s−1to 3.23 × 10−9mol cm−2s−1at −1.4 V and 500 °C. When proton conduction was suppressed under the same conditions, the rate further increased to 4.32 × 10−9mol cm−2s−1. Varying the cathodic gas flow rate across the 12.57 cm2active area from 8 NL h−1to 20 NL h−1, the ammonia formation rate improved by a factor of 3.1 at −1 V, at both 500 °C and 600 °C. 
    more » « less
  5. We report on spectroscopic measurements on the 4f76s28S7/2−4f7(8S)6s6p(1P)8P5/2,7/2transitions at 466.32 nm and 462.85 nm, respectively, in neutral europium-151 and europium-153. The center of gravity frequencies for the 151 and 153 isotopes for both transitions are reported for the first time using saturated absorption spectroscopy. For the 6s6p(1P)8P5/2state, the center of gravity frequencies were found to be 642,894,493.3(4) MHz and 642,891,693.3(9) MHz for the 151 and 153 isotopes, respectively. The hyperfine constants for the upper state were found to beA(151)=−157.01(3)MHz,B(151)=74.5(4)MHz andA(153)=−69.43(14)MHz,B(153)=191.0(26)MHz. These hyperfine values are all consistent with previously published results except forB(151) that has a small discrepancy. The isotope shift was found to be 2799.54(20) MHz, a small discrepancy with previously published results. For the 6s6p(1P)8P7/2state, the center of gravity frequencies were found to be 647,708,930.6(6) MHz and 647,705,958.4(26) MHz for the 151 and 153 isotopes, respectively. The hyperfine constants for the upper state were found to beA(151)=−218.66(4)MHz,B(151)=−293.4(8)MHz andA(153)=−97.15(13)MHz,B(153)=−750(3)MHz. These values are all consistent with previously published results except forA(151) that has a small discrepancy. The isotope shift was found to be 2972.8(5) MHz, a small discrepancy with previously measured results. 
    more » « less