Dynamic protein structures are crucial for deciphering their diverse biological functions. Two-dimensional infrared (2DIR) spectroscopy stands as an ideal tool for tracing rapid conformational evolutions in proteins. However, linking spectral characteristics to dynamic structures poses a formidable challenge. Here, we present a pretrained machine learning model based on 2DIR spectra analysis. This model has learned signal features from approximately 204,300 spectra to establish a “spectrum-structure” correlation, thereby tracing the dynamic conformations of proteins. It excels in accurately predicting the dynamic content changes of various secondary structures and demonstrates universal transferability on real folding trajectories spanning timescales from microseconds to milliseconds. Beyond exceptional predictive performance, the model offers attention-based spectral explanations of dynamic conformational changes. Our 2DIR-based pretrained model is anticipated to provide unique insights into the dynamic structural information of proteins in their native environments.
more »
« less
A new method based on pseudo-Zernike polynomials to analyze and extract dynamical and spectral information from the 2DIR spectra
Ultrafast two-dimensional infrared (2DIR) spectroscopy is a relatively new methodology, which has now been widely used to study the molecular structure and dynamics of molecular processes occurring in solution. Typically, in 2DIR spectroscopy the dynamics of a system is inferred from the evolution of 2DIR spectral features over waiting times. One of the most important metrics derived from the 2DIR is the frequency–frequency correlation function (FFCF), which can be extracted using different methods, including center and nodal line slope. However, these methods struggle to correctly describe the dynamics in 2DIR spectra with multiple and overlapping transitions. Here, a new approach, utilizing pseudo-Zernike moments, is introduced to retrieve the FFCF dynamics of each spectral component from complex 2DIR spectra. The results show that this new method not only produces equivalent results to more established methodologies in simple spectra but also successfully extracts the FFCF dynamics of individual component from very congested and unresolved 2DIR spectra. In addition, this new methodology can be used to locate the individual frequency components from those complex spectra. Overall, a new methodology for analyzing the 2D spectra is presented here, which allows us to retrieve previously unattainable spectral features from the 2DIR spectra.
more »
« less
- Award ID(s):
- 1751735
- PAR ID:
- 10532802
- Publisher / Repository:
- American Institute of Physics
- Date Published:
- Journal Name:
- The Journal of Chemical Physics
- Volume:
- 159
- Issue:
- 3
- ISSN:
- 0021-9606
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Optical two-dimensional (2D) coherent spectroscopy excels in studying coupling and dynamics in complex systems. The dynamical information can be learned from lineshape analysis to extract the corresponding linewidth. However, it is usually challenging to fit a 2D spectrum, especially when the homogeneous and inhomogeneous linewidths are comparable. We implemented a machine learning algorithm to analyze 2D spectra to retrieve homogeneous and inhomogeneous linewidths. The algorithm was trained using simulated 2D spectra with known linewidth values. The trained algorithm can analyze both simulated (not used in training) and experimental spectra to extract the homogeneous and inhomogeneous linewidths. This approach can be potentially applied to 2D spectra with more sophisticated spectral features.more » « less
-
Two-dimensional infrared (2DIR) spectroscopy has become an established method for generating vibrational spectra in condensed phase samples composed of mixtures that yield heavily congested infrared and Raman spectra. These condensed phase 2DIR spectrometers can provide very high temporal resolution (<1 ps), but the spectral resolution is generally insufficient for resolving rotational peaks in gas phase spectra. Conventional (1D) rovibrational spectra of gas phase molecules are often plagued by severe spectral congestion, even when the sample is not a mixture. Spectral congestion can obscure the patterns in rovibrational spectra that are needed to assign peaks in the spectra. A method for generating high resolution 2DIR spectra of gas phase molecules has now been developed and tested using methane as the sample. The 2D rovibrational patterns that are recorded resemble an asterisk with a center position that provides the frequencies of both of the two coupled vibrational levels. The ability to generate easily recognizable 2D rovibrational patterns, regardless of temperature, should make the technique useful for a wide range of applications that are otherwise difficult or impossible when using conventional 1D rovibrational spectroscopy.more » « less
-
Coherent multidimensional spectroscopy provides experimental access to molecular structure and subpicosecond dynamics in solution. Dynamics are typically inferred from the evolution of lineshapes over a function of waiting time. Numerous spectral analysis methods, such as center/nodal line slope, have been developed to extract these dynamics. However, the extracted dynamics can depend heavily on subjective choices, such as the region selected for CLS analysis or the chosen models. In this study, we introduce a novel approach to extracting dynamics from ultrafast two-dimensional infrared (2D IR) spectra by using dynamic mode decomposition (DMD). As a data-driven method, DMD directly extracts spatiotemporal structures from the complex 2D IR spectra. We evaluated the performance of DMD in simulated and experimental spectra containing overlapped peaks. We show that DMD can retrieve the dynamics of overlapped transitions and cross peaks that are typically challenging to extract with traditional methods. In addition, we demonstrate that combining conditional generative adversarial neural networks with DMD can recover dynamics even at low signal-to-noise ratios. DMD methods do not require preliminary assumptions and can be readily extended to other multidimensional spectroscopies.more » « less
-
Spectroscopy is a key analytical tool that provides valuable insight into molecular structure and is widely used to identify chemical samples. Tagging spectroscopy is a form of action spectroscopy in which the absorption of a single photon by a molecular ion is detected via the loss of a weakly attached, inert “tag” particle (e.g. He, Ne, N2).1–3 The absorption spectrum is derived from the tag loss rate as a function of incident radiation frequency. To date, all spectroscopy of gas phase polyatomic molecules has been restricted to large molecular ensembles, complicating spectral interpretation by the presence of multiple chemical and isomeric species. Here we present a novel tagging spectroscopic scheme to analyze the purest possible sample: a single gas phase molecule. We demonstrate this technique with the measurement of the infrared spectrum of a single tropylium (C7H7+ ) molecular ion; to our knowledge the first recorded spectrum of a single gas phase polyatomic molecule. Our method’s high sensitivity revealed spectral features previously unobserved using traditional tagging methods.4 Our approach in principle enables analysis of multi-component mixtures by identifying constituent molecules one at a time. Single molecule sensitivity extends action spectroscopy to rare samples, such as those of extraterrestrial origin,5,6 or to reactive reaction intermediates formed at number densities too low for traditional action methods.more » « less
An official website of the United States government

