Abstract Plant metabolomes are structurally diverse. One of the most popular techniques for sampling this diversity is liquid chromatography–mass spectrometry (LC‐MS), which typically detects thousands of peaks from single organ extracts, many representing true metabolites. These peaks are usually annotated using in‐house retention time or spectral libraries, in silico fragmentation libraries, and increasingly through computational techniques such as machine learning. Despite these advances, over 85% of LC‐MS peaks remain unidentified, posing a major challenge for data analysis and biological interpretation. This bottleneck limits our ability to fully understand the diversity, functions, and evolution of plant metabolites. In this review, we first summarize current approaches for metabolite identification, highlighting their challenges and limitations. We further focus on alternative strategies that bypass the need for metabolite identification, allowing researchers to interpret global metabolic patterns and pinpoint key metabolite signals. These methods include molecular networking, distance‐based approaches, information theory–based metrics, and discriminant analysis. Additionally, we explore their practical applications in plant science and highlight a set of useful tools to support researchers in analyzing complex plant metabolomics data. By adopting these approaches, researchers can enhance their ability to uncover new insights into plant metabolism.
more »
« less
Creation of a Plant Metabolite Spectral Library for Untargeted and Targeted Metabolomics
Large-scale high throughput metabolomic technologies are indispensable components of systems biology in terms of discovering and defining the metabolite parts of the system. However, the lack of a plant metabolite spectral library limits the metabolite identification of plant metabolomic studies. Here, we have created a plant metabolite spectral library using 544 authentic standards, which increased the efficiency of identification for untargeted metabolomic studies. The process of creating the spectral library was described, and the mzVault library was deposited in the public repository for free download. Furthermore, based on the spectral library, we describe a process of creating a pseudo-targeted method, which was applied to a proof-of-concept study of Arabidopsis leaf extracts. As authentic standards become available, more metabolite spectra can be easily incorporated into the spectral library to improve the mzVault package.
more »
« less
- Award ID(s):
- 1920420
- PAR ID:
- 10438836
- Date Published:
- Journal Name:
- International Journal of Molecular Sciences
- Volume:
- 24
- Issue:
- 3
- ISSN:
- 1422-0067
- Page Range / eLocation ID:
- 2249
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Abstract Infrared spectroscopy is a widely used tool for studying microplastics and identifying microparticles. Researchers rely on spectral libraries to differentiate between synthetic and natural materials. Unfortunately, spectral library matching is not perfect, and best practices require researchers to use time consuming, manual peak matching to assess spectral matches. Moving toward automated matching requires increased confidence in the matching process. Using spectra matching software may increase the efficiency of particle identification, however some matching strategies may confuse natural materials such as cotton, silk, and plant matter with common classes of synthetics such as polyesters and polyamides. In this experiment, we prepared 22 pristine sample materials from natural and synthetic sources and measured micro-Fourier transform infrared (µFTIR) spectra in transmission mode for each sample using a Thermo Nicolet iN10 MX instrument. The collected spectra were then input into two spectral library matching systems (Omnic Picta and Open Specy), using a total of five identification routines. Next, we placed a subset of four pristine microplastic materials in a biologically active river system for two weeks to simulate environmental samples. These simulated environmental samples were processed using 10% hydrogen peroxide for 24 h to remove organic contamination and then identified using the strongest performing library. We found that libraries with fewer sample spectra produced lower correlation matches and that using derivative correction greatly reduced the number of inaccuracies in identifying materials as either natural or synthetic. We also found that environmental fouling reduced the correlation value of library matches when compared to pristine particles, however the effect was not consistent across the four materials tested. Overall, we found that the accuracy of automated library matching in the tested systems and processing routines varied from 64.1 to 98.0% for distinguishing between natural and synthetic materials, and that a high Hit Quality Index (HQI) did not always correlate with accuracy. These results are important for the microplastic field, demonstrating a need to rigorously test spectral libraries and processing routines with known materials to ensure identification accuracy.more » « less
-
Abstract Neuropeptides have tremendous potential for application in modern medicine, including utility as biomarkers and therapeutics. To overcome the inherent challenges associated with neuropeptide identification and characterization, data‐independent acquisition (DIA) is a fitting mass spectrometry (MS) method of choice to achieve sensitive and accurate analysis. It is advantageous for preliminary neuropeptidomic studies to occur in less complex organisms, with crustacean models serving as a popular choice due to their relatively simple nervous system. With spectral libraries serving as a means to interpret DIA‐MS output spectra, andCancer borealisas a model of choice for neuropeptide analysis, we performed the first spectral library mapping of crustacean neuropeptides. Leveraging pre‐existing data‐dependent acquisition (DDA) spectra, a spectral library was built using PEAKS Online. The library is comprised of 333 unique neuropeptides. The identification results obtained through the use of this spectral library were compared with those achieved through library‐free analysis of crustacean brain, pericardial organs (PO), and thoracic ganglia (TG) tissues. A statistically significant increase (Student'st‐test,Pvalue < 0.05) in the number of identifications achieved from the TG data was observed in the spectral library results. Furthermore, in each of the tissues, a distinctly different set of identifications was found in the library search compared to the library‐free search. This work highlights the necessity for the use of spectral libraries in neuropeptide analysis, illustrating the advantage of spectral libraries for interpreting DIA spectra in a reproducible manner with greater neuropeptidomic depth.more » « less
-
Abstract Premise The specialized metabolites of plants are recognized as key chemical traits in mediating the ecology and evolution of sundry plant–biotic interactions, from pollination to seed predation. Intra‐ and interspecific patterns of specialized metabolite diversity have been studied extensively in leaves, but the diverse biotic interactions that contribute to specialized metabolite diversity encompass all plant organs. Focusing on two species of Psychotria shrubs, we investigated and compared patterns of specialized metabolite diversity in leaves and fruit with respect to each organ's diversity of biotic interactions. Methods To evaluate associations between biotic interaction diversity and specialized metabolite diversity, we combined UPLC‐MS metabolomic analysis of foliar and fruit specialized metabolites with existing surveys of leaf‐ and fruit‐centered biotic interactions. We compared patterns of specialized metabolite richness and variance among vegetative and reproductive tissues, among plants, and between species. Results In our study system, leaves interact with a far larger number of consumer species than do fruit, while fruit‐centric interactions are more ecologically diverse in that they involve antagonistic and mutualistic consumers. This aspect of fruit‐centric interactions was reflected in specialized metabolite richness—leaves contained more than fruit, while each organ contained over 200 organ‐specific specialized metabolites. Within each species, leaf‐ and fruit‐specialized metabolite composition varied independently of one another across individual plants. Contrasts in specialized metabolite composition were stronger between organs than between species. Conclusions As ecologically disparate plant organs with organ‐specific specialized metabolite traits, leaves and fruit can each contribute to the tremendous overall diversity of plant specialized metabolites.more » « less
-
Introduction Products of plant secondary metabolism, such as phenolic compounds, flavonoids, alkaloids, and hormones, play an important role in plant growth, development, stress resistance. The plant family Rubiaceae is extremely diverse and abundant in Central America and contains several economically important genera, e.g. Coffea and other medicinal plants. These are known for the production of bioactive polyphenols (e.g. caffeine and quinine), which have had major impacts on human society. The overall goal of this study was to develop a high-throughput workflow to identify and quantify plant polyphenols. Methods First, a method was optimized to extract over 40 families of phytochemicals. Then, a high-throughput metabolomic platform has been developed to identify and quantify 184 polyphenols in 15 min. Results The current metabolomics study of secondary metabolites was conducted on leaves from one commercial coffee variety and two wild species that also belong to the Rubiaceae family. Global profiling was performed using liquid chromatography high-resolution time-of-flight mass spectrometry. Features whose abundance was significantly different between coffee species were discriminated using statistical analysis and annotated using spectral databases. The identified features were validated by commercially available standards using our newly developed liquid chromatography tandem mass spectrometry method. Discussion Caffeine, trigonelline and theobromine were highly abundant in coffee leaves, as expected. Interestingly, wild Rubiaceae leaves had a higher diversity of phytochemicals in comparison to commercial coffee: defense-related molecules, such as phenylpropanoids (e.g., cinnamic acid), the terpenoid gibberellic acid, and the monolignol sinapaldehyde were found more abundantly in wild Rubiaceae leaves.more » « less
An official website of the United States government

