skip to main content


Title: Automating methods for estimating metabolite volatility

The volatility of metabolites can influence their biological roles and inform optimal methods for their detection. Yet, volatility information is not readily available for the large number of described metabolites, limiting the exploration of volatility as a fundamental trait of metabolites. Here, we adapted methods to estimate vapor pressure from the functional group composition of individual molecules (SIMPOL.1) to predict the gas-phase partitioning of compounds in different environments. We implemented these methods in a new open pipeline calledvolcalcthat uses chemoinformatic tools to automate these volatility estimates for all metabolites in an extensive and continuously updated pathway database: the Kyoto Encyclopedia of Genes and Genomes (KEGG) that connects metabolites, organisms, and reactions. We first benchmark the automated pipeline against a manually curated data set and show that the same category of volatility (e.g., nonvolatile, low, moderate, high) is predicted for 93% of compounds. We then demonstrate howvolcalcmight be used to generate and test hypotheses about the role of volatility in biological systems and organisms. Specifically, we estimate that 3.4 and 26.6% of compounds in KEGG have high volatility depending on the environment (soil vs. clean atmosphere, respectively) and that a core set of volatiles is shared among all domains of life (30%) with the largest proportion of kingdom-specific volatiles identified in bacteria. Withvolcalc, we lay a foundation for uncovering the role of the volatilome using an approach that is easily integrated with other bioinformatic pipelines and can be continually refined to consider additional dimensions to volatility. Thevolcalcpackage is an accessible tool to help design and test hypotheses on volatile metabolites and their unique roles in biological systems.

 
more » « less
Award ID(s):
2034192 2045332
PAR ID:
10482592
Author(s) / Creator(s):
; ; ; ; ; ; ; ;
Publisher / Repository:
Frontiers in Microbiology
Date Published:
Journal Name:
Frontiers in Microbiology
Volume:
14
ISSN:
1664-302X
Subject(s) / Keyword(s):
bioinformatics chemoinformatics metabolic database VOCs volatile metabolite volatility
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Volatility describes the tendency for a compound to partition into the gas phase and volatile metabolites facilitate unique biological interactions which have an influence on Earth's atmospheric physics and chemistry. Estimating which metabolites may be volatile is difficult, especially for those which do not have measured vapor pressures. Volcalc is a newly developed vapor pressure estimation tool which utilizes the SIMPOL.1 method, allowing users to rapidly identify plausible volatile metabolites within the Kyoto Encyclopedia for Genes and Genomes (KEGG) database. Here, we estimate the volatiles of all KEGG metabolites and associate them with KEGG reactions, enzymes, orthologs (KOs) and whole genomes within the KEGG database. This information may be used to identify which genes or species may be linked to particular forms of volatile metabolism, for the purpose hypothesis generation and integration into additional bioinformatics pipelines. This data is listed as a compliment to the publication "Automating methods for estimating metabolite volatility". The column "Paper" indicates whether the listed species is one from the subset analyzed within the data for Figure 3.
    For inquiries regarding the contents of this dataset, please contact the Corresponding Author listed in the README.txt file. Administrative inquiries (e.g., removal requests, trouble downloading, etc.) can be directed to data-management@arizona.edu 
    more » « less
  2. Abstract

    Biogenic volatile organic compounds (VOCs) constitute a significant portion of gas-phase metabolites in modern ecosystems and have unique roles in moderating atmospheric oxidative capacity, solar radiation balance, and aerosol formation. It has been theorized that VOCs may account for observed geological and evolutionary phenomena during the Archaean, but the direct contribution of biology to early non-methane VOC cycling remains unexplored. Here, we provide an assessment of all potential VOCs metabolized by the last universal common ancestor (LUCA). We identify enzyme functions linked to LUCA orthologous protein groups across eight literature sources and estimate the volatility of all associated substrates to identify ancient volatile metabolites. We hone in on volatile metabolites with confirmed modern emissions that exist in conserved metabolic pathways and produce a curated list of the most likely LUCA VOCs. We introduce volatile organic metabolites associated with early life and discuss their potential influence on early carbon cycling and atmospheric chemistry.

     
    more » « less
  3. Abstract

    Plants make a variety of specialized metabolites that can mediate interactions with animals, microbes, and competitor plants. Understanding how plants synthesize these compounds enables studies of their biological roles by manipulating their synthesis in vivo as well as producing them in vitro. Acylsugars are a group of protective metabolites that accumulate in the trichomes of many Solanaceae family plants. Acylinositol biosynthesis is of interest because it appears to be restricted to a subgroup of species within the Solanum genus. Previous work characterized a triacylinositol acetyltransferase involved in acylinositol biosynthesis in the Andean fruit plantSolanum quitoense(lulo or naranjilla). We characterized three additionalS. quitoensetrichome expressed enzymes and found that virus‐induced gene silencing of each caused changes in acylinositol accumulation. pH was shown to influence the stability and rearrangement of the product of ASAT1H and could potentially play a role in acylinositol biosynthesis. Surprisingly, the in vitro triacylinositol products of these enzymes are distinct from those that accumulatein planta. This suggests that additional enzymes are required in acylinositol biosynthesis. These characterizedS. quitoenseenzymes, nonetheless, provide opportunities to test the biological impact and properties of these triacylinositols in vitro.

     
    more » « less
  4. null (Ed.)
    Soils harbor complex biological processes intertwined with metabolic inputs from microbes and plants. Measuring the soil metabolome can reveal active metabolic pathways, providing insight into the presence of specific organisms and ecological interactions. A subset of the metabolome is volatile; however, current soil studies rarely consider volatile organic compounds (VOCs), contributing to biases in sample processing and metabolomic analytical techniques. Therefore, we hypothesize that overall, the volatility of detected compounds measured using current metabolomic analytical techniques will be lower than undetected compounds, a reflection of missed VOCs. To illustrate this, we examined a peatland metabolomic dataset collected using three common metabolomic analytical techniques: nuclear magnetic resonance (NMR), gas chromatography-mass spectroscopy (GC-MS), and fourier-transform ion cyclotron resonance mass spectrometry (FT-ICR-MS). We mapped the compounds to three metabolic pathways (monoterpenoid biosynthesis, diterpenoid biosynthesis, and polycyclic aromatic hydrocarbon degradation), chosen for their activity in peatland ecosystems and involvement of VOCs. We estimated the volatility of the compounds by calculating relative volatility indices (RVIs), and as hypothesized, the average RVI of undetected compounds within each of our focal pathways was higher than detected compounds ( p < 0.001). Moreover, higher RVI compounds were absent even in sub-pathways where lower RVI compounds were observed. Our findings suggest that typical soil metabolomic analytical techniques may overlook VOCs and leave missing links in metabolic pathways. To more completely represent the volatile fraction of the soil metabolome, we suggest that environmental scientists take into consideration these biases when designing and interpreting their data and/or add direct online measurement methods that capture the integral role of VOCs in soil systems. 
    more » « less
  5. Abstract. The composition of organic aerosol under different ambient conditions aswell as their phase state have been a subject of intense study in recentyears. One way to study particle properties is to measure the particlesize shrinkage in a diluted environment at isothermal conditions. From thesemeasurements it is possible to separate the fraction of low-volatilitycompounds from high-volatility compounds. In this work, we analyse andevaluate a method for obtaining particle composition and viscosity frommeasurements using process models coupled with input optimizationalgorithms. Two optimization methods, the Monte Carlo genetic algorithm andBayesian inference, are used together with process models describing thedynamics of particle evaporation. The process model optimization scheme ininferring particle composition in a volatility-basis-set sense andcomposition-dependent particle viscosity is tested with artificiallygenerated data sets and real experimental data. Optimizing model input sothat the output matches these data yields a good match for the estimatedquantities. Both optimization methods give equally good results when theyare used to estimate particle composition to artificially test data. The timescale of the experiments and the initial particle size are found to beimportant in defining the range of values that can be identified for theproperties from the optimization. 
    more » « less