skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Optimising the use of gene expression data to predict plant metabolic pathway memberships
Summary Plant metabolites from diverse pathways are important for plant survival, human nutrition and medicine. The pathway memberships of most plant enzyme genes are unknown. While co‐expression is useful for assigning genes to pathways, expression correlation may exist only under specific spatiotemporal and conditional contexts.Utilising > 600 tomato (Solanum lycopersicum) expression data combinations, three strategies for predicting memberships in 85 pathways were explored.Optimal predictions for different pathways require distinct data combinations indicative of pathway functions. Naive prediction (i.e. identifying pathways with the most similarly expressed genes) is error prone. In 52 pathways, unsupervised learning performed better than supervised approaches, possibly due to limited training data availability. Using gene‐to‐pathway expression similarities led to prediction models that outperformed those based simply on expression levels. Using 36 experimental validated genes, the pathway‐best model prediction accuracy is 58.3%, significantly better compared with that for predicting annotated genes without experimental evidence (37.0%) or random guess (1.2%), demonstrating the importance of data quality.Our study highlights the need to extensively explore expression‐based features and prediction strategies to maximise the accuracy of metabolic pathway membership assignment. The prediction framework outlined here can be applied to other species and serves as a baseline model for future comparisons.  more » « less
Award ID(s):
1655386 1546617
PAR ID:
10449972
Author(s) / Creator(s):
 ;  ;  ;  ;  ;  
Publisher / Repository:
Wiley-Blackwell
Date Published:
Journal Name:
New Phytologist
Volume:
231
Issue:
1
ISSN:
0028-646X
Page Range / eLocation ID:
p. 475-489
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Summary In plants, the biosynthetic pathways of some specialized metabolites are partitioned into specialized or rare cell types, as exemplified by the monoterpenoid indole alkaloid (MIA) pathway ofCatharanthus roseus(Madagascar Periwinkle), the source of the anticancer compounds vinblastine and vincristine. In the leaf, theC. roseusMIA biosynthetic pathway is partitioned into three cell types with the final known steps of the pathway expressed in the rare cell type termed idioblast. How cell‐type specificity of MIA biosynthesis is achieved is poorly understood.We generated single‐cell multi‐omics data fromC. roseusleaves. Integrating gene expression and chromatin accessibility profiles across single cells, as well as transcription factor (TF)‐binding site profiles, we constructed a cell‐type‐aware gene regulatory network for MIA biosynthesis.We showcased cell‐type‐specific TFs as well as cell‐type‐specificcis‐regulatory elements. Using motif enrichment analysis, co‐expression across cell types, and functional validation approaches, we discovered a novel idioblast‐specific TF (Idioblast MYB1,CrIDM1) that activates expression of late‐stage MIA biosynthetic genes in the idioblast.These analyses not only led to the discovery of the first documented cell‐type‐specific TF that regulates the expression of two idioblast‐specific biosynthetic genes within an idioblast metabolic regulon but also provides insights into cell‐type‐specific metabolic regulation. 
    more » « less
  2. Summary The molecular mechanisms of quantitative resistance (QR) to fungal pathogens and their relationships with growth pathways are poorly understood.We identified tomato TRK1 (TPK1b Related Kinase1) and determined its functions in tomato QR and plant growth. TRK1 is a receptor‐like cytoplasmic kinase that complexes with tomato LysM Receptor Kinase (SlLYK1).SlLYK1andTRK1are required for chitin‐induced fungal resistance, accumulation of reactive oxygen species, and expression of immune response genes. Notably, TRK1 and SlLYK1 regulate SlMYC2, a major transcriptional regulator of jasmonic acid (JA) responses and fungal resistance, at transcriptional and post‐transcriptional levels.Further, TRK1 is also required for maintenance of proper meristem growth, as revealed by the ectopic meristematic activity, enhanced branching, and altered floral structures inTRK1RNAi plants. Consistently, TRK1 interacts with SlCLV1 and SlWUS, andTRK1RNAi plants show increased expression ofSlCLV3andSlWUSin shoot apices. Interestingly, TRK1 suppresses chitin‐induced gene expression in meristems but promotes expression of the same genes in leaves. SlCLV1 and TRK1 perform contrasting functions in defense but similar functions in plant growth.Overall, through molecular and biochemical interactions with critical regulators, TRK1 links upstream defense and growth signals to downstream factor in fungal resistance and growth homeostasis response regulators. 
    more » « less
  3. Summary In this study, we investigate the genetic mechanisms responsible for the loss of anthocyanins in betalain‐pigmented Caryophyllales, considering our hypothesis of multiple transitions to betalain pigmentation.Utilizing transcriptomic and genomic datasets across 357 species and 31 families, we scrutinize 18 flavonoid pathway genes and six regulatory genes spanning four transitions to betalain pigmentation. We examined evidence for hypotheses of wholesale gene loss, modified gene function, altered gene expression, and degeneration of the MBW (MYB‐bHLH‐WD40) trasnscription factor complex, within betalain‐pigmented lineages.Our analyses reveal that most flavonoid synthesis genes remain conserved in betalain‐pigmented lineages, with the notable exception ofTT19orthologs, essential for the final step in anthocyanidin synthesis, which appear to have been repeatedly and entirely lost. Additional late‐stage flavonoid pathway genes upstream ofTT19also manifest strikingly reduced expression in betalain‐pigmented species. Additionally, we find repeated loss and alteration in the MBW transcription complex essential for canonical anthocyanin synthesis.Consequently, the loss and exclusion of anthocyanins in betalain‐pigmented species appear to be orchestrated through several mechanisms: loss of a key enzyme, downregulation of synthesis genes, and degeneration of regulatory complexes. These changes have occurred iteratively in Caryophyllales, often coinciding with evolutionary transitions to betalain pigmentation. 
    more » « less
  4. Summary Approximately one‐half of all flowering plants express genetically based physiological mechanisms that prevent self‐fertilisation. One such mechanism, termed RNase‐based self‐incompatibility, employs ribonucleases as the pistil component. Although it is widespread, it has only been characterised in a handful of distantly related families, partly due to the difficulties presented by life history traits of many plants, which complicate genetic research. Many species in the cactus family are known to express self‐incompatibility but the underlying mechanisms remain unknown.We demonstrate the utility of a candidate‐based RNA‐seq approach, combined with some unusual features of self‐incompatibility‐causing genes, which we use to uncover the genetic basis of the underlying mechanisms. Specifically, we assembled transcriptomes fromSchlumbergera truncata(crab cactus or false Christmas cactus), and interrogated them for tissue‐specific expression of candidate genes, structural characteristics, correlation with expressed phenotype(s), and phylogenetic placement.The results were consistent with operation of the RNase‐based self‐incompatibility mechanism in Cactaceae.The finding yields additional evidence that the ancestor of nearly all eudicots possessed RNase‐based self‐incompatibility, as well as a clear path to better conservation practices for one of the most charismatic plant families. 
    more » « less
  5. Summary Reflectance spectroscopy is a rapid method for estimating traits and discriminating species. Spectral libraries from herbarium specimens represent an untapped resource for generating broad phenomic datasets across space, time, and taxa.We conducted a proof‐of‐concept study using trait data and spectra from herbarium specimens up to 179 yr old, alongside data from recently dried and pressed leaves. We validated model accuracy and transferability for trait prediction and taxonomic discrimination.Trait models from herbarium spectra predicted leaf mass per area (LMA) withR2 = 0.94 and %RMSE = 4.86%. Models for LMA prediction were transferable between herbarium and pressed spectra, achievingR2 = 0.88, %RMSE = 8.76% for herbarium to pressed spectra, andR2 = 0.76, %RMSE = 10.5% for the reverse transfer. Discriminant models classified leaf spectra from 25 species with 74% accuracy, and classification probabilities were significantly associated with several herbarium specimen quality metrics.The results validate herbarium spectral data for trait prediction and taxonomic discrimination, and demonstrate that trait modeling can benefit from the complementary use of pressed‐leaf and herbarium‐leaf spectral datasets. These promising advancements help to justify the spectral digitization of plant biodiversity collections and support their application in broad ecological and evolutionary investigations. 
    more » « less