skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Plant Reactome Knowledgebase: empowering plant pathway exploration and OMICS data analysis
Abstract Plant Reactome (https://plantreactome.gramene.org) is a freely accessible, comprehensive plant pathway knowledgebase. It provides curated reference pathways from rice (Oryza sativa) and gene-orthology-based pathway projections to 129 additional species, spanning single-cell photoautotrophs, non-vascular plants, and higher plants, thus encompassing a wide-ranging taxonomic diversity. Currently, Plant Reactome houses a collection of 339 reference pathways, covering metabolic and transport pathways, hormone signaling, genetic regulations of developmental processes, and intricate transcriptional networks that orchestrate a plant's response to abiotic and biotic stimuli. Beyond being a mere repository, Plant Reactome serves as a dynamic data discovery platform. Users can analyze and visualize omics data, such as gene expression, gene-gene interaction, proteome, and metabolome data, all within the rich context of plant pathways. Plant Reactome is dedicated to fostering data interoperability, upholding global data standards, and embracing the tenets of the Findable, Accessible, Interoperable and Re-usable (FAIR) data policy.  more » « less
Award ID(s):
2029854
PAR ID:
10475090
Author(s) / Creator(s):
; ; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Nucleic Acids Research
Volume:
52
Issue:
D1
ISSN:
0305-1048
Format(s):
Medium: X Size: p. D1538-D1547
Size(s):
p. D1538-D1547
Sponsoring Org:
National Science Foundation
More Like this
  1. Summary Plant metabolites from diverse pathways are important for plant survival, human nutrition and medicine. The pathway memberships of most plant enzyme genes are unknown. While co‐expression is useful for assigning genes to pathways, expression correlation may exist only under specific spatiotemporal and conditional contexts.Utilising > 600 tomato (Solanum lycopersicum) expression data combinations, three strategies for predicting memberships in 85 pathways were explored.Optimal predictions for different pathways require distinct data combinations indicative of pathway functions. Naive prediction (i.e. identifying pathways with the most similarly expressed genes) is error prone. In 52 pathways, unsupervised learning performed better than supervised approaches, possibly due to limited training data availability. Using gene‐to‐pathway expression similarities led to prediction models that outperformed those based simply on expression levels. Using 36 experimental validated genes, the pathway‐best model prediction accuracy is 58.3%, significantly better compared with that for predicting annotated genes without experimental evidence (37.0%) or random guess (1.2%), demonstrating the importance of data quality.Our study highlights the need to extensively explore expression‐based features and prediction strategies to maximise the accuracy of metabolic pathway membership assignment. The prediction framework outlined here can be applied to other species and serves as a baseline model for future comparisons. 
    more » « less
  2. Abstract Plant innate immunity relies on nucleotide binding leucine-rich repeat receptors (NLRs) that recognize pathogen-derived molecules and activate downstream signaling pathways. We analyzed the variation in NLR gene copy number and identified plants with a low number of NLR genes relative to sister species. We specifically focused on four plants from two distinct lineages, one monocot lineage (Alismatales) and one eudicot lineage (Lentibulariaceae). In these lineages, the loss of NLR genes coincides with loss of the well-known downstream immune signaling complex ENHANCED DISEASE SUSCEPTIBILITY 1 (EDS1)/PHYTOALEXIN DEFICIENT 4 (PAD4). We expanded our analysis across whole proteomes and found that other characterized immune genes were absent only in Lentibulariaceae and Alismatales. Additionally, we identified genes of unknown function that were convergently lost together with EDS1/PAD4 in five plant species. Gene expression analyses in Arabidopsis (Arabidopsis thaliana) and Oryza sativa revealed that several homologs of the candidates are differentially expressed during pathogen infection, drought, and abscisic acid treatment. Our analysis provides evolutionary evidence for the rewiring of plant immunity in some plant lineages, as well as the coevolution of the EDS1/PAD4 pathway and drought responses. 
    more » « less
  3. Abstract Discovering natural product biosynthetic pathways of medicinal plants is challenging and laborious. Capturing the coregulation patterns of pathway enzymes, particularly transcriptomic regulation, has proven an effective method to accelerate pathway identification. In this study, we developed a yeast‐based screening method to capture the protein‐protein interactions (PPI) between plant enzymes, which is another useful pattern to complement the prevalent approach. Combining this method with plant multiomics analysis, we discovered four enzyme complexes and their organized pathways from kratom, an alkaloid‐producing plant. The four pathway branches involved six enzymes, including a strictosidine synthase, a strictosidine β‐D‐glucosidase (MsSGD), and four medium‐chain dehydrogenase/reductases (MsMDRs). PPI screening selected six MsMDRs interacting with MsSGD from 20 candidates predicted by multiomics analysis. Four of the six MsMDRs were then characterized as functional, indicating the high selectivity of the PPI screening method. This study highlights the opportunity of leveraging post‐translational regulation features to discover novel plant natural product biosynthetic pathways. 
    more » « less
  4. Background/Objectives: Pathway annotations of non-macromolecular (relatively small) biomolecules facilitate biological and biomedical interpretation of metabolomics datasets. However, low pathway annotation levels of detected biomolecules hinder this type of interpretation. Thus, predicting the pathway involvement of detected but unannotated biomolecules has a high potential to improve metabolomics data analysis and omics integration. Past publications have only made use of the Kyoto Encyclopedia of Genes and Genomes-derived datasets to develop machine learning models to predict pathway involvement. However, to our knowledge, the Reactome knowledgebase has not been utilized to develop these types of predictive models. Methods: We created a dataset ready for machine learning using chemical representations of all pathway-annotated compounds available from the Reactome knowledgebase. Next, we trained and evaluated a multilayer perceptron binary classifier using combined metabolite-pathway paired feature vectors engineered from this new dataset. Results: While models trained on a prior corresponding KEGG dataset with 502 pathways scored a mean Matthew’s correlation coefficient (MCC) of 0.847 and a 0.0098 standard deviation, the models trained on the Reactome dataset with 3985 pathways demonstrated improved performance with a mean MCC of 0.916, but with a higher standard deviation of 0.0149. Conclusions: These results indicate that the pathways in Reactome can also be effectively predicted, greatly increasing the number of human-defined pathways available for prediction. 
    more » « less
  5. Abstract Heterotrimeric G-protein complexes comprising Gα-, Gβ-, and Gγ-subunits and the regulator of G-protein signaling (RGS) are conserved across most eukaryotic lineages. Signaling pathways mediated by these proteins influence overall growth, development, and physiology. In plants, this protein complex has been characterized primarily from angiosperms with the exception of spreading-leaved earth moss (Physcomitrium patens) and Chara braunii (charophytic algae). Even within angiosperms, specific G-protein components are missing in certain species, whereas unique plant-specific variants—the extra-large Gα (XLGα) and the cysteine-rich Gγ proteins—also exist. The distribution and evolutionary history of G-proteins and their function in nonangiosperm lineages remain mostly unknown. We explored this using the wealth of available sequence data spanning algae to angiosperms representing extant species that diverged approximately 1,500 million years ago, using BLAST, synteny analysis, and custom-built Hidden Markov Model profile searches. We show that a minimal set of components forming the XLGαβγ trimer exists in the entire land plant lineage, but their presence is sporadic in algae. Additionally, individual components have distinct evolutionary histories. The XLGα exhibits many lineage-specific gene duplications, whereas Gα and RGS show several instances of gene loss. Similarly, Gβ remained constant in both number and structure, but Gγ diverged before the emergence of land plants and underwent changes in protein domains, which led to three distinct subtypes. These results highlight the evolutionary oddities and summarize the phyletic patterns of this conserved signaling pathway in plants. They also provide a framework to formulate pertinent questions on plant G-protein signaling within an evolutionary context. 
    more » « less