Abstract Changes in gene expression are important for responses to abiotic stress. Transcriptome profiling of heat- or cold-stressed maize genotypes identifies many changes in transcript abundance. We used comparisons of expression responses in multiple genotypes to identify alleles with variable responses to heat or cold stress and to distinguish examples of cis- or trans-regulatory variation for stress-responsive expression changes. We used motifs enriched near the transcription start sites (TSSs) for thermal stress-responsive genes to develop predictive models of gene expression responses. Prediction accuracies can be improved by focusing only on motifs within unmethylated regions near the TSS and vary for genes with different dynamic responses to stress. Models trained on expression responses in a single genotype and promoter sequences provided lower performance when applied to other genotypes but this could be improved by using models trained on data from all three genotypes tested. The analysis of genes with cis-regulatory variation provides evidence for structural variants that result in presence/absence of transcription factor binding sites in creating variable responses. This study provides insights into cis-regulatory motifs for heat- and cold-responsive gene expression and defines a framework for developing models to predict expression responses across multiple genotypes.
more »
« less
Predicting transcriptional responses to cold stress across plant species
Although genome-sequence assemblies are available for a growing number of plant species, gene-expression responses to stimuli have been cataloged for only a subset of these species. Many genes show altered transcription patterns in response to abiotic stresses. However, orthologous genes in related species often exhibit different responses to a given stress. Accordingly, data on the regulation of gene expression in one species are not reliable predictors of orthologous gene responses in a related species. Here, we trained a supervised classification model to identify genes that transcriptionally respond to cold stress. A model trained with only features calculated directly from genome assemblies exhibited only modest decreases in performance relative to models trained by using genomic, chromatin, and evolution/diversity features. Models trained with data from one species successfully predicted which genes would respond to cold stress in other related species. Cross-species predictions remained accurate when training was performed in cold-sensitive species and predictions were performed in cold-tolerant species and vice versa. Models trained with data on gene expression in multiple species provided at least equivalent performance to models trained and tested in a single species and outperformed single-species models in cross-species prediction. These results suggest that classifiers trained on stress data from well-studied species may suffice for predicting gene-expression patterns in related, less-studied species with sequenced genomes.
more »
« less
- Award ID(s):
- 1845175
- PAR ID:
- 10320744
- Date Published:
- Journal Name:
- Proceedings of the National Academy of Sciences
- Volume:
- 118
- Issue:
- 10
- ISSN:
- 0027-8424
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Transcription factors (TFs) play a central role in regulating molecular level responses of plants to external stresses such as water limiting conditions, but identification of such TFs in the genome remains a challenge. Here, we describe a network-based supervised machine learning framework that accurately predicts and ranks all TFs in the genome according to their potential association with drought tolerance. We show that top ranked regulators fall mainly into two ‘age’ groups; genes that appeared first in land plants and genes that emerged later in the Oryza clade. TFs predicted to be high in the ranking belong to specific gene families, have relatively simple intron/exon and protein structures, and functionally converge to regulate primary and secondary metabolism pathways. Repeated trials of nested cross-validation tests showed that models trained only on regulatory network patterns, inferred from large transcriptome datasets, outperform models trained on heterogenous genomic features in the prediction of known drought response regulators. A new R/Shiny based web application, called the DroughtApp, provides a primer for generation of new testable hypotheses related to regulation of drought stress response. Furthermore, to test the system we experimentally validated predictions on the functional role of the rice transcription factor OsbHLH148, using RNA sequencing of knockout mutants in response to drought stress and protein-DNA interaction assays. Our study exemplifies the integration of domain knowledge for prioritization of regulatory genes in biological pathways of well-studied agricultural traits.more » « less
-
Irfan, Mohammad (Ed.)Drought is a significant environmental stressor that severely impairs plant growth and agricultural productivity. Unraveling the molecular mechanisms underlying plant responses to drought is crucial for developing crops with enhanced resilience. In this study, we investigated the transcriptomic responses of cultivated tomato (Solanum lycopersicum) and its drought-tolerant wild relative,Solanum pennellii, to identify “stress-ready” gene expression patterns associated with pre-adaptation to arid environments. Through RNA-seq analysis, we identified orthologous genes between the two species and compared their transcriptomic profiles under both control and drought conditions. Approximately 43% of the orthologous genes exhibited species-specific expression patterns, while nearly 20% were classified as stress-ready. These stress-ready genes were significantly enriched for functions related to nucleosome assembly, RNA metabolism, and transcriptional regulation. Furthermore, transcription factor binding motif analysis revealed a marked enrichment of ERF family motifs, emphasizing their role in both stress-ready and species-specific responses. Our findings indicate that regulatory mechanisms, particularly those mediated by ERF transcription factors, are pivotal to the drought resilience ofS. pennellii, providing a foundation for future crop improvement strategies.more » « less
-
SUMMARY Extreme dryness is lethal for nearly all plants, excluding the so‐called resurrection plants, which evolved vegetative desiccation tolerance (VDT) by recruiting genes common in most plants. To better understand the evolution of VDT, we generated chromosome‐level assemblies and improved genome annotations of twoSelaginellaspecies with contrasting abilities to survive desiccation. We identified genomic features and critical mechanisms associated with VDT through sister‐group comparative genomics integrating multi‐omics data. Our findings indicate thatSelaginellaevolved VDT through the expansion of some stress protection‐related gene families and the contraction of senescence‐related genes. Comparative analyses revealed that desiccation‐tolerantSelaginellaspecies employ a combination of constitutive and inducible protection mechanisms to survive desiccation. We show that transcriptional priming of stress tolerance‐related genes and accumulation of flavonoids in unstressed plants are hallmarks of VDT inSelaginella. During water loss, the resurrectionSelaginellainduces phospholipids and glutathione metabolism, responses that are missing in the desiccation‐sensitive species. Additionally, gene regulatory network analyses indicate the suppression of growth processes as a major component of VDT. This study presents novel perspectives on how gene dosage impacts crucial protective mechanisms and the regulation of central processes to survive extreme dehydration.more » « less
-
Harris, T (Ed.)Abstract Potato is a key food crop with a complex, polyploid genome. Advancements in sequencing technologies coupled with improvements in genome assembly algorithms have enabled generation of phased, chromosome-scale genome assemblies for cultivated tetraploid potato. The SpudDB database houses potato genome sequence and annotation, with the doubled monoploid DM 1–3 516 R44 (hereafter DM) genome serving as the reference genome and haplotype. Diverse annotation data types for DM genes are provided through a suite of Gene Report Pages including gene expression profiles across 438 potato samples. To further annotate potato genes based on expression, 65 gene co-expression modules were constructed that permit the identification of tightly co-regulated genes within DM across development and responses to wounding, abiotic stress, and biotic stress. Genome browser views of DM and 28 other potato genomes are provided along with a download page for genome sequence and annotation. To link syntenic genes within and between haplotypes, syntelogs were identified across 25 cultivated potato genomes. Through access to potato genome sequences and associated annotations, SpudDB can enable potato biologists, geneticists, and breeders to continue to improve this key food crop.more » « less
An official website of the United States government

