skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on November 14, 2025

Title: Noncanonical transcription initiation is primarily tissue specific and epigenetically tuned in paleopolyploid plants
Abstract Alternative transcription initiation (ATI) appears to be a ubiquitous regulatory mechanism of gene expression in eukaryotes. However, the extent to which it affects the products of gene expression and how it evolves and is regulated remain unknown. Here, we report genome-wide identification and analysis of transcription start sites (TSSs) in various soybean (Glycine max) tissues using a survey of transcription initiation at promoter elements with high-throughput sequencing (STRIPE-seq). We defined 193,579 TSS clusters/regions (TSRs) in 37,911 annotated genes, with 56.5% located in canonical regulatory regions and 43.5% from start codons to 3′ untranslated regions, which were responsible for changes in open reading frames of 24,131 genes. Strikingly, 6,845 genes underwent ATI within coding sequences (CDSs). These CDS-TSRs were tissue-specific, did not have TATA-boxes typical of canonical promoters, and were embedded in nucleosome-free regions flanked by nucleosomes with enhanced levels of histone marks potentially associated with intragenic transcriptional initiation, suggesting that ATI within CDSs was epigenetically tuned and associated with tissue-specific functions. Overall, duplicated genes possessed more TSRs, exhibited lower degrees of tissue specificity, and underwent stronger purifying selection than singletons. This study highlights the significance of ATI and the genomic and epigenomic factors shaping the distribution of ATI in CDSs in a paleopolyploid eukaryote.  more » « less
Award ID(s):
2128023
PAR ID:
10558440
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
The Plant Cell
ISSN:
1040-4651
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Ripening is crucial for the development of fleshy fruits that release their seeds following consumption by frugivores and are important contributors to human health and nutritional security. Many genetic ripening regulators have been identified, especially in the model system tomato, yet more remain to be discovered and integrated into comprehensive regulatory models. Most tomato ripening genes have been studied in pericarp tissue, though recent evidence indicates that locule tissue is a site of early ripening-gene activities. Here we identified and functionally characterized an Ethylene Response Factor gene,SlERF.D6, by investigating tomato transcriptome data throughout plant development, emphasizing genes elevated in the locule during fruit development and ripening.SlERF.D6loss-of-function mutants resulting from CRISPR/Cas9 gene editing delayed ripening initiation and carotenoid accumulation in both pericarp and locule tissues. Transcriptome analysis of lines altered inSlERF.D6expression revealed multiple classes of altered genes including ripening regulators, in addition to carotenoid, cell wall and ethylene pathway genes, suggesting comprehensive ripening control. Distinct regulatory patterns in pericarp versus locule tissues were observed indicating tissue-specific activity of this transcription factor. Analysis of SlERF.D6 interaction with target promoters revealed an AP2/ERF transcription factor(SlDEAR2) as a target of SlERF.D6. Furthermore, we show that a third transcription factor gene,SlTCP12, is a target of SlDEAR2, presenting a tri-component module of ripening control. 
    more » « less
  2. Plants are subjected to extreme environmental conditions and must adapt rapidly. The phytohormone abscisic acid (ABA) accumulates during abiotic stress, signaling transcriptional changes that trigger physiological responses. Epigenetic modifications often facilitate transcription, particularly at genes exhibiting temporal, tissue-specific and environmentally-induced expression. In maize ( Zea mays ), MEDIATOR OF PARAMUTATION 1 (MOP1) is required for progression of an RNA-dependent epigenetic pathway that regulates transcriptional silencing of loci genomewide. MOP1 function has been previously correlated with genomic regions adjoining particular types of transposable elements and genic regions, suggesting that this regulatory pathway functions to maintain distinct transcriptional activities within genomic spaces, and that loss of MOP1 may modify the responsiveness of some loci to other regulatory pathways. As critical regulators of gene expression, MOP1 and ABA pathways each regulate specific genes. To determine whether loss of MOP1 impacts ABA-responsive gene expression in maize, mop1-1 and Mop1 homozygous seedlings were subjected to exogenous ABA and RNA-sequencing. A total of 3,242 differentially expressed genes (DEGs) were identified in four pairwise comparisons. Overall, ABA-induced changes in gene expression were enhanced in mop1-1 homozygous plants. The highest number of DEGs were identified in ABA-induced mop1-1 mutants, including many transcription factors; this suggests combinatorial regulatory scenarios including direct and indirect transcriptional responses to genetic disruption ( mop1-1 ) and/or stimulus-induction of a hierarchical, cascading network of responsive genes. Additionally, a modest increase in CHH methylation at putative MOP1-RdDM loci in response to ABA was observed in some genotypes, suggesting that epigenetic variation might influence environmentally-induced transcriptional responses in maize. 
    more » « less
  3. Abstract The discovery of cancer driver mutations is a fundamental goal in cancer research. While many cancer driver mutations have been discovered in the protein-coding genome, research into potential cancer drivers in the non-coding regions showed limited success so far. Here, we present a novel comprehensive framework Dr.Nod for detection of non-coding cis-regulatory candidate driver mutations that are associated with dysregulated gene expression using tissue-matched enhancer-gene annotations. Applying the framework to data from over 1500 tumours across eight tissues revealed a 4.4-fold enrichment of candidate driver mutations in regulatory regions of known cancer driver genes. An overarching conclusion that emerges is that the non-coding driver mutations contribute to cancer by significantly altering transcription factor binding sites, leading to upregulation of tissue-matched oncogenes and down-regulation of tumour-suppressor genes. Interestingly, more than half of the detected cancer-promoting non-coding regulatory driver mutations are over 20 kb distant from the cancer-associated genes they regulate. Our results show the importance of tissue-matched enhancer-gene maps, functional impact of mutations, and complex background mutagenesis model for the prediction of non-coding regulatory drivers. In conclusion, our study demonstrates that non-coding mutations in enhancers play a previously underappreciated role in cancer and dysregulation of clinically relevant target genes. 
    more » « less
  4. Abstract Changes in gene expression are important for responses to abiotic stress. Transcriptome profiling of heat- or cold-stressed maize genotypes identifies many changes in transcript abundance. We used comparisons of expression responses in multiple genotypes to identify alleles with variable responses to heat or cold stress and to distinguish examples of cis- or trans-regulatory variation for stress-responsive expression changes. We used motifs enriched near the transcription start sites (TSSs) for thermal stress-responsive genes to develop predictive models of gene expression responses. Prediction accuracies can be improved by focusing only on motifs within unmethylated regions near the TSS and vary for genes with different dynamic responses to stress. Models trained on expression responses in a single genotype and promoter sequences provided lower performance when applied to other genotypes but this could be improved by using models trained on data from all three genotypes tested. The analysis of genes with cis-regulatory variation provides evidence for structural variants that result in presence/absence of transcription factor binding sites in creating variable responses. This study provides insights into cis-regulatory motifs for heat- and cold-responsive gene expression and defines a framework for developing models to predict expression responses across multiple genotypes. 
    more » « less
  5. Modeling biological processes and genetic-regulatory networks using in silico approaches provides a valuable framework for understanding how genes and associated allelic and genotypic differences result in specific traits. Submergence tolerance is a significant agronomic trait in rice; however, the gene–gene interactions linked with this polygenic trait remain largely unknown. In this study, we constructed a network of 57 transcription factors involved in seed germination and coleoptile elongation under submergence. The gene–gene interactions were based on the co-expression profiles of genes and the presence of transcription factor binding sites in the promoter region of target genes. We also incorporated published experimental evidence, wherever available, to support gene–gene, gene–protein, and protein–protein interactions. The co-expression data were obtained by re-analyzing publicly available transcriptome data from rice. Notably, this network includes OSH1, OSH15, OSH71, Sub1B, ERFs, WRKYs, NACs, ZFP36, TCPs, etc., which play key regulatory roles in seed germination, coleoptile elongation and submergence response, and mediate gravitropic signaling by regulating OsLAZY1 and/or IL2. The network of transcription factors was manually biocurated and submitted to the Plant Reactome Knowledgebase to make it publicly accessible. We expect this work will facilitate the re-analysis/re-use of OMICs data and aid genomics research to accelerate crop improvement. 
    more » « less