Summary In plants, the biosynthetic pathways of some specialized metabolites are partitioned into specialized or rare cell types, as exemplified by the monoterpenoid indole alkaloid (MIA) pathway ofCatharanthus roseus(Madagascar Periwinkle), the source of the anticancer compounds vinblastine and vincristine. In the leaf, theC. roseusMIA biosynthetic pathway is partitioned into three cell types with the final known steps of the pathway expressed in the rare cell type termed idioblast. How cell‐type specificity of MIA biosynthesis is achieved is poorly understood.We generated single‐cell multi‐omics data fromC. roseusleaves. Integrating gene expression and chromatin accessibility profiles across single cells, as well as transcription factor (TF)‐binding site profiles, we constructed a cell‐type‐aware gene regulatory network for MIA biosynthesis.We showcased cell‐type‐specific TFs as well as cell‐type‐specificcis‐regulatory elements. Using motif enrichment analysis, co‐expression across cell types, and functional validation approaches, we discovered a novel idioblast‐specific TF (Idioblast MYB1,CrIDM1) that activates expression of late‐stage MIA biosynthetic genes in the idioblast.These analyses not only led to the discovery of the first documented cell‐type‐specific TF that regulates the expression of two idioblast‐specific biosynthetic genes within an idioblast metabolic regulon but also provides insights into cell‐type‐specific metabolic regulation.
more »
« less
Transcription factor enrichment analysis (TFEA) quantifies the activity of multiple transcription factors from a single experiment
Abstract Detecting changes in the activity of a transcription factor (TF) in response to a perturbation provides insights into the underlying cellular process. Transcription Factor Enrichment Analysis (TFEA) is a robust and reliable computational method that detects positional motif enrichment associated with changes in transcription observed in response to a perturbation. TFEA detects positional motif enrichment within a list of ranked regions of interest (ROIs), typically sites of RNA polymerase initiation inferred from regulatory data such as nascent transcription. Therefore, we also introducemuMerge, a statistically principled method of generating a consensus list of ROIs from multiple replicates and conditions. TFEA is broadly applicable to data that informs on transcriptional regulation including nascent transcription (eg. PRO-Seq), CAGE, histone ChIP-Seq, and accessibility data (e.g., ATAC-Seq). TFEA not only identifies the key regulators responding to a perturbation, but also temporally unravels regulatory networks with time series data. Consequently, TFEA serves as a hypothesis-generating tool that provides an easy, rigorous, and cost-effective means to broadly assess TF activity yielding new biological insights.
more »
« less
- Award ID(s):
- 1759949
- PAR ID:
- 10233493
- Publisher / Repository:
- Nature Publishing Group
- Date Published:
- Journal Name:
- Communications Biology
- Volume:
- 4
- Issue:
- 1
- ISSN:
- 2399-3642
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Regulating transcription allows organisms to respond to their environment, both within a single generation (plasticity) and across generations (adaptation). We examined transcriptional differences in gill tissues of fishes in thePoecilia mexicanaspecies complex (family Poeciliidae), which have colonized toxic springs rich in hydrogen sulfide (H2S) in southern Mexico. There are gene expression differences between sulfidic and non-sulfidic populations, yet regulatory mechanisms mediating this gene expression variation remain poorly studied. We combined capped-small RNA sequencing (csRNA-seq), which captures actively transcribed (i.e. nascent) transcripts, and messenger RNA sequencing (mRNA-seq) to examine how variation in transcription, enhancer activity, and associated transcription factor binding sites may facilitate adaptation to extreme environments. csRNA-seq revealed thousands of differentially initiated transcripts between sulfidic and non-sulfidic populations, many of which are involved in H2S detoxification and response. Analyses of transcription factor binding sites in promoter and putative enhancer csRNA-seq peaks identified a suite of transcription factors likely involved in regulating H2S-specific shifts in gene expression, including several key transcription factors known to respond to hypoxia. Our findings uncover a complex interplay of regulatory processes that reflect the divergence of extremophile populations ofP. mexicanafrom their non-sulfidic ancestors and suggest shared responses among evolutionarily independent lineages.more » « less
-
Short tandem repeats (STRs) are enriched in eukaryoticcis-regulatory elements and alter gene expression, yet how they regulate transcription remains unknown. We found that STRs modulate transcription factor (TF)–DNA affinities and apparent on-rates by about 70-fold by directly binding TF DNA-binding domains, with energetic impacts exceeding many consensus motif mutations. STRs maximize the number of weakly preferred microstates near target sites, thereby increasing TF density, with impacts well predicted by statistical mechanics. Confirming that STRs also affect TF binding in cells, neural networks trained only on in vivo occupancies predicted effects identical to those observed in vitro. Approximately 90% of TFs preferentially bound STRs that need not resemble known motifs, providing a cis-regulatory mechanism to target TFs to genomic sites.more » « less
-
Abstract DNA–transcription factor (TF) interactions are essential for gene regulation. Fully characterizing TF recognition specificities and identifying their genomic binding targets are important to understand TF function and regulatory networks. Recently, high-throughput sequencing technology HT-SELEX (high-throughput systematic evolution of ligands by exponential enrichment) has been used to measure hundreds of TFs, providing massive datasets that comprise TF binding preferences. However, there is a need to develop comprehensive computational modeling to fully extract and characterize critical TF binding preferences and fail to distinguish genome-wide binding targets. In this study, we developed a global pairwise model called DCA-Scapes trained with experimental HT-SELEX data. Our approach uncovered high-resolution TF recognition specificity landscapes, enabled the prediction of in vivo binding sequences, and was validated with ChIP-seq (ChIP sequencing) data. In addition, the DCA-Scapes model was utilized to refine the locations of binding regions and accurately identify the binding sites within the ChIP-seq enriched peaks. Moreover, we extended our model to cover the entire human genome, uncovering potential TF target sites that exhibit tissue-specific TF recognition across various cellular environments.more » « less
-
Abstract Transcription is the primary regulatory step in gene expression. Divergent transcription initiation from promoters and enhancers produces stable RNAs from genes and unstable RNAs from enhancers1,2. Nascent RNA capture and sequencing assays simultaneously measure gene and enhancer activity in cell populations3. However, fundamental questions about the temporal regulation of transcription and enhancer–gene coordination remain unanswered, primarily because of the absence of a single-cell perspective on active transcription. In this study, we present scGRO–seq—a new single-cell nascent RNA sequencing assay that uses click chemistry—and unveil coordinated transcription throughout the genome. We demonstrate the episodic nature of transcription and the co-transcription of functionally related genes. scGRO–seq can estimate burst size and frequency by directly quantifying transcribing RNA polymerases in individual cells and can leverage replication-dependent non-polyadenylated histone gene transcription to elucidate cell cycle dynamics. The single-nucleotide spatial and temporal resolution of scGRO–seq enables the identification of networks of enhancers and genes. Our results suggest that the bursting of transcription at super-enhancers precedes bursting from associated genes. By imparting insights into the dynamic nature of global transcription and the origin and propagation of transcription signals, we demonstrate the ability of scGRO–seq to investigate the mechanisms of transcription regulation and the role of enhancers in gene expression.more » « less
An official website of the United States government
