Title: The cis-regulatory codes of response to combined heat and drought stress in Arabidopsis thaliana
Abstract Plants respond to their environment by dynamically modulating gene expression. A powerful approach for understanding how these responses are regulated is to integrate information about cis-regulatory elements (CREs) into models called cis-regulatory codes. Transcriptional response to combined stress is typically not the sum of the responses to the individual stresses. However, cis-regulatory codes underlying combined stress response have not been established. Here we modeled transcriptional response to single and combined heat and drought stress in Arabidopsis thaliana. We grouped genes by their pattern of response (independent, antagonistic and synergistic) and trained machine learning models to predict their response using putative CREs (pCREs) as features (median F-measure = 0.64). We then developed a deep learning approach to integrate additional omics information (sequence conservation, chromatin accessibility and histone modification) into our models, improving performance by 6.2%. While pCREs important for predicting independent and antagonistic responses tended to resemble binding motifs of transcription factors associated with heat and/or drought stress, important synergistic pCREs resembled binding motifs of transcription factors not known to be associated with stress. These findings demonstrate how in silico approaches can improve our understanding of the complex codes regulating response to combined stress and help us identify prime targets for future characterization.
Award ID(s):
1655386 1546617
NSF-PAR ID:
10222401
Journal Name:
NAR Genomics and Bioinformatics
Volume:
2
Issue:
3
ISSN:
2631-9268
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Isoprene has recently been proposed to be a signaling molecule that can enhance tolerance of both biotic and abiotic stress. Not all plants make isoprene, but all plants tested to date respond to isoprene. We hypothesized that isoprene interacts with existing signaling pathways rather than requiring novel mechanisms for its effect on plants. We analyzed the cis‐regulatory elements (CREs) in promoters of isoprene‐responsive genes and the corresponding transcription factors binding these promoter elements to obtain clues about the transcription factors and other proteins involved in isoprene signaling. Promoter regions of isoprene‐responsive genes were characterized using the Arabidopsis cis‐regulatory element database. CREs that bind ARR1, Dof, DPBF, bHLH112, GATA factors, GT‐1, MYB, and WRKY transcription factors, and light‐responsive elements were overrepresented in promoters of isoprene‐responsive genes; CBF‐, HSF‐, and WUS‐binding motifs were underrepresented. Transcription factors corresponding to CREs overrepresented in promoters of isoprene‐responsive genes were mainly those important for stress responses: drought‐, salt/osmotic‐, oxidative‐, herbivory/wounding‐, and pathogen‐stress. More than half of the isoprene‐responsive genes contained at least one binding site for TFs of the class IV (homeodomain leucine zipper) HD‐ZIP family, such as GL2, ATML1, PDF2, HDG11, and ATHB17. While the HD‐zipper‐loop‐zipper (ZLZ) domain binds to the L1 box of the promoter region, a special domain called the steroidogenic acute regulatory protein‐related lipid transfer, or START domain, can bind ligands such as fatty acids (e.g., linolenic and linoleic acid). We tested whether isoprene might bind in such a START domain. Molecular simulations and modeling to test interactions between isoprene and a class IV HD‐ZIP family START‐domain‐containing protein were carried out.
Without membrane penetration by the HDG11 START domain, isoprene within the lipid bilayer was inaccessible to this domain, preventing protein interactions with membrane‐bound isoprene. The cross‐talk between isoprene‐mediated signaling and other growth regulator and stress signaling pathways, in terms of common CREs and transcription factors, could enhance the stability of the isoprene emission trait when it evolves in a plant, but so far it has not been possible to say how isoprene is sensed to initiate signaling responses.

     
  2. Summary

    Adverse environmental conditions reduce crop productivity and often increase the load of unfolded or misfolded proteins in the endoplasmic reticulum (ER). This potentially lethal condition, known as ER stress, is buffered by the unfolded protein response (UPR), a set of signaling pathways designed to either recover ER functionality or ignite programmed cell death. Despite the biological significance of the UPR to the life of the organism, the regulatory transcriptional landscape underpinning ER stress management is largely unmapped, especially in crops. To fill this significant knowledge gap, we performed a large‐scale systems‐level analysis of the protein–DNA interaction (PDI) network in maize (Zea mays). Using 23 promoter fragments of six UPR marker genes in a high‐throughput enhanced yeast one‐hybrid assay, we identified a highly interconnected network of 262 transcription factors (TFs) associated with significant biological traits and 831 PDIs underlying the UPR. We established a temporal hierarchy of TF binding to gene promoters within the same family as well as across different families of TFs. Cistrome analysis revealed the dynamic activities of a variety of cis‐regulatory elements (CREs) in ER stress‐responsive gene promoters. By integrating the cistrome results into a TF network analysis, we mapped a subnetwork of TFs associated with a CRE that may contribute to UPR management. Finally, we validated the role of a predicted network hub gene using the Arabidopsis system. The PDIs, TF networks, and CREs identified in our work are foundational resources for understanding transcription‐regulatory mechanisms in the stress responses and crop improvement.

     
  3. Abstract

    Mutations in cis-regulatory regions play an important role in the domestication and improvement of crops by altering gene expression. However, assessing the in vivo impact of cis-regulatory elements (CREs) on transcriptional regulation and phenotypic outcomes remains challenging. Previously, we showed that the dominant Barren inflorescence3 (Bif3) mutant of maize (Zea mays) contains a duplicated copy of the homeobox transcription factor gene ZmWUSCHEL1 (ZmWUS1), named ZmWUS1-B. ZmWUS1-B is controlled by a spontaneously generated novel promoter region that dramatically increases its expression and alters patterning and development of young ears. Overexpression of ZmWUS1-B is caused by a unique enhancer region containing multimerized binding sites for type B RESPONSE REGULATORs (RRs), key transcription factors in cytokinin signaling. To better understand how the enhancer increases the expression of ZmWUS1 in vivo, we specifically targeted the ZmWUS1-B enhancer region by CRISPR-Cas9-mediated editing. A series of deletion events with different numbers of type B RR DNA binding motifs (AGATAT) enabled us to determine how the number of AGATAT motifs impacts in vivo expression of ZmWUS1-B and consequently ear development. In combination with dual-luciferase assays in maize protoplasts, our analysis reveals that AGATAT motifs have an additive effect on ZmWUS1-B expression, while the distance separating AGATAT motifs does not appear to have a meaningful impact, indicating that the enhancer activity derives from the sum of individual CREs. These results also suggest that in maize inflorescence development, there is a threshold of buffering capacity for ZmWUS1 overexpression.

     
  4. Falush, Daniel (Ed.)
    Abstract Although some variation introgressed from Neanderthals has undergone selective sweeps, little is known about its functional significance. We used a Massively Parallel Reporter Assay (MPRA) to assay 5,353 high-frequency introgressed variants for their ability to modulate the gene expression within 170 bp of endogenous sequence. We identified 2,548 variants in active putative cis-regulatory elements (CREs) and 292 expression-modulating variants (emVars). These emVars are predicted to alter the binding motifs of important immune transcription factors, are enriched for associations with neutrophil and white blood cell count, and are associated with the expression of genes that function in innate immune pathways including inflammatory response and antiviral defense. We combined the MPRA data with other data sets to identify strong candidates to be driver variants of positive selection including an emVar that may contribute to protection against severe COVID-19 response. We endogenously deleted two CREs containing expression-modulation variants linked to immune function, rs11624425 and rs80317430, identifying their primary genic targets as ELMSAN1, and PAN2 and STAT2, respectively, three genes differentially expressed during influenza infection. Overall, we present the first database of experimentally identified expression-modulating Neanderthal-introgressed alleles contributing to potential immune response in modern humans. 
  5. Abstract Changes in gene expression are important for responses to abiotic stress. Transcriptome profiling of heat- or cold-stressed maize genotypes identifies many changes in transcript abundance. We used comparisons of expression responses in multiple genotypes to identify alleles with variable responses to heat or cold stress and to distinguish examples of cis- or trans-regulatory variation for stress-responsive expression changes. We used motifs enriched near the transcription start sites (TSSs) for thermal stress-responsive genes to develop predictive models of gene expression responses. Prediction accuracies can be improved by focusing only on motifs within unmethylated regions near the TSS and vary for genes with different dynamic responses to stress. Models trained on expression responses in a single genotype and promoter sequences provided lower performance when applied to other genotypes but this could be improved by using models trained on data from all three genotypes tested. The analysis of genes with cis-regulatory variation provides evidence for structural variants that result in presence/absence of transcription factor binding sites in creating variable responses. This study provides insights into cis-regulatory motifs for heat- and cold-responsive gene expression and defines a framework for developing models to predict expression responses across multiple genotypes. 