Machine learning approaches have been applied to identify transcription factor (TF)–DNA interaction important for gene regulation and expression. However, due to the enormous search space of the genome, it is challenging to build models capable of surveying entire reference genomes, especially in species where models were not trained. In this study, we surveyed a variety of methods for classification of epigenomics data in an attempt to improve the detection for 12 members of the auxin response factor (ARF)-binding DNAs from maize and soybean as assessed by DNA Affinity Purification and sequencing (DAP-seq). We used the classification for prediction by minimizing the genome search space by only surveying unmethylated regions (UMRs). For identification of DAP-seq-binding events within the UMRs, we achieved 78.72 % accuracy rate across 12 members of ARFs of maize on average by encoding DNA with count vectorization for k-mer with a logistic regression classifier with up-sampling and feature selection. Importantly, feature selection helps to uncover known and potentially novel ARF-binding motifs. This demonstrates an independent method for identification of TF-binding sites. Finally, we tested the model built with maize DAP-seq data and applied it directly to the soybean genome and found high false-negative rates, which accounted for more than 40 % across the ARF TFs tested. The findings in this study suggest the potential use of various methods to predict TF–DNA interactions within and between species with varying degrees of success.
AUXIN RESPONSE FACTORS (ARFs) are plant-specific transcription factors (TFs) that couple perception of the hormone auxin to gene expression programs essential to all land plants. As with many large TF families, a key question is whether individual members determine developmental specificity by binding distinct target genes. We use DAP-seq to generate genome-wide in vitro TF:DNA interaction maps for fourteen maize ARFs from the evolutionarily conserved A and B clades. Comparative analysis reveal a high degree of binding site overlap for ARFs of the same clade, but largely distinct clade A and B binding. Many sites are however co-occupied by ARFs from both clades, suggesting transcriptional coordination for many genes. Among these, we investigate known QTLs and use machine learning to predict the impact of
- NSF-PAR ID:
- 10154296
- Publisher / Repository:
- Nature Publishing Group
- Date Published:
- Journal Name:
- Nature Communications
- Volume:
- 9
- Issue:
- 1
- ISSN:
- 2041-1723
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Abstract -
Summary Despite well established roles of micro
RNA s in plant development, few aspects have been addressed to understand their effects in seeds especially on lipid metabolism. In this study, we showed that overexpressing microRNA 167A (miR167OE ) in camelina (Camelina sativa ) under a seed‐specific promoter changed fatty acid composition and increased seed size. Specifically, the miR167OE seeds had a lower α‐linolenic acid with a concomitantly higher linoleic acid content than the wild‐type. This decreased level of fatty acid desaturation corresponded to a decreased transcriptional expression of the camelina fatty acid desaturase3 (Cs ) in developing seeds. MiR167 targeted the transcription factor auxin response factor (CsFAD 3ARF 8) in camelina, as had been reported previously in Arabidopsis. Chromatin immunoprecipitation experiments combined with transcriptome analysis indicated that CsARF 8 bound to promoters of camelina andbZIP 67 genes. These transcription factors directly or through theABI 3ABI 3‐bZIP 12 pathway regulateCs expression and affect α‐linolenic acid accumulation. In addition, to decipher the miR167A‐CsFAD 3ARF 8 mediated transcriptional cascade forCs suppression, transcriptome analysis was conducted to implicate mechanisms that regulate seed size in camelina. Expression levels of many genes were altered in miR167FAD 3OE , including orthologs that have previously been identified to affect seed size in other plants. Most notably, genes for seed coat development such as suberin and lignin biosynthesis were down‐regulated. This study provides valuable insights into the regulatory mechanism of fatty acid metabolism and seed size determination, and suggests possible approaches to improve these important traits in camelina. -
SUMMARY Arabidopsis thaliana ABSCISIC ACID INSENSITIVE3 (ABI3) is a transcription factor in the B3 domain family. ABI3, along with B3 domain transcription factors LEAFY COTYLEDON2 (LEC2) and FUSCA3 (FUS3), and LEC1, a subunit of the CCAAT box‐binding complex, form the so‐called LAFL network to control various aspects of seed development and maturation. ABI3 also contributes to the abscisic acid (ABA) response. We report on chromatin immunoprecipitation‐tiling array experiments to map binding sites for ABI3 globally. We also assessed transcriptomes in response to ABI3 by comparing developingabi3‐5 and wild‐type seeds and combined this information to ascertain direct and indirect responsive ABI3 target genes. ABI3 can induce and repress its transcription of target genes directly and some intriguing differences exist incis motifs between these groups of genes. Directly regulated targets reflect the role of ABI3 in seed maturation, desiccation tolerance, entry into a quiescent state and longevity. Interestingly, ABI3 directly represses a gene encoding a microRNA (MIR160B ) that targetsAUXIN RESPONSE FACTOR (ARF )10 andARF16 that are involved in establishment of dormancy. In addition, ABI3, like FUS3, regulates genes encodingMIR156 but while FUS3 only induces genes encoding this product, ABI3 induces these genes during the early stages of seed development, but represses these genes during late development. The interplay between ABI3, the otherLAFL genes, and theVP1/ABI3‐LIKE (VAL ) genes, which are involved in the transition to seedling development are examined and reveal complex interactions controlling development. -
Summary Auxin is widely involved in plant growth and development. However, the molecular mechanism on how auxin carries out this work is unclear. In particular, the effect of auxin on pre‐
mRNA post‐transcriptional regulation is mostly unknown. By using a poly(A) tag (PAT ) sequencing approach,mRNA alternative polyadenylation (APA ) profiles after auxin treatment were revealed. We showed that hundreds of poly(A) site clusters (PAC s) are affected by auxin at the transcriptome level, where auxin reducesPAC distribution in 5′‐untranslated region (UTR ), but increases in the 3′UTR .APA site usage frequencies of 42 genes were switched by auxin, suggesting that auxin affects the choice of poly(A) sites. Furthermore, poly(A) signal selection was altered after auxin treatment. For example, a mutant of poly(A) signal binding proteinCPSF 30 showed altered sensitivity to auxin treatment, indicating interactions between auxin and the poly(A) signal recognition machinery. We also found that auxin activity on lateral root development is likely mediated by altered expression of ,ARF 7 andARF 19 through poly(A) site switches. Our results shed light on the molecular mechanisms of auxin responses relative to its interactions withIAA 14mRNA polyadenylation. -
Abstract Auxin is a hormone that is required for hypocotyl elongation during seedling development. In response to auxin, rapid changes in transcript and protein abundance occur in hypocotyls, and some auxin responsive gene expression is linked to hypocotyl growth. To functionally validate proteomic studies, a reverse genetics screen was performed on mutants in auxin‐regulated proteins to identify novel regulators of plant growth. This uncovered a long hypocotyl mutant, which we called
slim shady , in an annotated insertion line inIMMUNOREGULATORY RNA‐BINDING PROTEIN (IRR ). Overexpression of theIRR gene failed to rescue theslim shady phenotype and characterization of a second T‐DNA allele of IRR found that it had a wild‐type (WT) hypocotyl length. Theslim shady mutant has an elevated expression of numerous genes associated with the brassinosteroid‐auxin‐phytochrome (BAP) regulatory module compared to WT, including transcription factors that regulate brassinosteroid, auxin, and phytochrome pathways. Additionally,slim shady seedlings fail to exhibit a strong transcriptional response to auxin. Using whole genome sequence data and genetic complementation analysis with SALK_015201C, we determined that a novel single nucleotide polymorphism inPHYTOCHROME B was responsible for theslim shady phenotype. This is predicted to induce a frameshift and premature stop codon at leucine 1125, within the histidine kinase‐related domain of the carboxy terminus of PHYB, which is required for phytochrome signaling and function. Genetic complementation analyses withphyb‐9 confirmed thatslim shady is a mutant allele ofPHYB . This study advances our understanding of the molecular mechanisms in seedling development, by furthering our understanding of how light signaling is linked to auxin‐dependent cell elongation. Furthermore, this study highlights the importance of confirming the genetic identity of research material before attributing phenotypes to known mutations sourced from T‐DNA stocks.