skip to main content


Title: Programmable gene regulation for metabolic engineering using decoy transcription factor binding sites
Abstract Transcription factor decoy binding sites are short DNA sequences that can titrate a transcription factor away from its natural binding site, therefore regulating gene expression. In this study, we harness synthetic transcription factor decoy systems to regulate gene expression for metabolic pathways in Escherichia coli. We show that transcription factor decoys can effectively regulate expression of native and heterologous genes. Tunability of the decoy can be engineered via changes in copy number or modifications to the DNA decoy site sequence. Using arginine biosynthesis as a showcase, we observed a 16-fold increase in arginine production when we introduced the decoy system to steer metabolic flux towards increased arginine biosynthesis, with negligible growth differences compared to the wild type strain. The decoy-based production strain retains high genetic integrity; in contrast to a gene knock-out approach where mutations were common, we detected no mutations in the production system using the decoy-based strain. We further show that transcription factor decoys are amenable to multiplexed library screening by demonstrating enhanced tolerance to pinene with a combinatorial decoy library. Our study shows that transcription factor decoy binding sites are a powerful and compact tool for metabolic engineering.  more » « less
Award ID(s):
1804096
NSF-PAR ID:
10376986
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
Nucleic Acids Research
Volume:
49
Issue:
2
ISSN:
0305-1048
Page Range / eLocation ID:
1163 to 1172
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Transcription factor (TF)–promoter pairs have been repurposed from native hosts to provide tools to measure intracellular biochemical production titer and dynamically control gene expression. Most often, native TF–promoter systems require rigorous screening to obtain desirable characteristics optimized for biotechnological applications. High-throughput techniques may provide a rational and less labor-intensive strategy to engineer user-defined TF–promoter pairs using fluorescence-activated cell sorting and deep sequencing methods (sort-seq). Based on the designed promoter library’s distribution characteristics, we elucidate sequence–function interactions between the TF and DNA. In this work, we use the sort-seq method to study the sequence–function relationship of a σ 54 -dependent, butanol-responsive TF–promoter pair, BmoR-P BMO derived from Thauera butanivorans , at the nucleotide level to improve biosensor characteristics, specifically an improved dynamic range. Activities of promoters from a mutagenized P BMO library were sorted based on gfp expression and subsequently deep sequenced to correlate site-specific sequences with changes in dynamic range. We identified site-specific mutations that increase the sensor output. Double mutant and a single mutant, CA(129,130)TC and G(205)A, in P BMO promoter increased dynamic ranges of 4-fold and 1.65-fold compared with the native system, respectively. In addition, sort-seq identified essential sites required for the proper function of the σ 54 -dependent promoter biosensor in the context of the host. This work can enable high-throughput screening methods for strain development. 
    more » « less
  2. Transcription factor (TF)–promoter pairs have been repurposed from native hosts to provide tools to measure intracellular biochemical production titer and dynamically control gene expression. Most often, native TF–promoter systems require rigorous screening to obtain desirable characteristics optimized for biotechnological applications. High-throughput techniques may provide a rational and less labor-intensive strategy to engineer user-defined TF–promoter pairs using fluorescence-activated cell sorting and deep sequencing methods (sort-seq). Based on the designed promoter library’s distribution characteristics, we elucidate sequence–function interactions between the TF and DNA. In this work, we use the sort-seq method to study the sequence–function relationship of a σ54-dependent, butanol-responsive TF–promoter pair, BmoR-PBMO derived from Thauera butanivorans, at the nucleotide level to improve biosensor characteristics, specifically an improved dynamic range. Activities of promoters from a mutagenized PBMO library were sorted based on gfp expression and subsequently deep sequenced to correlate site-specific sequences with changes in dynamic range. We identified site-specific mutations that increase the sensor output. Double mutant and a single mutant, CA(129,130)TC and G(205)A, in PBMO promoter increased dynamic ranges of 4-fold and 1.65-fold compared with the native system, respectively. In addition, sort-seq identified essential sites required for the proper function of the σ54-dependent promoter biosensor in the context of the host. This work can enable high-throughput screening methods for strain development. 
    more » « less
  3. null (Ed.)
    How homeodomain proteins gain sufficient specificity to control different cell fates has been a long-standing problem in developmental biology. The conserved Gsx homeodomain proteins regulate specific aspects of neural development in animals from flies to mammals, and yet they belong to a large transcription factor family that bind nearly identical DNA sequences in vitro. Here, we show that the mouse and fly Gsx factors unexpectedly gain DNA binding specificity by forming cooperative homodimers on precisely spaced and oriented DNA sites. High-resolution genomic binding assays revealed that Gsx2 binds both monomer and homodimer sites in the developing mouse ventral telencephalon. Importantly, reporter assays showed that Gsx2 mediates opposing outcomes in a DNA binding site-dependent manner: Monomer Gsx2 binding represses transcription, whereas homodimer binding stimulates gene expression. In Drosophila , the Gsx homolog, Ind, similarly represses or stimulates transcription in a site-dependent manner via an autoregulatory enhancer containing a combination of monomer and homodimer sites. Integrating these findings, we test a model showing how the homodimer to monomer site ratio and the Gsx protein levels defines gene up-regulation versus down-regulation. Altogether, these data serve as a new paradigm for how cooperative homeodomain transcription factor binding can increase target specificity and alter regulatory outcomes. 
    more » « less
  4. Transcription factors are proteins that recognize specific DNA sequences and affect local transcriptional processes. They are the primary means by which all organisms control specific gene expression. Understanding which DNA sequences a particular transcription factor recognizes provides important clues into the set of genes that they regulate and, through this, their potential biological functions. Insights may be gained through homology searches and genetic means. However, these approaches can be misleading, especially when comparing distantly related organisms or in cases of complicated transcriptional regulation. In this work, we used a biochemistry-based approach to determine the spectrum of DNA sequences specifically bound by the Thermus thermophilus HB8 TetR-family transcription factor TTHB023. The consensus sequence 5′–(a/c)Y(g/t)A(A/C)YGryCR(g/t)T(c/a)R(g/t)–3′ was found to have a nanomolar binding affinity with TTHB023. Analyzing the T. thermophilus HB8 genome, several TTHB023 consensus binding sites were mapped to the promoters of genes involved in fatty acid biosynthesis. Notably, some of these were not identified previously through genetic approaches, ostensibly given their potential co-regulation by the Thermus thermophilus HB8 TetR-family transcriptional repressor TTHA0167. Our investigation provides additional evidence supporting the usefulness of a biochemistry-based approach for characterizing putative transcription factors, especially in the case of cooperative regulation. 
    more » « less
  5. Goldman, Gustavo H. (Ed.)
    ABSTRACT Fungal secondary metabolites are widely used as therapeutics and are vital components of drug discovery programs. A major challenge hindering discovery of novel secondary metabolites is that the underlying pathways involved in their biosynthesis are transcriptionally silent under typical laboratory growth conditions, making it difficult to identify the transcriptional networks that they are embedded in. Furthermore, while the genes participating in secondary metabolic pathways are typically found in contiguous clusters on the genome, known as biosynthetic gene clusters (BGCs), this is not always the case, especially for global and pathway-specific regulators of pathways’ activities. To address these challenges, we used 283 genome-wide gene expression data sets of the ascomycete cell factory Aspergillus niger generated during growth under 155 different conditions to construct two gene coexpression networks based on Spearman’s correlation coefficients (SCCs) and on mutual rank-transformed Pearson’s correlation coefficients (MR-PCCs). By mining these networks, we predicted six transcription factors, named MjkA to MjkF, to regulate secondary metabolism in A. niger . Overexpression of each transcription factor using the Tet-On cassette modulated the production of multiple secondary metabolites. We found that the SCC and MR-PCC approaches complemented each other, enabling the delineation of putative global (SCC) and pathway-specific (MR-PCC) transcription factors. These results highlight the potential of coexpression network approaches to identify and activate fungal secondary metabolic pathways and their products. More broadly, we argue that drug discovery programs in fungi should move beyond the BGC paradigm and focus on understanding the global regulatory networks in which secondary metabolic pathways are embedded. IMPORTANCE There is an urgent need for novel bioactive molecules in both agriculture and medicine. The genomes of fungi are thought to contain vast numbers of metabolic pathways involved in the biosynthesis of secondary metabolites with diverse bioactivities. Because these metabolites are biosynthesized only under specific conditions, the vast majority of the fungal pharmacopeia awaits discovery. To discover the genetic networks that regulate the activity of secondary metabolites, we examined the genome-wide profiles of gene activity of the cell factory Aspergillus niger across hundreds of conditions. By constructing global networks that link genes with similar activities across conditions, we identified six putative global and pathway-specific regulators of secondary metabolite biosynthesis. Our study shows that elucidating the behavior of the genetic networks of fungi under diverse conditions harbors enormous promise for understanding fungal secondary metabolism, which ultimately may lead to novel drug candidates. 
    more » « less