Title: Nucleotide-level characterization and improvement of L-arabinose- and L-rhamnose-inducible systems in E. coli using a high-throughput approach
Abstract The commonly used arabinose- and rhamnose-inducible Escherichia coli promoters, PBAD and PRha, exhibit tight regulation through activation via their respective transcription factors, AraC and RhaS, alongside the cyclic AMP receptor protein. The mechanisms of these promoters have been characterized on a parts level, but a nucleotide-level analysis has yet to be performed. Therefore, we describe here a massively parallel reporter assay that maps regulatory sites at the nucleotide level. The relative importance of nucleotides in each binding site is revealed, including loci not included in previous annotations. For PBAD, we confirm known sites and reveal novel binding sites involved in modulating gene expression. In PRha, we refine the length and sequence specificity of rhaI half-sites, updating previous annotations and providing nucleotide-level insights into RhaS-mediated regulation. Mutations that lead to increased promoter strength, wider dynamic range, and altered basal expression are identified for both promoters. Engineered versions of the PBAD and PRha promoters based on these data show improvements in dynamic range alongside a seven- and three-fold increase in promoter strength, respectively, with a slight increase in basal expression for the PBAD promoters and no significant increase for PRha. This work expands the genetic parts “toolkit” and increases the understanding of these important, commonly used promoters.
Award ID(s):
1847226
PAR ID:
10582092
Author(s) / Creator(s):
; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Nucleic Acids Research
Volume:
53
Issue:
7
ISSN:
0305-1048
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Svensson, Sarah L (Ed.)
    ABSTRACT In starving Bacillus subtilis bacteria, the initiation of two survival programs (biofilm formation and sporulation) is controlled by the same phosphorylated master regulator, Spo0A~P. Its gene, spo0A, is transcribed from two promoters, Pv and Ps, that are, respectively, regulated by RNA polymerase (RNAP) holoenzymes bearing σA and σH. Notably, transcription is directly autoregulated by Spo0A~P binding sites known as the 0A1, 0A2, and 0A3 boxes, located in between the two promoters. It remains unclear whether, at the onset of starvation, these boxes activate or repress spo0A expression, and whether the Spo0A~P transcriptional feedback plays a role in the increase in spo0A expression. Based on the experimental data of the promoter activities under systematic perturbation of the promoter architecture, we developed a biophysical model of transcriptional regulation of spo0A by Spo0A~P binding to each of the 0A boxes. The model predicts that Spo0A~P binding to its boxes does not affect the RNAP recruitment to the promoters but instead affects the transcriptional initiation rate. Moreover, the effects of Spo0A~P binding to 0A boxes are mainly repressive and saturated early at the onset of starvation. Therefore, the increase in spo0A expression is mainly driven by the increase in RNAP holoenzyme levels. Additionally, we reveal that Spo0A~P affinity to 0A boxes is strongest at 0A3 and weakest at 0A2 and that there are attractive forces between the occupied 0A boxes. Our findings, in addition to clarifying how the sporulation master regulator is controlled, offer a framework to predict regulatory outcomes of complex gene-regulatory mechanisms.
    IMPORTANCE Cell differentiation is often critical for survival. In bacteria, differentiation decisions are controlled by transcriptional master regulators under transcriptional feedback control. Therefore, understanding how master regulators are transcriptionally regulated is required to understand differentiation. However, in many cases, the underlying regulation is complex, with multiple transcription factor binding sites and multiple promoters, making it challenging to dissect the exact mechanisms. Here, we address this problem for the Bacillus subtilis master regulator Spo0A. Using a biophysical model, we quantitatively characterize the effect of individual transcription factor binding sites on each spo0A promoter. Furthermore, the model allows us to identify the specific transcription step that is affected by transcription factor binding. Such a model is promising for the quantitative study of a wide range of master regulators involved in transcriptional feedback.
  2. The transcriptional anti-silencing and DNA-binding protein, VirB, is essential for the virulence of Shigella species and, yet, sequences required for VirB-DNA binding are poorly understood. While a 7-8 bp VirB-binding site has been proposed, it was derived from studies at a single VirB-dependent promoter, icsB. Our previous in vivo studies at a different VirB-dependent promoter, icsP, found that the proposed VirB-binding site was insufficient for regulation. Instead, the required site was found to be organized as a near-perfect inverted repeat separated by a single nucleotide spacer. Thus, the proposed 7-8 bp VirB-binding site needed to be re-evaluated. Here, we engineer and validate a molecular tool to capture protein-DNA binding interactions in vivo. Our data show that a sequence organized as a near-perfect inverted repeat is required for VirB-DNA binding interactions in vivo at both the icsB and icsP promoters. Furthermore, the previously proposed VirB-binding site and multiple sites found as a result of its description (i.e., sites located at the virB, virF, spa15, and virA promoters) are not sufficient for VirB to bind in vivo using this tool. The implications of these findings are discussed. 
  3. Transcription factor (TF)–promoter pairs have been repurposed from native hosts to provide tools to measure intracellular biochemical production titer and dynamically control gene expression. Most often, native TF–promoter systems require rigorous screening to obtain desirable characteristics optimized for biotechnological applications. High-throughput techniques may provide a rational and less labor-intensive strategy to engineer user-defined TF–promoter pairs using fluorescence-activated cell sorting and deep sequencing methods (sort-seq). Based on the designed promoter library’s distribution characteristics, we elucidate sequence–function interactions between the TF and DNA. In this work, we use the sort-seq method to study the sequence–function relationship of a σ54-dependent, butanol-responsive TF–promoter pair, BmoR-PBMO derived from Thauera butanivorans, at the nucleotide level to improve biosensor characteristics, specifically the dynamic range. Promoters from a mutagenized PBMO library were sorted based on gfp expression and subsequently deep sequenced to correlate site-specific sequences with changes in dynamic range. We identified site-specific mutations that increase the sensor output. A double mutant, CA(129,130)TC, and a single mutant, G(205)A, in the PBMO promoter increased the dynamic range 4-fold and 1.65-fold, respectively, compared with the native system. In addition, sort-seq identified essential sites required for the proper function of the σ54-dependent promoter biosensor in the context of the host. This work can enable high-throughput screening methods for strain development.
  4. Abstract ARGONAUTES are the central effector proteins of RNA silencing which bind target transcripts in a small RNA-guided manner. Arabidopsis thaliana has 10 ARGONAUTE (AGO) genes, with specialized roles in RNA-directed DNA methylation, post-transcriptional gene silencing, and antiviral defense. To better understand specialization among AGO genes at the level of transcriptional regulation, we tested a library of 1497 transcription factors for binding to the promoters of AGO1, AGO10, and AGO7 using yeast 1-hybrid assays. A ranked list of candidate DNA-binding TFs revealed binding of the AGO7 promoter by a number of proteins in two families: the miR156-regulated SPL family and the miR319-regulated TCP family, both of which have roles in developmental timing and leaf morphology. Possible functions for SPL and TCP binding are unclear: we showed that these binding sites are not required for the polar expression pattern of AGO7, nor for the function of AGO7 in leaf shape. Normal AGO7 transcription levels and function appear to depend instead on an adjacent 124-bp region. Progress in understanding the structure of this promoter may aid efforts to understand how the conserved AGO7-triggered TAS3 pathway functions in timing and polarity.
  5. Klumpp, Stefan (Ed.)
    Dense arrangements of binding sites within nucleotide sequences can collectively influence downstream transcription rates or initiate biomolecular interactions. For example, natural promoter regions can harbor many overlapping transcription factor binding sites that influence the rate of transcription initiation. Despite the prevalence of overlapping binding sites in nature, rapid design of nucleotide sequences with many overlapping sites remains a challenge. Here, we show that this is an NP-hard problem, coined here as the nucleotide String Packing Problem (SPP). We then introduce a computational technique that efficiently assembles sets of DNA-protein binding sites into dense, contiguous stretches of double-stranded DNA. For the efficient design of nucleotide sequences spanning hundreds of base pairs, we reduce the SPP to an Orienteering Problem with integer distances, and then leverage modern integer linear programming solvers. Our method optimally packs sets of 20–100 binding sites into dense nucleotide arrays of 50–300 base pairs in 0.05–10 seconds. Unlike approximation algorithms or meta-heuristics, our approach finds provably optimal solutions. We demonstrate how our method can generate large sets of diverse sequences suitable for library generation, where the frequency of binding site usage across the returned sequences can be controlled by modulating the objective function. As an example, we then show how adding additional constraints, like the inclusion of sequence elements with fixed positions, allows for the design of bacterial promoters. The nucleotide string packing approach we present can accelerate the design of sequences with complex DNA-protein interactions. When used in combination with synthesis and high-throughput screening, this design strategy could help interrogate how complex binding site arrangements impact either gene expression or biomolecular mechanisms in varied cellular contexts. 