Title: Using Restriction Endonuclease, Protection, Selection, and Amplification to Identify Preferred DNA-Binding Sequences of Microbial Transcription Factors
ABSTRACT Regulation of gene expression is a vital component of cellular biology. Transcription factor proteins often bind regulatory DNA sequences upstream of transcription start sites to facilitate the activation or repression of RNA polymerase. Research laboratories have devoted many projects to understanding the transcription regulatory networks for transcription factors, as these regulated genes provide critical insight into the biology of the host organism. Various in vivo and in vitro assays have been developed to elucidate transcription regulatory networks. Several assays, including SELEX-seq and ChIP-seq, capture DNA-bound transcription factors to determine the preferred DNA-binding sequences, which can then be mapped to the host organism’s genome to identify candidate regulatory genes. In this protocol, we describe an alternative in vitro, iterative selection approach to ascertaining DNA-binding sequences of a transcription factor of interest using restriction endonuclease, protection, selection, and amplification (REPSA). Unlike traditional antibody-based capture methods, REPSA selects for transcription factor-bound DNA sequences by challenging binding reactions with a type IIS restriction endonuclease. Cleavage-resistant DNA species are amplified by PCR and then used as inputs for the next round of REPSA. This process is repeated until a protected DNA species is observed by gel electrophoresis, which is an indication of a successful REPSA experiment. Subsequent high-throughput sequencing of REPSA-selected DNAs accompanied by motif discovery and scanning analyses can be used for determining transcription factor consensus binding sequences and potential regulated genes, providing critical first steps in determining organisms’ transcription regulatory networks.
IMPORTANCE Transcription regulatory proteins are an essential class of proteins that help maintain cellular homeostasis by adapting the transcriptome based on environmental cues. Dysregulation of transcription factors can lead to diseases such as cancer, and many eukaryotic and prokaryotic transcription factors have become enticing therapeutic targets. Additionally, in many understudied organisms, the transcription regulatory networks for uncharacterized transcription factors remain unknown. As such, the need for experimental techniques to establish transcription regulatory networks is paramount. Here, we describe a step-by-step protocol for REPSA, an inexpensive, iterative selection technique to identify transcription factor-binding sequences without the need for antibody-based capture methods.
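The motif scanning step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: it simply assumes that a consensus written in IUPAC degenerate codes is translated into a regular expression and matched exactly against both strands of a sequence. The function names (`reverse_complement`, `consensus_to_regex`, `scan`) and the toy sequence are hypothetical; the consensus 5′–AACNAACGTTNGTT–3′ is the TTHA0973 recognition sequence quoted in one of the related records listed on this page.

```python
import re

# IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}

# Complement table that also covers the degenerate codes.
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")


def reverse_complement(seq):
    """Reverse complement of a DNA string (IUPAC codes allowed)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def consensus_to_regex(consensus):
    """Translate an IUPAC consensus into a regex pattern string."""
    return "".join(IUPAC[base] for base in consensus.upper())


def scan(sequence, consensus):
    """Return sorted (start, strand, site) tuples for exact consensus matches.

    The "-" strand is searched by matching the reverse complement of the
    consensus against the given sequence, so a palindromic consensus is
    reported once per strand at the same position.
    """
    sequence = sequence.upper()
    hits = set()
    for strand, motif in (("+", consensus), ("-", reverse_complement(consensus))):
        # A lookahead lets the search report overlapping sites.
        pattern = re.compile("(?=(" + consensus_to_regex(motif) + "))")
        for match in pattern.finditer(sequence):
            hits.add((match.start(), strand, match.group(1)))
    return sorted(hits)


if __name__ == "__main__":
    # Toy sequence invented for illustration; not real genomic data.
    toy = "GGCC" + "AACTAACGTTAGTT" + "CCGG"
    for start, strand, site in scan(toy, "AACNAACGTTNGTT"):
        print(start, strand, site)
```

Exact matching is the simplest possible choice. Scanning in practice more often scores sites against a position weight matrix derived from the selected sequences, which tolerates mismatches that a regular expression would reject.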
Award ID(s):
2208795 2041202
NSF-PAR ID:
10413687
Author(s) / Creator(s):
;
Editor(s):
Polen, Tino
Date Published:
Journal Name:
Microbiology Spectrum
Volume:
11
Issue:
1
ISSN:
2165-0497
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Transcription factors (TFs) have been extensively researched in certain well-studied organisms, but far less so in others. Following the whole-genome sequencing of a new organism, TFs are typically identified through their homology with related proteins in other organisms. However, recent findings demonstrate that structurally similar TFs from distantly related bacteria are not usually evolutionary orthologs. Here we explore TTHB099, a cAMP receptor protein (CRP)-family TF from the extremophile Thermus thermophilus HB8. Using the in vitro iterative selection method Restriction Endonuclease Protection, Selection and Amplification (REPSA), we identified the preferred DNA-binding motif for TTHB099, 5′–TGT(A/g)NBSYRSVN(T/c)ACA–3′, and mapped potential binding sites and regulated genes within the T. thermophilus HB8 genome. Comparisons with expression profile data in TTHB099-deficient and wild-type strains suggested that, unlike E. coli CRP (CRPEc), TTHB099 does not have a simple regulatory mechanism. However, we hypothesize that TTHB099 can be a dual regulator similar to CRPEc.
  2. Champion, Patricia A. (Ed.)
    ABSTRACT D-block metal cations are essential for most biological processes; however, excessive metal exposure can be deleterious to the survival of microorganisms. To tightly control heavy metal regulation, prokaryotic organisms have developed several mechanisms to sense and adapt to changes in intracellular and extracellular metal concentrations. The ferric uptake regulator superfamily of transcription factors associates with DNA when complexed with a regulatory metal cofactor and often represses the transcription of genes involved in metal transport, thus providing a genomic response to an environmental stressor. Although extensively studied in mesothermic organisms, there is little information describing ferric uptake regulator homologs in thermophiles. In this study, we biochemically characterize the ferric uptake regulator homolog TTHA1292 in the extreme thermophile Thermus thermophilus HB8. We identify the preferred DNA-binding sequence of TTHA1292 using the combinatorial approach, restriction endonuclease, protection, selection, and amplification (REPSA). We map this sequence to the Thermus thermophilus HB8 genome and identify the TTHA1292 transcription regulatory network, which includes the zinc ABC transporter subunit genes TTHA0596 and TTHA0453/4. We formally implicate TTHA1292 as a zinc uptake regulator and show that zinc coordination is critical for the multimerization of TTHA1292 dimers on DNA in vitro and transcription repression in vivo.
    IMPORTANCE Discovering how organisms sense and adapt to their environments is paramount to understanding biology. Thermophilic organisms have adapted to survive at elevated temperatures (>50°C); however, our understanding of how these organisms adapt to changes in their environment is limited. In this study, we identify a zinc uptake regulator in the extreme thermophile Thermus thermophilus HB8 that provides a genomic response to fluctuations in zinc availability. These results provide insights into thermophile biology, as well as the zinc uptake regulator family of proteins.
  3. Advances in genomic sequencing have allowed the identification of a multitude of genes encoding putative transcriptional regulatory proteins. What is often lacking is a fuller understanding of the biological roles played by these proteins and of the genes they regulate (their regulon). Conventionally, this is achieved through a genetic approach involving manipulation of the putative transcription factor gene and observation of changes in the organism’s transcriptome. However, such an approach is not always feasible or can yield misleading findings. Here, we describe a biochemistry-centric approach, involving identification of preferred DNA-binding sequences for the Thermus thermophilus HB8 transcriptional repressor TTHA0973 using the selection method Restriction Endonuclease Protection, Selection and Amplification (REPSA), massively parallel sequencing, and bioinformatic analyses. We identified a consensus TTHA0973 recognition sequence of 5′–AACnAACGTTnGTT–3′ that exhibited nanomolar binding affinity. This sequence was mapped to several sites within the T. thermophilus HB8 genome, a subset of which corresponded to promoter regions regulating genes involved in phenylacetic acid degradation. These studies further demonstrate the utility of a biochemistry-centric approach for the facile identification of potential biological functions for orphan transcription factors in a variety of organisms.
  4. null (Ed.)
    Transcription regulatory proteins, also known as transcription factors, function as molecular switches modulating the first step in gene expression, transcription initiation. Cyclic-AMP receptor proteins (CRPs) and fumarate and nitrate reduction regulators (FNRs) compose the CRP/FNR superfamily of transcription factors, regulating gene expression in response to a spectrum of stimuli. In the present work, a reverse-genetic methodology was applied to the study of TTHA1359, one of four CRP/FNR superfamily transcription factors in the model organism Thermus thermophilus HB8. Restriction Endonuclease Protection, Selection, and Amplification (REPSA) followed by next-generation sequencing techniques and bioinformatic motif discovery allowed identification of a DNA-binding consensus for TTHA1359, 5′–AWTGTRA(N)6TYACAWT–3′, which TTHA1359 binds to with high affinity. By bioinformatically mapping the consensus to the T. thermophilus HB8 genome, several potential regulatory TTHA1359-binding sites were identified and validated in vitro. The findings contribute to the knowledge of TTHA1359 regulatory activity within T. thermophilus HB8 and demonstrate the effectiveness of a reverse-genetic methodology in the study of putative transcription factors. 
  5. Plants respond to wounding stress by changing gene expression patterns and inducing the production of hormones including jasmonic acid. This wounding transcriptional response activates specialized metabolism pathways such as the glucosinolate pathways in Arabidopsis thaliana. While the regulatory factors and sequences controlling a subset of wound-response genes are known, it remains unclear how wound response is regulated globally. Here, we address how these responses are regulated by incorporating putative cis-regulatory elements, known transcription factor binding sites, in vitro DNA affinity purification sequencing, and DNase I hypersensitive sites to predict genes with different wound-response patterns using machine learning. We observed that regulatory sites and regions of open chromatin differed between genes upregulated at early and late wounding time-points as well as between genes induced by jasmonic acid and those not induced. Expanding on what we currently know, we identified cis-elements that improved model predictions of expression clusters over known binding sites. Using a combination of genome editing, in vitro DNA-binding assays, and transient expression assays using native and mutated cis-regulatory elements, we experimentally validated four of the predicted elements, three of which were not previously known to function in wound-response regulation. Our study provides a global model predictive of wound response and identifies new regulatory sequences important for wounding without requiring prior knowledge of the transcriptional regulators.