skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Commonly asked questions about transcriptional activation domains
Eukaryotic transcription factors activate gene expression with their DNA-binding domains and activation domains. DNA- binding domains bind the genome by recognizing structurally related DNA sequences; they are structured, conserved, and predictable from protein sequences. Activation domains recruit chromatin modifiers, coactivator complexes, or basal tran- scriptional machinery via structurally diverse protein-protein interactions. Activation domains and DNA-binding domains have been called independent, modular units, but there are many departures from modularity, including interactions be- tween these regions and overlap in function. Compared to DNA-binding domains, activation domains are poorly under- stood because they are poorly conserved, intrinsically disor- dered, and difficult to predict from protein sequences. This review, organized around commonly asked questions, de- scribes recent progress that the field has made in under- standing the sequence features that control activation domains and predicting them from sequence.  more » « less
Award ID(s):
2112057
PAR ID:
10510794
Author(s) / Creator(s):
; ;
Publisher / Repository:
Current Opinion in Structural Biology
Date Published:
Journal Name:
Current Opinion in Structural Biology
Volume:
84
Issue:
C
ISSN:
0959-440X
Page Range / eLocation ID:
102732
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Kaplan, C (Ed.)
    Abstract Transcription factors activate gene expression in development, homeostasis, and stress with DNA binding domains and activation domains. Although there exist excellent computational models for predicting DNA binding domains from protein sequence, models for predicting activation domains from protein sequence have lagged, particularly in metazoans. We recently developed a simple and accurate predictor of acidic activation domains on human transcription factors. Here, we show how the accuracy of this human predictor arises from the clustering of aromatic, leucine, and acidic residues, which together are necessary for acidic activation domain function. When we combine our predictor with the predictions of convolutional neural network (CNN) models trained in yeast, the intersection is more accurate than individual models, emphasizing that each approach carries orthogonal information. We synthesize these findings into a new set of activation domain predictions on human transcription factors. 
    more » « less
  2. null (Ed.)
    Electrostatic protein/DNA interactions arise from the neutralization of the DNA phosphodiester backbone as well as coupled exchanges by charged protein residues as salt bridges or with mobile ions. Much focus has been and continues to be paid to interfacial ion pairs with DNA. The role of extra-interfacial ionic interactions, particularly as dynamic drivers of DNA sequence selectivity, remain poorly known. The ETS family of transcription factors represents an attractive model for addressing this knowledge gap given their diverse ionic composition in primary structures that fold to a tightly conserved DNA-binding motif. To probe the importance of extra-interfacial salt bridges in DNA recognition, we compared the salt-dependent binding by Elk1 with ETV6, two ETS homologs differing markedly in ionic composition. While both proteins exhibit salt-dependent binding with cognate DNA that corresponds to interfacial phosphate contacts, their nonspecific binding diverges from cognate binding as well as each other. Molecular dynamics simulations in explicit solvent, which generated ionic interactions in agreement with the experimental binding data, revealed distinct salt-bridge dynamics in the nonspecific complexes formed by the two proteins. Impaired DNA contact by ETV6 resulted in fewer backbone contacts in the nonspecific complex, while Elk1 exhibited a redistribution of extra-interfacial salt bridges via residues that are non-conserved between the two ETS relatives. Thus, primary structure variation in ionic residues can encode highly differentiated specificity mechanisms in a highly conserved DNA-binding motif. 
    more » « less
  3. Abstract Sequence-specific activation by transcription factors is essential for gene regulation1,2. Key to this are activation domains, which often fall within disordered regions of transcription factors3,4and recruit co-activators to initiate transcription5. These interactions are difficult to characterize via most experimental techniques because they are typically weak and transient6,7. Consequently, we know very little about whether these interactions are promiscuous or specific, the mechanisms of binding, and how these interactions tune the strength of gene activation. To address these questions, we developed a microfluidic platform for expression and purification of hundreds of activation domains in parallel followed by direct measurement of co-activator binding affinities (STAMMPPING, for Simultaneous Trapping of Affinity Measurements via a Microfluidic Protein-Protein INteraction Generator). By applying STAMMPPING to quantify direct interactions between eight co-activators and 204 human activation domains (>1,500Kds), we provide the first quantitative map of these interactions and reveal 334 novel binding pairs. We find that the metazoan-specific co-activator P300 directly binds >100 activation domains, potentially explaining its widespread recruitment across the genome to influence transcriptional activation. Despite sharing similar molecular properties (e.g.enrichment of negative and hydrophobic residues), activation domains utilize distinct biophysical properties to recruit certain co-activator domains. Co-activator domain affinity and occupancy are well-predicted by analytical models that account for multivalency, andin vitroaffinities quantitatively predict activation in cells with an ultrasensitive response. Not only do our results demonstrate the ability to measure affinities between even weak protein-protein interactions in high throughput, but they also provide a necessary resource of over 1,500 activation domain/co-activator affinities which lays the foundation for understanding the molecular basis of transcriptional activation. 
    more » « less
  4. The transcriptional anti-silencing and DNA-binding protein, VirB, is essential for the virulence of Shigella species and, yet, sequences required for VirB-DNA binding are poorly understood. While a 7-8 bp VirB-binding site has been proposed, it was derived from studies at a single VirB-dependent promoter, icsB. Our previous in vivo studies at a different VirB-dependent promoter, icsP, found that the proposed VirB-binding site was insufficient for regulation. Instead, the required site was found to be organized as a near-perfect inverted repeat separated by a single nucleotide spacer. Thus, the proposed 7-8 bp VirB-binding site needed to be re-evaluated. Here, we engineer and validate a molecular tool to capture protein-DNA binding interactions in vivo. Our data show that a sequence organized as a near-perfect inverted repeat is required for VirB-DNA binding interactions in vivo at both the icsB and icsP promoters. Furthermore, the previously proposed VirB-binding site and multiple sites found as a result of its description (i.e., sites located at the virB, virF, spa15, and virA promoters) are not sufficient for VirB to bind in vivo using this tool. The implications of these findings are discussed. 
    more » « less
  5. Abstract One of key features of intrinsically disordered regions (IDRs) is facilitation of protein–protein and protein–nucleic acids interactions. These disordered binding regions include molecular recognition features (MoRFs), short linear motifs (SLiMs) and longer binding domains. Vast majority of current predictors of disordered binding regions target MoRFs, with a handful of methods that predict SLiMs and disordered protein-binding domains. A new and broader class of disordered binding regions, linear interacting peptides (LIPs), was introduced recently and applied in the MobiDB resource. LIPs are segments in protein sequences that undergo disorder-to-order transition upon binding to a protein or a nucleic acid, and they cover MoRFs, SLiMs and disordered protein-binding domains. Although current predictors of MoRFs and disordered protein-binding regions could be used to identify some LIPs, there are no dedicated sequence-based predictors of LIPs. To this end, we introduce CLIP, a new predictor of LIPs that utilizes robust logistic regression model to combine three complementary types of inputs: co-evolutionary information derived from multiple sequence alignments, physicochemical profiles and disorder predictions. Ablation analysis suggests that the co-evolutionary information is particularly useful for this prediction and that combining the three inputs provides substantial improvements when compared to using these inputs individually. Comparative empirical assessments using low-similarity test datasets reveal that CLIP secures area under receiver operating characteristic curve (AUC) of 0.8 and substantially improves over the results produced by the closest current tools that predict MoRFs and disordered protein-binding regions. The webserver of CLIP is freely available at http://biomine.cs.vcu.edu/servers/CLIP/ and the standalone code can be downloaded from http://yanglab.qd.sdu.edu.cn/download/CLIP/. 
    more » « less