skip to main content


Title: Simple biochemical features underlie transcriptional activation domain diversity and dynamic, fuzzy binding to Mediator
Cells adapt and respond to changes by regulating the activity of their genes. To turn genes on or off, they use a family of proteins called transcription factors. Transcription factors influence specific but overlapping groups of genes, so that each gene is controlled by several transcription factors that act together like a dimmer switch to regulate gene activity. The presence of transcription factors attracts proteins such as the Mediator complex, which activates genes by gathering the protein machines that read the genes. The more transcription factors are found near a specific gene, the more strongly they attract Mediator and the more active the gene is. A specific region on the transcription factor called the activation domain is necessary for this process. The biochemical sequences of these domains vary greatly between species, yet activation domains from, for example, yeast and human proteins are often interchangeable. To understand why this is the case, Sanborn et al. analyzed the genome of baker’s yeast and identified 150 activation domains, each very different in sequence. Three-quarters of them bound to a subunit of the Mediator complex called Med15. Sanborn et al. then developed a machine learning algorithm to predict activation domains in both yeast and humans. This algorithm also showed that negatively charged and greasy regions on the activation domains were essential to be activated by the Mediator complex. Further analyses revealed that activation domains used different poses to bind multiple sites on Med15, a behavior known as ‘fuzzy’ binding. This creates a high overall affinity even though the binding strength at each individual site is low, enabling the protein complexes to remain dynamic. These weak interactions together permit fine control over the activity of several genes, allowing cells to respond quickly and precisely to many changes. The computer algorithm used here provides a new way to identify activation domains across species and could improve our understanding of how living things grow, adapt and evolve. It could also give new insights into mechanisms of disease, particularly cancer, where transcription factors are often faulty.  more » « less
Award ID(s):
2019745
NSF-PAR ID:
10248522
Author(s) / Creator(s):
; ; ; ; ; ; ;
Date Published:
Journal Name:
eLife
Volume:
10
ISSN:
2050-084X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Cells in the brain, liver and skin, as well as many other organs, all contain the same DNA, yet behave in very different ways. This is because before a gene can produce its corresponding protein, it must first be transcribed into messenger RNA. As an organism grows, the transcription of certain genes is switched on or off by regulatory molecules called transcription factors, which guide cells towards a specific ‘fate’. These molecules bind to specific locations within the regulatory regions of DNA, and for decades biologist have tried to use the arrangement of these sites to predict which proteins a cell will make. Theoretical models known as thermodynamic models have been able to successfully predict transcription in bacteria. However, this has proved more challenging to do in eukaryotes, such as yeast, fruit flies and humans. One of the key differences is that DNA in eukaryotes is typically tightly wound into bundles called nucleosomes, which must be disentangled in order for transcription factors to access the DNA. Previous thermodynamic models have suggested that DNA in eukaryotes randomly switches between being in a wound and unwound state. The models assume that once unwound, regulatory proteins stabilize the DNA in this form, making it easier for other transcription factors to bind to the DNA. Now, Eck, Liu et al. have tested some of these models by studying the transcription of a gene involved in the development of fruit flies. The experiments showed that no thermodynamic model could accurately mimic how this gene is regulated in the embryos of fruit flies. This led Eck, Liu et al. to identify a model that is better at predicting the activation pattern of this developmental gene. In this model, instead of just ‘locking’ DNA into an unwound shape, transcription factors can also actively speed up the unwinding of DNA. This improved understanding builds towards the goal of predicting gene regulation, where DNA sequences can be used to tell where and when cell decisions will be made. In the future, this could allow the development of new types of therapies that can regulate transcription in different diseases. 
    more » « less
  2. The human genome contains all the instructions needed to build the human body. However, each human cell does not read all of these instructions, which come in the form of genes encoded in the DNA. Instead, different subsets of genes are switched on in each type of cell, while the rest of the genes are switched off. DNA within human cells is wrapped around proteins called histones, to form hundreds of thousands of structures called nucleosomes. If the DNA that encodes a gene contains a lot of nucleosomes, the DNA is not very accessible and the gene will generally be off; removing the histones or rearranging the nucleosomes can turn the gene on. Each histone contains a region called a tail – because it protrudes like the tail of a cat – that can be chemically modified in dozens of different ways. Particular combinations of histone modifications are thought to signal how the nucleosomes should be arranged so that each gene is properly regulated. However, it is unclear how these combinations of modifications actually work because, historically, it has been difficult to study tails in the context of a nucleosome. Instead most studies had looked at tails that had been removed from the nucleosome. Now, Morrison et al. set out to investigate how one protein, called BPTF, recognizes a specific chemical modification on the tail of a histone, referred to as H3K4me3, in the context of a human nucleosome. Unexpectedly, the experiments showed that the histone-binding domain of BPTF, which binds to H3K4me3, was impeded when the tail was attached to the nucleosome but not when it was removed from the nucleosome. Morrison et al. went on to show that this was because the histone tail is tucked onto the rest of the nucleosome and not easily accessible. Further experiments revealed that additional chemical modifications made the tail more accessible, making it easier for the histone-binding domain to bind. Together these findings show that a combination of histone modifications acts to positively regulate the binding of a regulatory protein to H3K4me3 in the context of the nucleosome by actually regulating the nucleosome itself. The disruption of the histone signals is known to lead to a number of diseases, including cancer, autoimmune disease, and neurological disorders, and these findings could guide further research that may lead to new treatments. Yet first, much more work is needed to investigate how other histone modifications are recognized in the context of the nucleosome, and how the large number of possible combinations of histone signals affects this process. 
    more » « less
  3. Many proteins exhibit a property called ‘allostery’. In allostery, an input signal at a specific site of a protein – such as a molecule binding, or the protein absorbing a photon of light – leads to a change in output at another site far away. For example, the protein might catalyze a chemical reaction faster or bind to another molecule more tightly in the presence of the input signal. This protein ‘remote control’ allows cells to sense and respond to changes in their environment. An ability to rapidly engineer new allosteric mechanisms into proteins is much sought after because this would provide an approach for building biosensors and other useful tools. One common approach to engineering new allosteric regulation is to combine a ‘sensor’ or input region from one protein with an ‘output’ region or domain from another. When researchers engineer allostery using this approach of combining input and output domains from different proteins, the difference in the output when the input is ‘on’ versus ‘off’ is often small, a situation called ‘modest allostery’. McCormick et al. wanted to know how to optimize this domain combination approach to increase the difference in output between the ‘on’ and ‘off’ states. More specifically, McCormick et al. wanted to find out whether swapping out or mutating specific amino acids (each of the individual building blocks that make up a protein) enhances or disrupts allostery. They also wanted to know if there are many possible mutations that change the effectiveness of allostery, or if this property is controlled by just a few amino acids. Finally, McCormick et al. questioned where in a protein most of these allostery-tuning mutations were located. To answer these questions, McCormick et al. engineered a new allosteric protein by inserting a light-sensing domain (input) into a protein involved in metabolism (a metabolic enzyme that produces a biomolecule called a tetrahydrofolate) to yield a light-controlled enzyme. Next, they introduced mutations into both the ‘input’ and ‘output’ domains to see where they had a greater effect on allostery. After filtering out mutations that destroyed the function of the output domain, McCormick et al. found that only about 5% of mutations to the ‘output’ domain altered the allosteric response of their engineered enzyme. In fact, most mutations that disrupted allostery were found near the site where the ‘input’ domain was inserted, while mutations that enhanced allostery were sprinkled throughout the enzyme, often on its protein surface. This was surprising in light of the commonly-held assumption that mutations on protein surfaces have little impact on the activity of the ‘output’ domain. Overall, the effect of individual mutations on allostery was small, but McCormick et al. found that these mutations can sometimes be combined to yield larger effects. McCormick et al.’s results suggest a new approach for optimizing engineered allosteric proteins: by introducing mutations on the protein surface. It also opens up new questions: mechanically, how do surface sites affect allostery? In the future, it will be important to characterize how combinations of mutations can optimize allosteric regulation, and to determine what evolutionary trajectories to high performance allosteric ‘switches’ look like. 
    more » « less
  4. Komeili, Arash (Ed.)
    ABSTRACT Histone proteins are found across diverse lineages of Archaea , many of which package DNA and form chromatin. However, previous research has led to the hypothesis that the histone-like proteins of high-salt-adapted archaea, or halophiles, function differently. The sole histone protein encoded by the model halophilic species Halobacterium salinarum , HpyA, is nonessential and expressed at levels too low to enable genome-wide DNA packaging. Instead, HpyA mediates the transcriptional response to salt stress. Here we compare the features of genome-wide binding of HpyA to those of HstA, the sole histone of another model halophile, Haloferax volcanii . hstA , like hpyA , is a nonessential gene. To better understand HpyA and HstA functions, protein-DNA binding data (chromatin immunoprecipitation sequencing [ChIP-seq]) of these halophilic histones are compared to publicly available ChIP-seq data from DNA binding proteins across all domains of life, including transcription factors (TFs), nucleoid-associated proteins (NAPs), and histones. These analyses demonstrate that HpyA and HstA bind the genome infrequently in discrete regions, which is similar to TFs but unlike NAPs, which bind a much larger genomic fraction. However, unlike TFs that typically bind in intergenic regions, HpyA and HstA binding sites are located in both coding and intergenic regions. The genome-wide dinucleotide periodicity known to facilitate histone binding was undetectable in the genomes of both species. Instead, TF-like and histone-like binding sequence preferences were detected for HstA and HpyA, respectively. Taken together, these data suggest that halophilic archaeal histones are unlikely to facilitate genome-wide chromatin formation and that their function defies categorization as a TF, NAP, or histone. IMPORTANCE Most cells in eukaryotic species—from yeast to humans—possess histone proteins that pack and unpack DNA in response to environmental cues. These essential proteins regulate genes necessary for important cellular processes, including development and stress protection. Although the histone fold domain originated in the domain of life Archaea , the function of archaeal histone-like proteins is not well understood relative to those of eukaryotes. We recently discovered that, unlike histones of eukaryotes, histones in hypersaline-adapted archaeal species do not package DNA and can act as transcription factors (TFs) to regulate stress response gene expression. However, the function of histones across species of hypersaline-adapted archaea still remains unclear. Here, we compare hypersaline histone function to a variety of DNA binding proteins across the tree of life, revealing histone-like behavior in some respects and specific transcriptional regulatory function in others. 
    more » « less
  5. null (Ed.)
    In all higher organisms, life begins with a single cell. During the early stages of development, this single cell grows and divides multiple times to develop into the many different kinds of cells that make up an organism. This is a highly regulated process during which cells receive instructions telling them what kind of cell to become. These instructions are relayed via genes, and a particular combination of activated genes determines the cell’s fate. Specific pieces of DNA, known as enhancers, act as switches that control when and where genes are active, while so-called shadow enhancers are found in groups and work together to turn on the same gene in a similar way. Shadow enhancers are often active during the early stages of life to direct the formation of specialized cells in different parts of the body. But so far, it has been unclear why it is beneficial to the divide the role of activating genes across several shadow enhancers rather than a single one. Here, Waymack et al. examined shadow enhancers around a gene called Kruppel in embryos of the fruit fly Drosophila melanogaster . Manipulating the shadow enhancers showed that they help to make gene activity more resistant to changes. Factors such as fluctuations in temperature have different effects on each shadow enhancer. Having several shadow enhancers working together ensures that, whatever happens, the right genes still get activated. For genes like Kruppel , which are key for healthy development, the ability to withstand unexpected changes is a valuable evolutionary benefit. The study of Waymack et al. reveals why shadow enhancers are involved in the regulation of many genes, which may help to better understand developmental defects. Many conditions caused by such defects are influenced by both genetics and the environment. Genetic illnesses can vary in severity, which may be related to the roles of shadow enhancers. As such, studying shadow enhancers could lead to new approaches for treating genetic diseases. 
    more » « less