Abstract Sequence-specific activation by transcription factors is essential for gene regulation1,2. Key to this are activation domains, which often fall within disordered regions of transcription factors3,4and recruit co-activators to initiate transcription5. These interactions are difficult to characterize via most experimental techniques because they are typically weak and transient6,7. Consequently, we know very little about whether these interactions are promiscuous or specific, the mechanisms of binding, and how these interactions tune the strength of gene activation. To address these questions, we developed a microfluidic platform for expression and purification of hundreds of activation domains in parallel followed by direct measurement of co-activator binding affinities (STAMMPPING, for Simultaneous Trapping of Affinity Measurements via a Microfluidic Protein-Protein INteraction Generator). By applying STAMMPPING to quantify direct interactions between eight co-activators and 204 human activation domains (>1,500Kds), we provide the first quantitative map of these interactions and reveal 334 novel binding pairs. We find that the metazoan-specific co-activator P300 directly binds >100 activation domains, potentially explaining its widespread recruitment across the genome to influence transcriptional activation. Despite sharing similar molecular properties (e.g.enrichment of negative and hydrophobic residues), activation domains utilize distinct biophysical properties to recruit certain co-activator domains. Co-activator domain affinity and occupancy are well-predicted by analytical models that account for multivalency, andin vitroaffinities quantitatively predict activation in cells with an ultrasensitive response. Not only do our results demonstrate the ability to measure affinities between even weak protein-protein interactions in high throughput, but they also provide a necessary resource of over 1,500 activation domain/co-activator affinities which lays the foundation for understanding the molecular basis of transcriptional activation.
more »
« less
Transcription-dependent domain-scale three-dimensional genome organization in the dinoflagellate Breviolum minutum
Abstract Dinoflagellate chromosomes represent a unique evolutionary experiment, as they exist in a permanently condensed, liquid crystalline state; are not packaged by histones; and contain genes organized into tandem gene arrays, with minimal transcriptional regulation. We analyze the three-dimensional genome ofBreviolum minutum, and find large topological domains (dinoflagellate topologically associating domains, which we term ‘dinoTADs’) without chromatin loops, which are demarcated by convergent gene array boundaries. Transcriptional inhibition disrupts dinoTADs, implicating transcription-induced supercoiling as the primary topological force in dinoflagellates.
more »
« less
- Award ID(s):
- 1645164
- PAR ID:
- 10224565
- Publisher / Repository:
- Nature Publishing Group
- Date Published:
- Journal Name:
- Nature Genetics
- Volume:
- 53
- Issue:
- 5
- ISSN:
- 1061-4036
- Page Range / eLocation ID:
- p. 613-617
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Background: Dinoflagellates are taxonomically diverse and ecologically important phytoplankton that are ubiquitously present in marine and freshwater environments. Mostly photosynthetic, dinoflagellates provide the basis of aquatic primary production; most taxa are free-living, while some can form symbiotic and parasitic associations with other organisms. However, knowledge of the molecular mechanisms that underpin the adaptation of these organisms to diverse ecological niches is limited by the scarce availability of genomic data, partly due to their large genome sizes estimated up to 250 Gbp. Currently available dinoflagellate genome data are restricted to Symbiodiniaceae (particularly symbionts of reef-building corals) and parasitic lineages, from taxa that have smaller genome size ranges, while genomic information from more diverse free living species is still lacking. Results: Here, we present two draft diploid genome assemblies of the free-living dinoflagellate Polarella glacialis, isolated from the Arctic and Antarctica. We found that about 68% of the genomes are composed of repetitive sequence, with long terminal repeats likely contributing to intra-species structural divergence and distinct genome sizes (3.0 and 2.7 Gbp). For each genome, guided using full-length transcriptome data, we predicted > 50,000 high-quality protein-coding genes, of which ~40% are in unidirectional gene clusters and ~25% comprise single exons. Multi-genome comparison unveiled genes specific to P. glacialis and a common, putatively bacterial origin of ice-binding domains in cold-adapted dinoflagellates. Conclusions: Our results elucidate how selection acts within the context of a complex genome structure to facilitate local adaptation. Because most dinoflagellate genes are constitutively expressed, Polarella glacialis has enhanced transcriptional responses via unidirectional, tandem duplication of single-exon genes that encode functions critical to survival in cold, low-light polar environments. These genomes provide a foundational reference for future research on dinoflagellate evolution.more » « less
-
Topologically associating domains and the evolution of three‐dimensional genome architecture in riceSUMMARY We examined the nature and evolution of three‐dimensional (3D) genome conformation, including topologically associating domains (TADs), in five genomes within the genusOryza. These included three varieties from subspecies within domesticated Asian riceO. sativaas well as their closely related wild relativesO. rufipogonandO. meridionalis. We used the high‐resolution chromosome conformation capture technique Micro‐C, which we modified for use in rice. Our analysis of rice TADs shows that TAD boundaries have high transcriptional activity, low methylation levels, low transposable element (TE) content, and increased gene density. We also find a significant correlation of expression levels for genes within TADs, suggesting that they do function as genomic domains with shared regulatory features. Our findings indicate that animal and plant TADs may share more commonalities than were initially thought, as evidenced by similar genetic and epigenetic signatures associated with TADs and boundaries. To examine 3D genome divergence, we employed a computer vision‐based algorithm for the comparison of chromatin contact maps and complemented this analysis by assessing the evolutionary conservation of individual TADs and their boundaries. We conclude that overall chromatin organization is conserved in rice, and 3D structural divergence correlates with evolutionary distance between genomes. We also note that individual TADs are not well conserved, even at short evolutionary timescales.more » « less
-
Abstract Antibiotic treatment kills a large portion of a population, while a small, tolerant subpopulation survives. Tolerant bacteria disrupt antibiotic efficacy and increase the likelihood that a population gains antibiotic resistance, a growing health concern. We examined howE. colitranscriptional networks changed in response to lethal ampicillin concentrations. We are the first to apply transcriptional regulatory network (TRN) analysis to antibiotic tolerance by leveraging existing knowledge and our transcriptional data. TRN analysis shows that gene expression changes specific to ampicillin treatment are likely caused by specific sigma and transcription factors typically regulated by proteolysis. These results demonstrate that to survive lethal concentration of ampicillin specific regulatory proteins change activity and cause a coordinated transcriptional response that leverages multiple gene systems.more » « less
-
Abstract Current complex prediction models are the result of fitting deep neural networks, graph convolutional networks or transducers to a set of training data. A key challenge with these models is that they are highly parameterized, which makes describing and interpreting the prediction strategies difficult. We use topological data analysis to transform these complex prediction models into a simplified topological view of the prediction landscape. The result is a map of the predictions that enables inspection of the model results with more specificity than dimensionality-reduction methods such as tSNE and UMAP. The methods scale up to large datasets across different domains. We present a case study of a transformer-based model previously designed to predict expression levels of a piece of DNA in thousands of genomic tracks. When the model is used to study mutations in theBRCA1gene, our topological analysis shows that it is sensitive to the location of a mutation and the exon structure ofBRCA1in ways that cannot be found with tools based on dimensionality reduction. Moreover, the topological framework offers multiple ways to inspect results, including an error estimate that is more accurate than model uncertainty. Further studies show how these ideas produce useful results in graph-based learning and image classification.more » « less
An official website of the United States government
