skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: The nucleosome unwrapping free energy landscape defines distinct regions of transcription factor accessibility and kinetics
Abstract Transcription factors (TF) require access to target sites within nucleosomes to initiate transcription. The target site position within the nucleosome significantly influences TF occupancy, but how is not quantitatively understood. Using ensemble and single-molecule fluorescence measurements, we investigated the targeting and occupancy of the transcription factor, Gal4, at different positions within the nucleosome. We observe a dramatic decrease in TF occupancy to sites extending past 30 base pairs (bp) into the nucleosome which cannot be explained by changes in the TF dissociation rate or binding site orientation. Instead, the nucleosome unwrapping free energy landscape is the primary determinant of Gal4 occupancy by reducing the Gal4 binding rate. The unwrapping free energy landscape defines two distinct regions of accessibility and kinetics with a boundary at 30 bp into the nucleosome where the inner region is over 100-fold less accessible. The Gal4 binding rate in the inner region no longer depends on its concentration because it is limited by the nucleosome unwrapping rate, while the frequency of nucleosome rewrapping decreases because Gal4 exchanges multiple times before the nucleosome rewraps. Our findings highlight the importance of the nucleosome unwrapping free energy landscape on TF occupancy and dynamics that ultimately influences transcription initiation.  more » « less
Award ID(s):
1715321
PAR ID:
10392619
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Nucleic Acids Research
Volume:
51
Issue:
3
ISSN:
0305-1048
Page Range / eLocation ID:
p. 1139-1153
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The chromatin architecture in promoters is thought to regulate gene expression, but it remains uncertain how most transcription factors (TFs) impact nucleosome position. The MuvB TF complex regulates cell-cycle dependent gene-expression and is critical for differentiation and proliferation during development and cancer. MuvB can both positively and negatively regulate expression, but the structure of MuvB and its biochemical function are poorly understood. Here we determine the overall architecture of MuvB assembly and the crystal structure of a subcomplex critical for MuvB function in gene repression. We find that the MuvB subunits LIN9 and LIN37 function as scaffolding proteins that arrange the other subunits LIN52, LIN54 and RBAP48 for TF, DNA, and histone binding, respectively. Biochemical and structural data demonstrate that MuvB binds nucleosomes through an interface that is distinct from LIN54-DNA consensus site recognition and that MuvB increases nucleosome occupancy in a reconstituted promoter. We find in arrested cells that MuvB primarily associates with a tightly positioned +1 nucleosome near the transcription start site (TSS) of MuvB-regulated genes. These results support a model that MuvB binds and stabilizes nucleosomes just downstream of the TSS on its target promoters to repress gene expression. 
    more » « less
  2. null (Ed.)
    Abstract Master transcription factors reprogram cell fate in multicellular eukaryotes. Pioneer transcription factors have prominent roles in this process because of their ability to contact their cognate binding motifs in closed chromatin. Reprogramming is pervasive in plants, whose development is plastic and tuned by the environment, yet little is known about pioneer transcription factors in this kingdom. Here, we show that the master transcription factor LEAFY (LFY), which promotes floral fate through upregulation of the floral commitment factor APETALA1 ( AP1 ), is a pioneer transcription factor. In vitro, LFY binds to the endogenous AP1 target locus DNA assembled into a nucleosome. In vivo, LFY associates with nucleosome occupied binding sites at the majority of its target loci, including AP1 . Upon binding, LFY ‘unlocks’ chromatin locally by displacing the H1 linker histone and by recruiting SWI/SNF chromatin remodelers, but broad changes in chromatin accessibility occur later. Our study provides a mechanistic framework for patterning of inflorescence architecture and uncovers striking similarities between LFY and animal pioneer transcription factor. 
    more » « less
  3. Abstract During gene regulation, DNA accessibility is thought to limit the availability of transcription factor (TF) binding sites, while TFs can increase DNA accessibility to recruit additional factors that upregulate gene expression. Given this interplay, the causative regulatory events in the modulation of gene expression remain unknown for the vast majority of genes. We utilized deeply sequenced ATAC-Seq data and site-specific knock-in reporter genes to investigate the relationship between the binding-site resolution dynamics of DNA accessibility and the expression dynamics of the enhancers of Cebpa during macrophage-neutrophil differentiation. While the enhancers upregulate reporter expression during the earliest stages of differentiation, there is little corresponding increase in their total accessibility. Conversely, total accessibility peaks during the last stages of differentiation without any increase in enhancer activity. The accessibility of positions neighboring C/EBP-family TF binding sites, which indicates TF occupancy, does increase significantly during early differentiation, showing that the early upregulation of enhancer activity is driven by TF binding. These results imply that a generalized increase in DNA accessibility is not sufficient, and binding by enhancer-specific TFs is necessary, for the upregulation of gene expression. Additionally, high-coverage ATAC-Seq combined with time-series expression data can infer the sequence of regulatory events at binding-site resolution. 
    more » « less
  4. The intrinsic DNA sequence preferences and cell type–specific cooperative partners of transcription factors (TFs) are typically highly conserved. Hence, despite the rapid evolutionary turnover of individual TF binding sites, predictive sequence models of cell type–specific genomic occupancy of a TF in one species should generalize to closely matched cell types in a related species. To assess the viability of cross-species TF binding prediction, we train neural networks to discriminate ChIP-seq peak locations from genomic background and evaluate their performance within and across species. Cross-species predictive performance is consistently worse than within-species performance, which we show is caused in part by species-specific repeats. To account for this domain shift, we use an augmented network architecture to automatically discourage learning of training species–specific sequence features. This domain adaptation approach corrects for prediction errors on species-specific repeats and improves overall cross-species model performance. Our results show that cross-species TF binding prediction is feasible when models account for domain shifts driven by species-specific repeats. 
    more » « less
  5. Abstract Transcription factors (TF) are proteins that bind DNA in a sequence-specific manner to regulate gene transcription. Despite their unique intrinsic sequence preferences,in vivogenomic occupancy profiles of TFs differ across cellular contexts. Hence, deciphering the sequence determinants of TF binding, both intrinsic and context-specific, is essential to understand gene regulation and the impact of regulatory, non-coding genetic variation. Biophysical models trained onin vitroTF binding assays can estimate intrinsic affinity landscapes and predict occupancy based on TF concentration and affinity. However, these models cannot adequately explain context-specific,in vivobinding profiles. Conversely, deep learning models, trained onin vivoTF binding assays, effectively predict and explain genomic occupancy profiles as a function of complex regulatory sequence syntax, albeit without a clear biophysical interpretation. To reconcile these complementary models ofin vitroandin vivoTF binding, we developed Affinity Distillation (AD), a method that extracts thermodynamic affinitiesde-novofrom deep learning models of TF chromatin immunoprecipitation (ChIP) experiments by marginalizing away the influence of genomic sequence context. Applied to neural networks modeling diverse classes of yeast and mammalian TFs, AD predicts energetic impacts of sequence variation within and surrounding motifs on TF binding as measured by diversein vitroassays with superior dynamic range and accuracy compared to motif-based methods. Furthermore, AD can accurately discern affinities of TF paralogs. Our results highlight thermodynamic affinity as a key determinant ofin vivobinding, suggest that deep learning models ofin vivobinding implicitly learn high-resolution affinity landscapes, and show that these affinities can be successfully distilled using AD. This new biophysical interpretation of deep learning models enables high-throughputin silicoexperiments to explore the influence of sequence context and variation on both intrinsic affinity andin vivooccupancy. 
    more » « less