skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: GenomicLinks: deep learning predictions of 3D chromatin interactions in the maize genome
Abstract Gene regulation in eukaryotes is partly shaped by the 3D organization of chromatin within the cell nucleus. Distal interactions between cis-regulatory elements and their target genes are widespread, and many causal loci underlying heritable agricultural traits have been mapped to distal non-coding elements. The biology underlying chromatin loop formation in plants is poorly understood. Dissecting the sequence features that mediate distal interactions is an important step toward identifying putative molecular mechanisms. Here, we trained GenomicLinks, a deep learning model, to identify DNA sequence features predictive of 3D chromatin interactions in maize. We found that the presence of binding motifs of specific transcription factor classes, especially bHLH, is predictive of chromatin interaction specificities. Using an in silico mutagenesis approach we show the removal of these motifs from loop anchors leads to reduced interaction probabilities. We were able to validate these predictions with single-cell co-accessibility data from different maize genotypes that harbor natural substitutions in these TF binding motifs. GenomicLinks is currently implemented as an open-source web tool, which should facilitate its wider use in the plant research community.  more » « less
Award ID(s):
1856627
PAR ID:
10580220
Author(s) / Creator(s):
; ; ; ; ; ;
Publisher / Repository:
OUP
Date Published:
Journal Name:
NAR Genomics and Bioinformatics
Volume:
6
Issue:
3
ISSN:
2631-9268
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Chromatin looping is important for gene regulation, and studies of 3D chromatin structure across species and cell types have improved our understanding of the principles governing chromatin looping. However, 3D genome evolution and its relationship with natural selection remains largely unexplored. In mammals, the CTCF protein defines the boundaries of most chromatin loops, and variations in CTCF occupancy are associated with looping divergence. While many CTCF binding sites fall within transposable elements (TEs), their contribution to 3D chromatin structural evolution is unknown. Here we report the relative contributions of TE-driven CTCF binding site expansions to conserved and divergent chromatin looping in human and mouse. We demonstrate that TE-derived CTCF binding divergence may explain a large fraction of variable loops. These variable loops contribute significantly to corresponding gene expression variability across cells and species, possibly by refining sub-TAD-scale loop contacts responsible for cell-type-specific enhancer-promoter interactions. 
    more » « less
  2. Abstract Nuclear compartments are prominent features of 3D chromatin organization, but sequencing depth limitations have impeded investigation at ultra fine-scale. CTCF loops are generally studied at a finer scale, but the impact of looping on proximal interactions remains enigmatic. Here, we critically examine nuclear compartments and CTCF loop-proximal interactions using a combination of in situ Hi-C at unparalleled depth, algorithm development, and biophysical modeling. Producing a large Hi-C map with 33 billion contacts in conjunction with an algorithm for performing principal component analysis on sparse, super massive matrices (POSSUMM), we resolve compartments to 500 bp. Our results demonstrate that essentially all active promoters and distal enhancers localize in the A compartment, even when flanking sequences do not. Furthermore, we find that the TSS and TTS of paused genes are often segregated into separate compartments. We then identify diffuse interactions that radiate from CTCF loop anchors, which correlate with strong enhancer-promoter interactions and proximal transcription. We also find that these diffuse interactions depend on CTCF’s RNA binding domains. In this work, we demonstrate features of fine-scale chromatin organization consistent with a revised model in which compartments are more precise than commonly thought while CTCF loops are more protracted. 
    more » « less
  3. Abstract Three-dimensional (3D) structures of the genome are dynamic, heterogeneous and functionally important. Live cell imaging has become the leading method for chromatin dynamics tracking. However, existing CRISPR- and TALE-based genomic labeling techniques have been hampered by laborious protocols and are ineffective in labeling non-repetitive sequences. Here, we report a versatile CRISPR/Casilio-based imaging method that allows for a nonrepetitive genomic locus to be labeled using one guide RNA. We construct Casilio dual-color probes to visualize the dynamic interactions of DNA elements in single live cells in the presence or absence of the cohesin subunit RAD21. Using a three-color palette, we track the dynamic 3D locations of multiple reference points along a chromatin loop. Casilio imaging reveals intercellular heterogeneity and interallelic asynchrony in chromatin interaction dynamics, underscoring the importance of studying genome structures in 4D. 
    more » « less
  4. Gene expression and complex phenotypes are determined by the activity of cis-regulatory elements. However, an understanding of how extant genetic variants affect cis regulation remains limited. Here, we investigated the consequences of cis-regulatory diversity using single-cell genomics of more than 0.7 million nuclei across 172Zea mays(maize) inbreds. Our analyses pinpointed cis-regulatory elements distinct to domesticated maize and revealed how historical transposon activity has shaped the cis-regulatory landscape. Leveraging population genetics principles, we fine-mapped about 22,000 chromatin accessibility–associated genetic variants with widespread cell type–specific effects. Variants in TEOSINTE BRANCHED1/CYCLOIDEA/PROLIFERATING CELL FACTOR–binding sites were the most prevalent determinants of chromatin accessibility. Finally, integrating chromatin accessibility–associated variants, organismal trait variation, and population differentiation revealed how local adaptation has rewired regulatory networks in unique cellular contexts to alter maize flowering. 
    more » « less
  5. van Steensel, Bas (Ed.)
    Heterochromatin spreading, the expansion of repressive chromatin structure from sequence-specific nucleation sites, is critical for stable gene silencing. Spreading re-establishes gene-poor constitutive heterochromatin across cell cycles but can also invade gene-rich euchromatin de novo to steer cell fate decisions. How chromatin context (i.e. euchromatic, heterochromatic) or different nucleation pathways influence heterochromatin spreading remains poorly understood. Previously, we developed a single-cell sensor in fission yeast that can separately record heterochromatic gene silencing at nucleation sequences and distal sites. Here we couple our quantitative assay to a genetic screen to identify genes encoding nuclear factors linked to the regulation of heterochromatin nucleation and the distal spreading of gene silencing. We find that mechanisms underlying gene silencing distal to a nucleation site differ by chromatin context. For example, Clr6 histone deacetylase complexes containing the Fkh2 transcription factor are specifically required for heterochromatin spreading at constitutive sites. Fkh2 recruits Clr6 to nucleation-distal chromatin sites in such contexts. In addition, we find that a number of chromatin remodeling complexes antagonize nucleation-distal gene silencing. Our results separate the regulation of heterochromatic gene silencing at nucleation versus distal sites and show that it is controlled by context-dependent mechanisms. The results of our genetic analysis constitute a broad community resource that will support further analysis of the mechanisms underlying the spread of epigenetic silencing along chromatin. 
    more » « less