skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Genome wide clustering on integrated chromatin states and Micro-C contacts reveals chromatin interaction signatures
Abstract We can now analyze 3D physical interactions of chromatin regions with chromatin conformation capture technologies, in addition to the 1D chromatin state annotations, but methods to integrate this information are lacking. We propose a method to integrate the chromatin state of interacting regions into a vector representation through the contact-weighted sum of chromatin states. Unsupervised clustering on integrated chromatin states and Micro-C contacts reveals common patterns of chromatin interaction signatures. This provides an integrated view of the complex dynamics of concurrent change occurring in chromatin state and in chromatin interaction, adding another layer of annotation beyond chromatin state or Hi-C contact separately.  more » « less
Award ID(s):
1750532
PAR ID:
10546703
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
NAR Genomics and Bioinformatics
Volume:
6
Issue:
4
ISSN:
2631-9268
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract MotivationHigh-throughput conformation capture experiments, such as Hi-C provide genome-wide maps of chromatin interactions, enabling life scientists to investigate the role of the three-dimensional structure of genomes in gene regulation and other essential cellular functions. A fundamental problem in the analysis of Hi-C data is how to compare two contact maps derived from Hi-C experiments. Detecting similarities and differences between contact maps are critical in evaluating the reproducibility of replicate experiments and for identifying differential genomic regions with biological significance. Due to the complexity of chromatin conformations and the presence of technology-driven and sequence-specific biases, the comparative analysis of Hi-C data is analytically and computationally challenging. ResultsWe present a novel method called Selfish for the comparative analysis of Hi-C data that takes advantage of the structural self-similarity in contact maps. We define a novel self-similarity measure to design algorithms for (i) measuring reproducibility for Hi-C replicate experiments and (ii) finding differential chromatin interactions between two contact maps. Extensive experimental results on simulated and real data show that Selfish is more accurate and robust than state-of-the-art methods. Availability and implementationhttps://github.com/ucrbioinfo/Selfish 
    more » « less
  2. Abstract A large-scale application of the “stacked modeling” approach for chromatin state discovery previously provides a single “universal” chromatin state annotation of thehumangenome based jointly on data from many cell and tissue types. Here, we produce an analogous chromatin state annotation formousebased on 901 datasets assaying 14 chromatin marks in 26 cell or tissue types. To characterize each chromatin state, we relate the states to external annotations and compare them to analogously definedhumanstates. We expect the universal chromatin state annotation formouseto be a useful resource for studying this key model organism’s genome. 
    more » « less
  3. Abstract The review begins with a concise description of the principles of phase separation. This is followed by a comprehensive section on phase separation of chromatin, in which we recount the 60 years history of chromatin aggregation studies, discuss the evidence that chromatin aggregation intrinsically is a physiologically relevant liquid–solid phase separation (LSPS) process driven by chromatin self-interaction, and highlight the recent findings that under specific solution conditions chromatin can undergo liquid–liquid phase separation (LLPS) rather than LSPS. In the next section of the review, we discuss how certain chromatin-associated proteins undergo LLPS in vitro and in vivo. Some chromatin-binding proteins undergo LLPS in purified form in near-physiological ionic strength buffers while others will do so only in the presence of DNA, nucleosomes, or chromatin. The final section of the review evaluates the solid and liquid states of chromatin in the nucleus. While chromatin behaves as an immobile solid on the mesoscale, nucleosomes are mobile on the nanoscale. We discuss how this dual nature of chromatin, which fits well the concept of viscoelasticity, contributes to genome structure, emphasizing the dominant role of chromatin self-interaction. 
    more » « less
  4. Abstract MotivationGenome-wide maps of epigenetic modifications are powerful resources for non-coding genome annotation. Maps of multiple epigenetics marks have been integrated into cell or tissue type-specific chromatin state annotations for many cell or tissue types. With the increasing availability of multiple chromatin state maps for biologically similar samples, there is a need for methods that can effectively summarize the information about chromatin state annotations within groups of samples and identify differences across groups of samples at a high resolution. ResultsWe developed CSREP, which takes as input chromatin state annotations for a group of samples. CSREP then probabilistically estimates the state at each genomic position and derives a representative chromatin state map for the group. CSREP uses an ensemble of multi-class logistic regression classifiers that predict the chromatin state assignment of each sample given the state maps from all other samples. The difference in CSREP’s probability assignments for the two groups can be used to identify genomic locations with differential chromatin state assignments. Using groups of chromatin state maps of a diverse set of cell and tissue types, we demonstrate the advantages of using CSREP to summarize chromatin state maps and identify biologically relevant differences between groups at a high resolution. Availability and implementationThe CSREP source code and generated data are available at http://github.com/ernstlab/csrep. Supplementary informationSupplementary data are available at Bioinformatics online. 
    more » « less
  5. 3D genomics methods such as Hi-C and Micro-C have uncovered chromatin loops across the genome and linked these loops to gene regulation. However, these methods only measure 3D interaction probabilities on a relative scale. Here, we overcome this limitation by using live imaging data to calibrate Micro-C in mouse embryonic stem cells, thus obtaining absolute looping probabilities for 36,804 chromatin loops across the genome. We find that the looped state is generally rare, with a mean probability of 2.3% and a maximum of 26% across the quantified loops. On average, CTCF-CTCF loops are stronger than loops between cis-regulatory elements (3.2% vs. 1.1%). Our findings can be extended to human stem cells and differentiated cells under certain assumptions. Overall, we establish an approach for genome-wide absolute loop quantification and report that loops generally occur with low probabilities, generalizing recent live imaging results to the whole genome. 
    more » « less