skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: SHICEDO: single-cell Hi-C data enhancement with reduced over-smoothing
Abstract MotivationSingle-cell Hi-C (scHi-C) technologies have significantly advanced our understanding of the 3D genome organization. However, scHi-C data are often sparse and noisy, leading to substantial computational challenges in downstream analyses. ResultsIn this study, we introduce SHICEDO, a novel deep-learning model specifically designed to enhance scHi-C contact matrices by imputing missing or sparsely captured chromatin contacts through a generative adversarial framework. SHICEDO leverages the unique structural characteristics of scHi-C matrices to derive customized features that enable effective data enhancement. Additionally, the model incorporates a channel-wise attention mechanism to mitigate the over-smoothing issue commonly associated with scHi-C enhancement methods. Through simulations and real-data applications, we demonstrate that SHICEDO outperforms the state-of-the-art methods, achieving superior quantitative and qualitative results. Moreover, SHICEDO enhances key structural features in scHi-C data, thus enabling more precise delineation of chromatin structures such as A/B compartments, TAD-like domains, and chromatin loops. Availability and implementationSHICEDO is publicly available at https://github.com/wmalab/SHICEDO.  more » « less
Award ID(s):
1751317 2215705
PAR ID:
10650868
Author(s) / Creator(s):
; ; ; ; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Bioinformatics
Volume:
41
Issue:
12
ISSN:
1367-4803
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract MotivationSingle-cell Hi-C (scHi-C) data provide critical insights into chromatin interactions at individual cell levels, uncovering unique genomic 3D structures. However, scHi-C datasets are characterized by sparsity and noise, complicating efforts to accurately reconstruct high-resolution chromosomal structures. In this study, we present ScUnicorn, a novel blind super-resolution framework for scHi-C data enhancement. ScUnicorn uses an iterative degradation kernel optimization process, unlike traditional super-resolution approaches, which rely on downsampling, predefined degradation ratios, or constant assumptions about the input data to reconstruct high-resolution interaction matrices. Hence, our approach more reliably preserves critical biological patterns and minimizes noise. Additionally, we propose 3DUnicorn, a maximum likelihood algorithm that leverages the enhanced scHi-C data to infer precise 3D chromosomal structures. ResultsOur evaluation demonstrates that ScUnicorn achieves superior performance over the state-of-the-art methods in terms of Peak Signal-to-Noise Ratio, Structural Similarity Index Measure, and GenomeDisco scores. Moreover, 3DUnicorn’s reconstructed structures align closely with experimental 3D-FISH data, underscoring its biological relevance. Together, ScUnicorn and 3DUnicorn provide a robust framework for advancing genomic research by enhancing scHi-C data fidelity and enabling accurate 3D genome structure reconstruction. Availability and implementationUnicorn implementation is publicly accessible at https://github.com/OluwadareLab/Unicorn. 
    more » « less
  2. Abstract Single‐cell chromatin conformation capture (scHi‐C) techniques have evolved to provide significant insights into the structural organization and regulatory mechanisms in individual cells. Although many scHi‐C protocols have been developed, they often involve intricate procedures and the resulting data are sparse, leading to computational challenges for systematic data analysis and limited applicability. This review provides a comprehensive overview, quantitative evaluation of thirteen protocols and practical guidance on computational topics. It is first assessed the efficiency of these protocols based on the total number of contacts recovered per cell and thecis/transratio. It is then provided systematic considerations for scHi‐C quality control and data imputation. Additionally, the capabilities and implementations of various analysis methods, covering cell clustering, A/B compartment calling, topologically associating domain (TAD) calling, loop calling, 3D reconstruction, scHi‐C data simulation and differential interaction analysis is summarized. It is further highlighted key computational challenges associated with the specific complexities of scHi‐C data and propose potential solutions. 
    more » « less
  3. Abstract MotivationHigh-throughput conformation capture experiments, such as Hi-C provide genome-wide maps of chromatin interactions, enabling life scientists to investigate the role of the three-dimensional structure of genomes in gene regulation and other essential cellular functions. A fundamental problem in the analysis of Hi-C data is how to compare two contact maps derived from Hi-C experiments. Detecting similarities and differences between contact maps are critical in evaluating the reproducibility of replicate experiments and for identifying differential genomic regions with biological significance. Due to the complexity of chromatin conformations and the presence of technology-driven and sequence-specific biases, the comparative analysis of Hi-C data is analytically and computationally challenging. ResultsWe present a novel method called Selfish for the comparative analysis of Hi-C data that takes advantage of the structural self-similarity in contact maps. We define a novel self-similarity measure to design algorithms for (i) measuring reproducibility for Hi-C replicate experiments and (ii) finding differential chromatin interactions between two contact maps. Extensive experimental results on simulated and real data show that Selfish is more accurate and robust than state-of-the-art methods. Availability and implementationhttps://github.com/ucrbioinfo/Selfish 
    more » « less
  4. Abstract SummaryHere, we presented the scHiCDiff software tool that provides both nonparametric tests and parametirc models to detect differential chromatin interactions (DCIs) from single-cell Hi-C data. We thoroughly evaluated the scHiCDiff methods on both simulated and real data. Our results demonstrated that scHiCDiff, especially the zero-inflated negative binomial model option, can effectively detect reliable and consistent single-cell DCIs between two conditions, thereby facilitating the study of cell type-specific variations of chromatin structures at the single-cell level. Availability and implementationscHiCDiff is implemented in R and freely available at GitHub (https://github.com/wmalab/scHiCDiff). 
    more » « less
  5. Abstract BackgroundAnalysis of the relationship between chromosomal structural variation (synteny breaks) and 3D-chromatin architectural changes among closely related species has the potential to reveal causes and correlates between chromosomal change and chromatin remodeling. Of note, contrary to extensive studies in animal species, the pace and pattern of chromatin architectural changes following the speciation of plants remain unexplored; moreover, there is little exploration of the occurrence of synteny breaks in the context of multiple genome topological hierarchies within the same model species. ResultsHere we used Hi-C and epigenomic analyses to characterize and compare the profiles of hierarchical chromatin architectural features in representative species of the cotton tribe (Gossypieae), includingGossypium arboreum,Gossypium raimondii, andGossypioides kirkii, which differ with respect to chromosome rearrangements. We found that (i) overall chromatin architectural territories were preserved inGossypioidesandGossypium, which was reflected in their similar intra-chromosomal contact patterns and spatial chromosomal distributions; (ii) the non-random preferential occurrence of synteny breaks in A compartment significantly associate with the B-to-A compartment switch in syntenic blocks flanking synteny breaks; (iii) synteny changes co-localize with open-chromatin boundaries of topologically associating domains, while TAD stabilization has a greater influence on regulating orthologous expression divergence than do rearrangements; and (iv) rearranged chromosome segments largely maintain ancestralin-cisinteractions. ConclusionsOur findings provide insights into the non-random occurrence of epigenomic remodeling relative to the genomic landscape and its evolutionary and functional connections to alterations of hierarchical chromatin architecture, on a known evolutionary timescale. 
    more » « less