skip to main content


Title: GenomeFlow: a comprehensive graphical tool for modeling and analyzing 3D genome structure
Abstract Motivation

Three-dimensional (3D) genome organization plays important functional roles in cells. User-friendly tools for reconstructing 3D genome models from chromosomal conformation capturing data and analyzing them are needed for the study of 3D genome organization.

Results

We built a comprehensive graphical tool (GenomeFlow) to facilitate the entire process of modeling and analysis of 3D genome organization. This process includes the mapping of Hi-C data to one-dimensional (1D) reference genomes, the generation, normalization and visualization of two-dimensional (2D) chromosomal contact maps, the reconstruction and the visualization of the 3D models of chromosome and genome, the analysis of 3D models and the integration of these models with functional genomics data. This graphical tool is the first of its kind in reconstructing, storing, analyzing and annotating 3D genome models. It can reconstruct 3D genome models from Hi-C data and visualize them in real-time. This tool also allows users to overlay gene annotation, gene expression data and genome methylation data on top of 3D genome models.

Availability and implementation

The source code and user manual: https://github.com/jianlin-cheng/GenomeFlow.

Supplementary information

Supplementary data are available at Bioinformatics online.

 
more » « less
NSF-PAR ID:
10393638
Author(s) / Creator(s):
; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
Bioinformatics
Volume:
35
Issue:
8
ISSN:
1367-4803
Page Range / eLocation ID:
p. 1416-1418
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Eukaryotic chromosomes are often composed of components organized into multiple scales, such as nucleosomes, chromatin fibers, topologically associated domains (TAD), chromosome compartments, and chromosome territories. Therefore, reconstructing detailed 3D models of chromosomes in high resolution is useful for advancing genome research. However, the task of constructing quality high-resolution 3D models is still challenging with existing methods. Hence, we designed a hierarchical algorithm, called Hierarchical3DGenome, to reconstruct 3D chromosome models at high resolution (<=5 Kilobase (KB)). The algorithm first reconstructs high-resolution 3D models at TAD level. The TAD models are then assembled to form complete high-resolution chromosomal models. The assembly of TAD models is guided by a complete low-resolution chromosome model. The algorithm is successfully used to reconstruct 3D chromosome models at 5 KB resolution for the human B-cell (GM12878). These high-resolution models satisfy Hi-C chromosomal contacts well and are consistent with models built at lower (i.e. 1 MB) resolution, and with the data of fluorescentin situhybridization experiments. The Java source code of Hierarchical3DGenome and its user manual are available herehttps://github.com/BDM-Lab/Hierarchical3DGenome.

     
    more » « less
  2. Abstract

    This study aims to explore whether and how positive and negative supercoiling contribute to the three-dimensional (3D) organization of the bacterial genome. We used recently published Escherichia coli GapR ChIP-seq and TopoI ChIP-seq (also called EcTopoI-seq) data, which marks positive and negative supercoiling sites, respectively, to study how supercoiling correlates with the spatial contact maps obtained from chromosome conformation capture sequencing (Hi-C and 5C). We find that supercoiled chromosomal loci have overall higher Hi-C contact frequencies than sites that are not supercoiled. Surprisingly, positive supercoiling corresponds to higher spatial contact than negative supercoiling. Additionally, positive, but not negative, supercoiling could be identified from Hi-C data with high accuracy. We further find that the majority of positive and negative supercoils coincide with highly active transcription units, with a minor group likely associated with replication and other genomic processes. Our results show that both positive and negative supercoiling enhance spatial contact, with positive supercoiling playing a larger role in bringing genomic loci closer in space. Based on our results, we propose new physical models of how the E. coli chromosome is organized by positive and negative supercoils.

     
    more » « less
  3. Abstract Summary

    HiCube is a lightweight web application for interactive visualization and exploration of diverse types of genomics data at multiscale resolutions. Especially, HiCube displays synchronized views of Hi-C contact maps and 3D genome structures with user-friendly annotation and configuration tools, thereby facilitating the study of 3D genome organization and function.

    Availability and implementation

    HiCube is implemented in Javascript and can be installed via NPM. The source code is freely available at GitHub (https://github.com/wmalab/HiCube).

     
    more » « less
  4. Abstract Motivation

    Cellular Electron CryoTomography (CECT) enables 3D visualization of cellular organization at near-native state and in sub-molecular resolution, making it a powerful tool for analyzing structures of macromolecular complexes and their spatial organizations inside single cells. However, high degree of structural complexity together with practical imaging limitations makes the systematic de novo discovery of structures within cells challenging. It would likely require averaging and classifying millions of subtomograms potentially containing hundreds of highly heterogeneous structural classes. Although it is no longer difficult to acquire CECT data containing such amount of subtomograms due to advances in data acquisition automation, existing computational approaches have very limited scalability or discrimination ability, making them incapable of processing such amount of data.

    Results

    To complement existing approaches, in this article we propose a new approach for subdividing subtomograms into smaller but relatively homogeneous subsets. The structures in these subsets can then be separately recovered using existing computation intensive methods. Our approach is based on supervised structural feature extraction using deep learning, in combination with unsupervised clustering and reference-free classification. Our experiments show that, compared with existing unsupervised rotation invariant feature and pose-normalization based approaches, our new approach achieves significant improvements in both discrimination ability and scalability. More importantly, our new approach is able to discover new structural classes and recover structures that do not exist in training data.

    Availability and Implementation

    Source code freely available at http://www.cs.cmu.edu/∼mxu1/software.

    Supplementary information

    Supplementary data are available at Bioinformatics online.

     
    more » « less
  5. Abstract We use data-driven physical simulations to study the three-dimensional architecture of the Aedes aegypti genome. Hi-C maps exhibit both a broad diagonal and compartmentalization with telomeres and centromeres clustering together. Physical modeling reveals that these observations correspond to an ensemble of 3D chromosomal structures that are folded over and partially condensed. Clustering of the centromeres and telomeres near the nuclear lamina appears to be a necessary condition for the formation of the observed structures. Further analysis of the mechanical properties of the genome reveals that the chromosomes of Aedes aegypti , by virtue of their atypical structural organization, are highly sensitive to the deformation of the nuclei. This last finding provides a possible physical mechanism linking mechanical cues to gene regulation. 
    more » « less