skip to main content


The NSF Public Access Repository (NSF-PAR) system and access will be unavailable from 5:00 PM ET until 11:00 PM ET on Friday, June 21 due to maintenance. We apologize for the inconvenience.

Title: Generating Chromosome Geometries in a Minimal Cell From Cryo-Electron Tomograms and Chromosome Conformation Capture Maps
JCVI-syn3A is a genetically minimal bacterial cell, consisting of 493 genes and only a single 543 kbp circular chromosome. Syn3A’s genome and physical size are approximately one-tenth those of the model bacterial organism Escherichia coli ’s, and the corresponding reduction in complexity and scale provides a unique opportunity for whole-cell modeling. Previous work established genome-scale gene essentiality and proteomics data along with its essential metabolic network and a kinetic model of genetic information processing. In addition to that information, whole-cell, spatially-resolved kinetic models require cellular architecture, including spatial distributions of ribosomes and the circular chromosome’s configuration. We reconstruct cellular architectures of Syn3A cells at the single-cell level directly from cryo-electron tomograms, including the ribosome distributions. We present a method of generating self-avoiding circular chromosome configurations in a lattice model with a resolution of 11.8 bp per monomer on a 4 nm cubic lattice. Realizations of the chromosome configurations are constrained by the ribosomes and geometry reconstructed from the tomograms and include DNA loops suggested by experimental chromosome conformation capture (3C) maps. Using ensembles of simulated chromosome configurations we predict chromosome contact maps for Syn3A cells at resolutions of 250 bp and greater and compare them to the experimental maps. Additionally, the spatial distributions of ribosomes and the DNA-crowding resulting from the individual chromosome configurations can be used to identify macromolecular structures formed from ribosomes and DNA, such as polysomes and expressomes.  more » « less
Award ID(s):
1818344 1840320 1430124 1840301 1920374
Author(s) / Creator(s):
; ; ; ; ; ; ;
Date Published:
Journal Name:
Frontiers in Molecular Biosciences
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Computational models of cells cannot be considered complete unless they include the most fundamental process of life, the replication and inheritance of genetic material. By creating a computational framework to model systems of replicating bacterial chromosomes as polymers at 10 bp resolution with Brownian dynamics, we investigate changes in chromosome organization during replication and extend the applicability of an existing whole-cell model (WCM) for a genetically minimal bacterium, JCVI-syn3A, to the entire cell-cycle. To achieve cell-scale chromosome structures that are realistic, we model the chromosome as a self-avoiding homopolymer with bending and torsional stiffnesses that capture the essential mechanical properties of dsDNA in Syn3A. In addition, the conformations of the circular DNA must avoid overlapping with ribosomes identitied in cryo-electron tomograms. While Syn3A lacks the complex regulatory systems known to orchestrate chromosome segregation in other bacteria, its minimized genome retains essential loop-extruding structural maintenance of chromosomes (SMC) protein complexes (SMC-scpAB) and topoisomerases. Through implementing the effects of these proteins in our simulations of replicating chromosomes, we find that they alone are sufficient for simultaneous chromosome segregation across all generations within nested theta structures. This supports previous studies suggesting loop-extrusion serves as a near-universal mechanism for chromosome organization within bacterial and eukaryotic cells. Furthermore, we analyze ribosome diffusion under the influence of the chromosome and calculatein silicochromosome contact maps that capture inter-daughter interactions. Finally, we present a methodology to map the polymer model of the chromosome to a Martini coarse-grained representation to prepare molecular dynamics models of entire Syn3A cells, which serves as an ultimate means of validation for cell states predicted by the WCM.

    more » « less

    Ribosomes—the primary macromolecular machines responsible for translating the genetic code into proteins—are complexes of precisely folded RNA and proteins. The ways in which their production and assembly are managed by the living cell is of deep biological importance. Here we extend a recent spatially resolved whole‐cell model of ribosome biogenesis in a fixed volume [Earnest et al., Biophys J 2015, 109, 1117–1135] to include the effects of growth, DNA replication, and cell division. All biological processes are described in terms of reaction‐diffusion master equations and solved stochastically using the Lattice Microbes simulation software. In order to determine the replication parameters, we construct and analyze a series ofEscherichia colistrains with fluorescently labeled genes distributed evenly throughout their chromosomes. By measuring these cells’ lengths and number of gene copies at the single‐cell level, we could fit a statistical model of the initiation and duration of chromosome replication. We found that for our slow‐growing (120 min doubling time)E. colicells, replication was initiated 42 min into the cell cycle and completed after an additional 42 min. While simulations of the biogenesis model produce the correct ribosome and mRNA counts over the cell cycle, the kinetic parameters for transcription and degradation are lower than anticipated from a recent analytical time dependent model of in vivo mRNA production. Describing expression in terms of a simple chemical master equation, we show that the discrepancies are due to the lack of nonribosomal genes in the extended biogenesis model which effects the competition of mRNA for ribosome binding, and suggest corrections to parameters to be used in the whole‐cell model when modeling expression of the entire transcriptome. © 2016 Wiley Periodicals, Inc. Biopolymers 105: 735–751, 2016.

    more » « less
  3. Abstract Motivation

    Current technologies for single-cell DNA sequencing require whole-genome amplification (WGA), as a single cell contains too little DNA for direct sequencing. Unfortunately, WGA introduces biases in the resulting sequencing data, including non-uniformity in genome coverage and high rates of allele dropout. These biases complicate many downstream analyses, including the detection of genomic variants.


    We show that amplification biases have a potential upside: long-range correlations in rates of allele dropout provide a signal for phasing haplotypes at the lengths of amplicons from WGA, lengths which are generally longer than than individual sequence reads. We describe a statistical test to measure concurrent allele dropout between single-nucleotide polymorphisms (SNPs) across multiple sequenced single cells. We use results of this test to perform haplotype assembly across a collection of single cells. We demonstrate that the algorithm predicts phasing between pairs of SNPs with higher accuracy than phasing from reads alone. Using whole-genome sequencing data from only seven neural cells, we obtain haplotype blocks that are orders of magnitude longer than with sequence reads alone (median length 10.2 kb versus 312 bp), with error rates <2%. We demonstrate similar advantages on whole-exome data from 16 cells, where we obtain haplotype blocks with median length 9.2 kb—comparable to typical gene lengths—compared with median lengths of 41 bp with sequence reads alone, with error rates <4%. Our algorithm will be useful for haplotyping of rare alleles and studies of allele-specific somatic aberrations.

    Availability and implementation

    Source code is available at

    Supplementary information

    Supplementary data are available at Bioinformatics online.

    more » « less
  4. Abstract Background

    The maize inbred line A188 is an attractive model for elucidation of gene function and improvement due to its high embryogenic capacity and many contrasting traits to the first maize reference genome, B73, and other elite lines. The lack of a genome assembly of A188 limits its use as a model for functional studies.


    Here, we present a chromosome-level genome assembly of A188 using long reads and optical maps. Comparison of A188 with B73 using both whole-genome alignments and read depths from sequencing reads identify approximately 1.1 Gb of syntenic sequences as well as extensive structural variation, including a 1.8-Mb duplication containing the Gametophyte factor1 locus for unilateral cross-incompatibility, and six inversions of 0.7 Mb or greater. Increased copy number of carotenoid cleavage dioxygenase 1 (ccd1) in A188 is associated with elevated expression during seed development. Highccd1expression in seeds together with low expression of yellow endosperm 1 (y1) reduces carotenoid accumulation, accounting for the white seed phenotype of A188. Furthermore, transcriptome and epigenome analyses reveal enhanced expression of defense pathways and altered DNA methylation patterns of the embryonic callus.


    The A188 genome assembly provides a high-resolution sequence for a complex genome species and a foundational resource for analyses of genome variation and gene function in maize. The genome, in comparison to B73, contains extensive intra-species structural variations and other genetic differences. Expression and network analyses identify discrete profiles for embryonic callus and other tissues.

    more » « less
  5. Abstract

    Integrative and conjugative elements (ICEs) are mobile genetic elements that can transfer by conjugation to recipient cells. Some ICEs integrate into a unique site in the genome of their hosts. We studied quantitatively the process by which an ICE searches for its unique integration site in the Bacillus subtilis chromosome. We followed the motion of both ICEBs1 and the chromosomal integration site in real time within individual cells. ICEBs1 exhibited a wide spectrum of dynamical behaviors, ranging from rapid sub-diffusive displacements crisscrossing the cell, to kinetically trapped states. The chromosomal integration site moved sub-diffusively and exhibited pronounced dynamical asymmetry between longitudinal and transversal motions, highlighting the role of chromosomal structure and the heterogeneity of the bacterial interior in the search. The successful search for and subsequent recombination into the integration site is a key step in the acquisition of integrating mobile genetic elements. Our findings provide new insights into intracellular transport processes involving large DNA molecules.

    more » « less