Title: Comparative genomics reveals insight into the evolutionary origin of massively scrambled genomes
Ciliates are microbial eukaryotes that undergo extensive programmed genome rearrangement, a natural genome editing process that converts long germline chromosomes into smaller gene-rich somatic chromosomes. Three well-studied ciliates include Oxytricha trifallax, Tetrahymena thermophila and Paramecium tetraurelia, but only the Oxytricha lineage has a massively scrambled genome, whose assembly during development requires hundreds of thousands of precise programmed DNA joining events, representing the most complex genome dynamics of any known organism. Here we study the emergence of such complex genomes by examining the origin and evolution of discontinuous and scrambled genes in the Oxytricha lineage. This study compares six genomes from three species, the germline and somatic genomes for Euplotes woodruffi, Tetmemena sp., and the model ciliate Oxytricha trifallax. To complement existing data, we sequenced, assembled and annotated the germline and somatic genomes of Euplotes woodruffi, which provides an outgroup, and the germline genome of Tetmemena sp. We find that the germline genome of Tetmemena is as massively scrambled and interrupted as Oxytricha's: 13.6% of its gene loci require programmed translocations and/or inversions, with some genes requiring hundreds of precise gene editing events during development. This study revealed that the earlier-diverged spirotrich, E. woodruffi, also has a scrambled genome, but only roughly half as many loci (7.3%) are scrambled. Furthermore, its scrambled genes are less complex, together supporting the position of Euplotes as a possible evolutionary intermediate in this lineage, in the process of accumulating complex evolutionary genome rearrangements, all of which require extensive repair to assemble functional coding regions. Comparative analysis also reveals that scrambled loci are often associated with local duplications, supporting a gradual model for the origin of complex, scrambled genomes via many small events of DNA duplication and decay.
Award ID(s):
1800443 1764366
PAR ID:
10383513
Journal Name:
eLife
Volume:
11
ISSN:
2050-084X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Schmidt, Edward E (Ed.)
    The evolution of lineage-specific gene families remains poorly studied across the eukaryotic tree of life, with most analyses focusing on the recent evolution of de novo genes in model species. Here we explore the origins of lineage-specific genes in ciliates, a ~1 billion year old clade of microeukaryotes that are defined by their division of somatic and germline functions into distinct nuclei. Previous analyses on conserved gene families have shown the effect of ciliates’ unusual genome architecture on gene family evolution: extensive genome processing (the generation of thousands of gene-sized somatic chromosomes from canonical germline chromosomes) is associated with larger and more diverse gene families. To further study the relationship between ciliate genome architecture and gene family evolution, we analyzed lineage-specific gene families from a set of 46 transcriptomes and 12 genomes representing x species from eight ciliate classes. We assess how the evolution of lineage-specific gene families occurs among four groups of ciliates: extensive fragmenters with gene-size somatic chromosomes, non-extensive fragmenters with “large” multi-gene somatic chromosomes, Heterotrichea with highly polyploid somatic genomes and Karyorelictea with ‘paradiploid’ somatic genomes. Our analyses demonstrate that: 1) most lineage-specific gene families are found at shallow taxonomic scales; 2) extensive genome processing (i.e., gene unscrambling) during development likely influences the size and number of young lineage-specific gene families; and 3) the influence of somatic genome architecture on molecular evolution is increasingly apparent in older gene families. Altogether, these data highlight the influences of genome architecture on the evolution of lineage-specific gene families in eukaryotes.
  2. Zufall, Rebecca (Ed.)
    Ciliates are microbial eukaryotes with distinct somatic and germline genomes. Postzygotic development involves extensive remodeling of the germline genome to form somatic chromosomes. Ciliates therefore offer a valuable model for studying the architecture and evolution of programmed genome rearrangements. Current studies usually focus on a few model species, where rearrangement features are annotated by aligning reference germline and somatic genomes. Although many high-quality somatic genomes have been assembled, a high-quality germline genome assembly is difficult to obtain due to its smaller DNA content and abundance of repetitive sequences. To overcome these hurdles, we propose a new pipeline, SIGAR (Split-read Inference of Genome Architecture and Rearrangements), to infer germline genome architecture and rearrangement features without a germline genome assembly, requiring only short DNA sequencing reads. As a proof of principle, 93% of rearrangement junctions identified by SIGAR in the ciliate Oxytricha trifallax were validated by the existing germline assembly. We then applied SIGAR to six diverse ciliate species without germline genome assemblies, including Ichthyophthirius multifiliis, a fish pathogen. Despite the high level of somatic DNA contamination in each sample, SIGAR successfully inferred rearrangement junctions, short eliminated sequences, and potential scrambled genes in each species. This pipeline enables pilot surveys or exploration of DNA rearrangements in species with limited DNA material access, thereby providing new insights into the evolution of chromosome rearrangements.
  3. Ciliates are a model lineage for studies of genome architecture given their unusual genome structures. All ciliates have both somatic macronuclei (MAC) and germline micronuclei (MIC), both of which develop from a zygotic nucleus following sex (i.e., conjugation). Nuclear developmental stages are not well documented among non-model ciliates, including Chilodonella uncinata (class Phyllopharyngea), the focus of our work. Here, we characterize nuclear architecture and genome dynamics in C. uncinata by combining 4′,6-diamidino-2-phenylindole (DAPI) staining and fluorescence in situ hybridization (FISH) techniques with confocal microscopy. We developed a telomere probe for staining, which alongside DAPI allows for the identification of fragmented somatic chromosomes among the total DNA in the nuclei. We quantify both total DNA and telomere-bound signals from more than 250 nuclei sampled from 116 individual cells, and analyze changes in DNA content and nuclear architecture across Chilodonella’s nuclear life cycle. Specifically, we find that MAC developmental stages in the ciliate C. uncinata are different from those reported from other ciliate species. These data provide insights into nuclear dynamics during development and enrich our understanding of genome evolution in non-model ciliates. IMPORTANCE: Ciliates are a clade of diverse single-celled eukaryotic microorganisms that contain at least one somatic macronucleus (MAC) and germline micronucleus (MIC) within each cell/organism. Ciliates rely on complex genome rearrangements to generate somatic genomes from a zygotic nucleus. However, the development of somatic nuclei has only been documented for a few model ciliate genera, including Paramecium, Tetrahymena, and Oxytricha. Here, we study the MAC developmental process in the non-model ciliate C. uncinata. We analyze both total DNA and the generation of gene-sized somatic chromosomes using a laser scanning confocal microscope to describe C. uncinata’s nuclear life cycle. We show that DNA content changes dramatically during its life cycle and in a manner that differs from previous studies on model ciliates. Our study expands knowledge of genome dynamics in ciliates and among eukaryotes more broadly.
  4. Hughes, T (Ed.)
    The germline-soma divide is a fundamental distinction in developmental biology, and different genes are expressed in germline and somatic cells throughout metazoan life cycles. Ciliates, a group of microbial eukaryotes, exhibit germline-somatic nuclear dimorphism within a single cell with two different genomes. The ciliate Oxytricha trifallax undergoes massive RNA-guided DNA elimination and genome rearrangement to produce a new somatic macronucleus (MAC) from a copy of the germline micronucleus (MIC). This process eliminates noncoding DNA sequences that interrupt genes and also deletes hundreds of germline-limited open reading frames (ORFs) that are transcribed during genome rearrangement. Here, we update the set of transcribed germline-limited ORFs (TGLOs) in O. trifallax. We show that TGLOs tend to be expressed during nuclear development and then are absent from the somatic MAC. We also demonstrate that exposure to synthetic RNA can reprogram TGLO retention in the somatic MAC and that TGLO retention leads to transcription outside the normal developmental program. These data suggest that TGLOs represent a group of developmentally regulated protein-coding sequences whose gene expression is terminated by DNA elimination.
  5. During early development, sea lamprey embryos undergo programmatic elimination of DNA from somatic progenitor cells in a process termed programmed genome rearrangement (PGR). Eliminated DNA eventually becomes condensed into micronuclei, which are then physically degraded and permanently lost from the cell. Previous studies indicated that many of the genes eliminated during PGR have mammalian homologs that are bound by polycomb repressive complex (PRC) in embryonic stem cells. To test whether PRC components play a role in the faithful elimination of germline‐specific sequences, we used a combination of CRISPR/Cas9 and lightsheet microscopy to investigate the impact of gene knockouts on early development and the progression through stages of DNA elimination. Analysis of knockout embryos for the core PRC2 subunits EZH, SUZ12, and EED shows that disruption of all three genes results in an increase in micronucleus number, altered distribution of micronuclei within embryos, and an increase in micronucleus volume in mutant embryos. While the upstream events of DNA elimination are not strongly impacted by loss of PRC2 components, this study suggests that PRC2 plays a role in the later stages of elimination related to micronucleus condensation and degradation. These findings also suggest that other genes/epigenetic pathways may work in parallel during DNA elimination to mediate chromatin structure, accessibility, and the ultimate loss of germline‐specific DNA.