Title: Alignment-Free Analysis of Whole-Genome Sequences From Symbiodiniaceae Reveals Different Phylogenetic Signals in Distinct Regions
Dinoflagellates of the family Symbiodiniaceae are predominantly essential symbionts of corals and other marine organisms. Recent research reveals extensive genome sequence divergence among Symbiodiniaceae taxa and high phylogenetic diversity hidden behind subtly different cell morphologies. Using an alignment-free phylogenetic approach based on sub-sequences of fixed length k (i.e. k-mers), we assessed the phylogenetic signal among whole-genome sequences from 16 Symbiodiniaceae taxa (including the genera of Symbiodinium, Breviolum, Cladocopium, Durusdinium and Fugacium) and two strains of Polarella glacialis as outgroup. Based on phylogenetic trees inferred from k-mers in distinct genomic regions (i.e. repeat-masked genome sequences, protein-coding sequences, introns and repeats) and in protein sequences, the phylogenetic signal associated with protein-coding DNA and the encoded amino acids is largely consistent with the Symbiodiniaceae phylogeny based on established markers, such as large subunit rRNA. The other genome sequences (introns and repeats) exhibit distinct phylogenetic signals, supporting the expected differential evolutionary pressure acting on these regions. Our analysis of conserved core k-mers revealed the prevalence of conserved k-mers (>95% core 23-mers among all 18 genomes) in annotated repeats and non-genic regions of the genomes. We observed 180 distinct repeat types that are significantly enriched in genomes of the symbiotic versus free-living Symbiodinium taxa, suggesting an enhanced activity of transposable elements linked to the symbiotic lifestyle. We provide evidence that representation of alignment-free phylogenies as dynamic networks enhances the ability to generate new hypotheses about genome evolution in Symbiodiniaceae. These results demonstrate the potential of alignment-free phylogenetic methods as a scalable approach for inferring comprehensive, unbiased whole-genome phylogenies of dinoflagellates and more broadly of microbial eukaryotes.
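The k-mer idea in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract does not state which distance measure was used, so Jaccard distance between k-mer sets is swapped in here as a simple stand-in, and the function names (`kmers`, `jaccard_distance`, `core_kmers`) and toy sequences are hypothetical. Reverse complements are not collapsed into canonical k-mers, which a real analysis of double-stranded DNA would likely need.

```python
from itertools import combinations


def kmers(seq, k):
    """Return the set of length-k substrings of seq, skipping any that contain N."""
    seq = seq.upper()
    return {
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if "N" not in seq[i:i + k]
    }


def jaccard_distance(a, b):
    """1 - |a & b| / |a | b|; an assumed stand-in for the paper's distance measure."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def core_kmers(kmer_sets):
    """k-mers shared by every genome in the collection ("core" k-mers)."""
    sets = list(kmer_sets)
    return set.intersection(*sets) if sets else set()


if __name__ == "__main__":
    # Toy sequences only; real genomes would be read from FASTA files.
    genomes = {
        "taxon_A": "ACGTACGTGA",
        "taxon_B": "ACGTACGTCA",
        "taxon_C": "TTGTACGAGA",
    }
    k = 4  # the study reports 23-mers for its core analysis; 4 keeps the toy readable
    profiles = {name: kmers(seq, k) for name, seq in genomes.items()}

    # Pairwise distances, which a tree- or network-building step could consume.
    for a, b in combinations(sorted(profiles), 2):
        print(a, b, round(jaccard_distance(profiles[a], profiles[b]), 3))

    print("core k-mers:", sorted(core_kmers(profiles.values())))
```

Because no alignment is computed, the cost grows with the number of k-mers rather than with alignment length, which is the scalability argument the abstract makes for whole-genome comparisons.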
Award ID(s):
1756616
NSF-PAR ID:
10331085
Author(s) / Creator(s):
; ; ; ; ;
Date Published:
Journal Name:
Frontiers in Plant Science
Volume:
13
ISSN:
1664-462X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Background: Dinoflagellates are taxonomically diverse and ecologically important phytoplankton that are ubiquitously present in marine and freshwater environments. Mostly photosynthetic, dinoflagellates provide the basis of aquatic primary production; most taxa are free-living, while some can form symbiotic and parasitic associations with other organisms. However, knowledge of the molecular mechanisms that underpin the adaptation of these organisms to diverse ecological niches is limited by the scarce availability of genomic data, partly due to their large genome sizes estimated up to 250 Gbp. Currently available dinoflagellate genome data are restricted to Symbiodiniaceae (particularly symbionts of reef-building corals) and parasitic lineages, from taxa that have smaller genome size ranges, while genomic information from more diverse free-living species is still lacking. Results: Here, we present two draft diploid genome assemblies of the free-living dinoflagellate Polarella glacialis, isolated from the Arctic and Antarctica. We found that about 68% of the genomes are composed of repetitive sequence, with long terminal repeats likely contributing to intra-species structural divergence and distinct genome sizes (3.0 and 2.7 Gbp). For each genome, guided using full-length transcriptome data, we predicted > 50,000 high-quality protein-coding genes, of which ~40% are in unidirectional gene clusters and ~25% comprise single exons. Multi-genome comparison unveiled genes specific to P. glacialis and a common, putatively bacterial origin of ice-binding domains in cold-adapted dinoflagellates. Conclusions: Our results elucidate how selection acts within the context of a complex genome structure to facilitate local adaptation. Because most dinoflagellate genes are constitutively expressed, Polarella glacialis has enhanced transcriptional responses via unidirectional, tandem duplication of single-exon genes that encode functions critical to survival in cold, low-light polar environments. These genomes provide a foundational reference for future research on dinoflagellate evolution.
  2. Background: Dinoflagellates in the family Symbiodiniaceae are important photosynthetic symbionts in cnidarians (such as corals) and other coral reef organisms. Breakdown of the coral-dinoflagellate symbiosis due to environmental stress (i.e. coral bleaching) can lead to coral death and the potential collapse of reef ecosystems. However, evolution of Symbiodiniaceae genomes, and its implications for the coral, is little understood. Genome sequences of Symbiodiniaceae remain scarce due in part to their large genome sizes (1–5 Gbp) and idiosyncratic genome features. Results: Here, we present de novo genome assemblies of seven members of the genus Symbiodinium, of which two are free-living, one is an opportunistic symbiont, and the remainder are mutualistic symbionts. Integrating other available data, we compare 15 dinoflagellate genomes revealing high sequence and structural divergence. Divergence among some Symbiodinium isolates is comparable to that among distinct genera of Symbiodiniaceae. We also recovered hundreds of gene families specific to each lineage, many of which encode unknown functions. An in-depth comparison between the genomes of the symbiotic Symbiodinium tridacnidorum (isolated from a coral) and the free-living Symbiodinium natans reveals a greater prevalence of transposable elements, genetic duplication, structural rearrangements, and pseudogenisation in the symbiotic species. Conclusions: Our results underscore the potential impact of lifestyle on lineage-specific gene-function innovation, genome divergence, and the diversification of Symbiodinium and Symbiodiniaceae. The divergent features we report, and their putative causes, may also apply to other microbial eukaryotes that have undergone symbiotic phases in their evolutionary history.
  3. Background: Dinoflagellates in the family Symbiodiniaceae are important photosynthetic symbionts in cnidarians (such as corals) and other coral reef organisms. Breakdown of the coral-dinoflagellate symbiosis due to environmental stress (i.e. coral bleaching) can lead to coral death and the potential collapse of reef ecosystems. However, evolution of Symbiodiniaceae genomes, and its implications for the coral, is little understood. Genome sequences of Symbiodiniaceae remain scarce due in part to their large genome sizes (1–5 Gbp) and idiosyncratic genome features. Results: Here, we present de novo genome assemblies of seven members of the genus Symbiodinium, of which two are free-living, one is an opportunistic symbiont, and the remainder are mutualistic symbionts. Integrating other available data, we compare 15 dinoflagellate genomes revealing high sequence and structural divergence. Divergence among some Symbiodinium isolates is comparable to that among distinct genera of Symbiodiniaceae. We also recovered hundreds of gene families specific to each lineage, many of which encode unknown functions. An in-depth comparison between the genomes of the symbiotic Symbiodinium tridacnidorum (isolated from a coral) and the free-living Symbiodinium natans reveals a greater prevalence of transposable elements, genetic duplication, structural rearrangements, and pseudogenisation in the symbiotic species. Conclusions: Our results underscore the potential impact of lifestyle on lineage-specific gene-function innovation, genome divergence, and the diversification of Symbiodinium and Symbiodiniaceae. The divergent features we report, and their putative causes, may also apply to other microbial eukaryotes that have undergone symbiotic phases in their evolutionary history.
  4. Dinoflagellates are a diverse group of phytoplankton, ranging from harmful bloom-forming microalgae to photosymbionts of coral reefs. Genome-scale data from dinoflagellates reveal atypical genomic features, extensive genomic divergence, and lineage-specific innovation of gene functions. Long non-coding RNAs (lncRNAs), known to regulate gene expression in eukaryotes, are largely unexplored in dinoflagellates. Here, using high-quality genome and transcriptome data, we identified 48 039 polyadenylated lncRNAs in three dinoflagellate species: the coral symbionts Cladocopium proliferum and Durusdinium trenchii, and the bloom-forming species, Prorocentrum cordatum. These lncRNAs have fewer introns and lower G+C content than protein-coding sequences; 37 768 (78.6%) are unique with respect to sequence similarity. We classified all lncRNAs based on conserved motifs (k-mers) into distinct clusters, following properties of protein-binding and/or subcellular localisation. Interestingly, 3708 (7.7%) lncRNAs are differentially expressed under heat stress, algal lifestyle, and/or growth phase, and share co-expression patterns with protein-coding genes. Based on inferred triplex interactions between lncRNA and putative promoter regions, we identified 19 460 putative gene targets for 3721 lncRNAs; 907 genes exhibit differential expression under heat stress. These results reveal, for the first time, the diversity of lncRNAs in dinoflagellates and how lncRNAs may regulate gene expression as a heat-stress response in these ecologically important microbes.
  5. In order to develop successful strategies for coral reef preservation, it is critical that the biology of both host corals and symbiotic algae is investigated. In the Ryukyu Archipelago, which encompasses many islands spread over ∼500 km of the Pacific Ocean, four major populations of the coral Acropora digitifera have been studied using whole-genome shotgun (WGS) sequence analysis (Shinzato C, Mungpakdee S, Arakaki N, Satoh N. 2015. Genome-wide single-nucleotide polymorphism (SNP) analysis explains coral diversity and recovery in the Ryukyu Archipelago. Sci Rep. 5:18211.). In contrast, the diversity of the symbiotic dinoflagellates associated with these A. digitifera populations is unknown. It is therefore unclear if these two core components of the coral holobiont share a common evolutionary history. This issue can be addressed for the symbiotic algal populations by studying the organelle genomes of their mitochondria and plastids. Here, we analyzed WGS data from ∼150 adult A. digitifera, and by mapping reads to the available reference genome sequences, we extracted 2,250 sequences representing 15 organelle genes of Symbiodiniaceae. Molecular phylogenetic analyses of these mitochondrial and plastid gene sets revealed that A. digitifera from the southern Yaeyama islands harbor a different Symbiodiniaceae population than the islands of Okinawa and Kerama in the north, indicating that the distribution of symbiont populations partially matches that of the four host populations. Interestingly, we found that numerous SNPs correspond to known RNA-edited sites in 14 of the Symbiodiniaceae organelle genes, with mitochondrial genes showing a stronger correspondence than plastid genes. These results suggest a possible correlation between RNA editing and SNPs in the two organelle genomes of symbiotic dinoflagellates.