Title: Multiple waves of viral invasions in Symbiodiniaceae algal genomes
Abstract Dinoflagellates from the family Symbiodiniaceae are phototrophic marine protists that engage in symbiosis with diverse hosts. Their large and distinct genomes are characterized by pervasive gene duplication and large-scale retroposition events. However, little is known about the role and scale of horizontal gene transfer (HGT) in the evolution of this algal family. In other dinoflagellates, high levels of HGTs have been observed, linked to major genomic transitions, such as the appearance of a viral-acquired nucleoprotein that originated via HGT from a large DNA algal virus. Previous work showed that Symbiodiniaceae from different hosts are actively infected by viral groups, such as giant DNA viruses and ssRNA viruses, that may play an important role in coral health. Latent viral infections may also occur, whereby viruses could persist in the cytoplasm or integrate into the host genome as a provirus. This hypothesis received experimental support; however, the cellular localization of putative latent viruses and their taxonomic affiliation are still unknown. In addition, despite the finding of viral sequences in some genomes of Symbiodiniaceae, viral origin, taxonomic breadth, and metabolic potential have not been explored. To address these questions, we searched for putative viral-derived proteins in thirteen Symbiodiniaceae genomes. We found fifty-nine candidate viral-derived HGTs that gave rise to twelve phylogenies across ten genomes. We also describe the taxonomic affiliation of these virus-related sequences, their structure, and their genomic context. These results lead us to propose a model to explain the origin and fate of Symbiodiniaceae viral acquisitions.
Award ID(s):
1756616
PAR ID:
10427333
Journal Name:
Virus Evolution
Volume:
8
Issue:
2
ISSN:
2057-1577
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Endogenous viral elements (EVEs) offer insight into the evolutionary histories and hosts of contemporary viruses. This study leveraged DNA metagenomics and genomics to detect and infer the host of a non-retroviral dinoflagellate-infecting +ssRNA virus (dinoRNAV) common in coral reefs. As part of the Tara Pacific Expedition, this study surveyed 269 newly sequenced cnidarians and their resident symbiotic dinoflagellates (Symbiodiniaceae), associated metabarcodes, and publicly available metagenomes, revealing 178 dinoRNAV EVEs, predominantly among hydrocoral-dinoflagellate metagenomes. Putative associations between Symbiodiniaceae and dinoRNAV EVEs were corroborated by the characterization of dinoRNAV-like sequences in 17 of 18 scaffold-scale and one chromosome-scale dinoflagellate genome assembly, flanked by characteristically cellular sequences and in proximity to retroelements, suggesting potential mechanisms of integration. EVEs were not detected in dinoflagellate-free (aposymbiotic) cnidarian genome assemblies, including stony corals, hydrocorals, jellyfish, or seawater. The pervasive nature of dinoRNAV EVEs within dinoflagellate genomes (especially Symbiodinium), as well as their inconsistent within-genome distribution and fragmented nature, suggest ancestral or recurrent integration of this virus with variable conservation. Broadly, these findings illustrate how +ssRNA viruses may obscure their genomes as members of nested symbioses, with implications for host evolution, exaptation, and immunity in the context of reef health and disease.
  2. Dinoflagellates of the family Symbiodiniaceae are crucial photosymbionts in corals and other marine organisms. Of these, Cladocopium goreaui is one of the most dominant symbiont species in the Indo-Pacific. Here, we present an improved genome assembly of C. goreaui combining new long-read sequence data with previously generated short-read data. Incorporating new full-length transcripts to guide gene prediction, the C. goreaui genome (1.2 Gb) exhibits a high extent of completeness (82.4% based on BUSCO protein recovery) and better resolution of repetitive sequence regions; 45,322 gene models were predicted, and 327 putative, topologically associated domains of the chromosomes were identified. Comparison with other Symbiodiniaceae genomes revealed a prevalence of repeats and duplicated genes in C. goreaui, and lineage-specific genes indicating functional innovation. Incorporating 2,841,408 protein sequences from 96 taxonomically diverse eukaryotes and representative prokaryotes in a phylogenomic approach, we assessed the evolutionary history of C. goreaui genes. Of the 5246 phylogenetic trees inferred from homologous protein sets containing two or more phyla, 35–36% have putatively originated via horizontal gene transfer (HGT), predominantly (19–23%) via an ancestral Archaeplastida lineage implicated in the endosymbiotic origin of plastids: 10–11% are of green algal origin, including genes encoding photosynthetic functions. Our results demonstrate the utility of long-read sequence data in resolving structural features of a dinoflagellate genome, and highlight how genetic transfer has shaped genome evolution of a facultative symbiont, and more broadly of dinoflagellates. 
  3. Abstract Viruses of the phylum Nucleocytoviricota, often referred to as “giant viruses,” are prevalent in various environments around the globe and play significant roles in shaping eukaryotic diversity and activities in global ecosystems. Given the extensive phylogenetic diversity within this viral group and the highly complex composition of their genomes, taxonomic classification of giant viruses, particularly incomplete metagenome-assembled genomes (MAGs), can present a considerable challenge. Here we developed TIGTOG (Taxonomic Information of Giant viruses using Trademark Orthologous Groups), a machine learning-based approach to predict the taxonomic classification of novel giant virus MAGs based on profiles of protein family content. We applied a random forest algorithm to a training set of 1531 quality-checked, phylogenetically diverse Nucleocytoviricota genomes using pre-selected sets of giant virus orthologous groups (GVOGs). The classification models were predictive of viral taxonomic assignments with a cross-validation accuracy of 99.6% at the order level and 97.3% at the family level. We found that no individual GVOGs or genome features significantly influenced the algorithm’s performance or the models’ predictions, indicating that classification predictions were based on a comprehensive genomic signature, which reduced the necessity of a fixed set of marker genes for taxonomic assigning purposes. Our classification models were validated with an independent test set of 823 giant virus genomes with varied genomic completeness and taxonomy and demonstrated an accuracy of 98.6% and 95.9% at the order and family level, respectively. Our results indicate that protein family profiles can be used to accurately classify large DNA viruses at different taxonomic levels and provide a fast and accurate method for the classification of giant viruses. This approach could easily be adapted to other viral groups.
  4. Some viruses have genes encoding proteins with membrane transport functions. It is unknown whether these types of proteins are rare or common in viruses. In particular, the evolutionary origin of some of the viral genes is obscure, whereas other viral proteins have homologs in prokaryotic and eukaryotic organisms. We searched virus genomes in databases looking for transmembrane proteins with possible transport function. This effort led to the detection of 18 different types of putative membrane transport proteins, indicating that they are not a rarity in viral genomes. The most abundant proteins are K+ channels. Their predicted structures vary between different viruses. With a few exceptions, the viral proteins differed significantly from homologs in their current hosts. In some cases the data provide evidence for a recent gene transfer between host and virus, but in other cases the evidence indicates a more complex evolutionary history.
  5. The advancement of high throughput sequencing has greatly facilitated the exploration of viruses that infect marine hosts. For example, a number of putative virus genomes belonging to the Totiviridae family have been described in crustacean hosts. However, there has been no characterization of the most newly discovered putative viruses beyond description of their genomes. In this study, two novel double-stranded RNA (dsRNA) virus genomes were discovered in the Atlantic blue crab (Callinectes sapidus) and further investigated. Sequencing of both virus genomes revealed that they each encode RNA-dependent RNA polymerase proteins (RdRps) with similarities to toti-like viruses. The viruses were tentatively named Callinectes sapidus toti-like virus 1 (CsTLV1) and Callinectes sapidus toti-like virus 2 (CsTLV2). Both genomes have typical elements required for −1 ribosomal frameshifting, which may induce the expression of an encoded ORF1–ORF2 (gag-pol) fusion protein. Phylogenetic analyses of CsTLV1 and CsTLV2 RdRp amino acid sequences suggested that they are members of two new genera in the family Totiviridae. The CsTLV1 and CsTLV2 genomes were detected in muscle, gill, and hepatopancreas of blue crabs by real-time reverse transcription quantitative PCR (RT-qPCR). The presence of ~40 nm totivirus-like viral particles in all three tissues was verified by transmission electron microscopy, and pathology associated with CsTLV1 and CsTLV2 infections was observed by histology. PCR assays showed the prevalence and geographic range of these viruses to be restricted to the northeast United States sites sampled. The two virus genomes co-occurred in almost all cases, with the CsTLV2 genome being found on its own in 8.5% of cases, and the CsTLV1 genome not yet found on its own. To our knowledge, this is the first report of toti-like viruses in C. sapidus. The information reported here provides the knowledge and tools to investigate transmission and potential pathogenicity of these viruses.