skip to main content


Title: Functional Genomic Analyses Reveal an Open Pan-genome for the Chloroviruses and a Potential for Genetic Innovation in New Isolates
ABSTRACT Chloroviruses (family Phycodnaviridae ) are large double-stranded DNA (dsDNA) viruses that infect unicellular green algae present in inland waters. These viruses have been isolated using three main chlorella-like green algal host cells, traditionally called NC64A, SAG, and Pbi, revealing extensive genetic diversity. In this study, we performed a functional genomic analysis on 36 chloroviruses that infected the three different hosts. Phylogenetic reconstruction based on the DNA polymerase B family gene clustered the chloroviruses into three distinct clades. The viral pan-genome consists of 1,345 clusters of orthologous groups of genes (COGs), with 126 COGs conserved in all viruses. Totals of 368, 268, and 265 COGs are found exclusively in viruses that infect NC64A, SAG, and Pbi algal hosts, respectively. Two-thirds of the COGs have no known function, constituting the “dark pan-genome” of chloroviruses, and further studies focusing on these genes may identify important novelties. The proportions of functionally characterized COGs composing the pan-genome and the core-genome are similar, but those related to transcription and RNA processing, protein metabolism, and virion morphogenesis are at least 4-fold more represented in the core genome. Bipartite network construction evidencing the COG sharing among host-specific viruses identified 270 COGs shared by at least one virus from each of the different host groups. Finally, our results reveal an open pan-genome for chloroviruses and a well-established core genome, indicating that the isolation of new chloroviruses can be a valuable source of genetic discovery. IMPORTANCE Chloroviruses are large dsDNA viruses that infect unicellular green algae distributed worldwide in freshwater environments. They comprise a genetically diverse group of viruses; however, a comprehensive investigation of the genomic evolution of these viruses is still missing. Here, we performed a functional pan-genome analysis comprising 36 chloroviruses associated with three different algal hosts in the family Chlorellaceae , referred to as zoochlorellae because of their endosymbiotic lifestyle. We identified a set of 126 highly conserved genes, most of which are related to essential functions in the viral replicative cycle. Several genes are unique to distinct isolates, resulting in an open pan-genome for chloroviruses. This profile is associated with generalist organisms, and new insights into the evolution and ecology of chloroviruses are presented. Ultimately, our results highlight the potential for genetic diversity in new isolates.  more » « less
Award ID(s):
1736030
NSF-PAR ID:
10433051
Author(s) / Creator(s):
; ; ; ;
Editor(s):
Parrish, Colin R.
Date Published:
Journal Name:
Journal of Virology
Volume:
96
Issue:
2
ISSN:
0022-538X
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    Chloroviruses are large, plaque-forming, dsDNA viruses that infect chlorella-like green algae that live in a symbiotic relationship with protists. Chloroviruses have genomes from 290 to 370 kb, and they encode as many as 400 proteins. One interesting feature of chloroviruses is that they encode a potassium ion (K+) channel protein named Kcv. The Kcv protein encoded by SAG chlorovirus ATCV-1 is one of the smallest known functional K+ channel proteins consisting of 82 amino acids. The KcvATCV-1 protein has similarities to the family of two transmembrane domain K+ channel proteins; it consists of two transmembrane α-helixes with a pore region in the middle, making it an ideal model for studying K+ channels. To assess their genetic diversity, kcv genes were sequenced from 103 geographically distinct SAG chlorovirus isolates. Of the 103 kcv genes, there were 42 unique DNA sequences that translated into 26 new Kcv channels. The new predicted Kcv proteins differed from KcvATCV-1 by 1 to 55 amino acids. The most conserved region of the Kcv protein was the filter, the turret and the pore helix were fairly well conserved, and the outer and the inner transmembrane domains of the protein were the most variable. Two of the new predicted channels were shown to be functional K+ channels. 
    more » « less
  2. Chloroviruses are large dsDNA, plaque-forming viruses that infect certain chlorella-like green algae; the algae are normally mutualistic endosymbionts of protists and metazoans and are often referred to as zoochlorellae. The viruses are ubiquitous in inland aqueous environments throughout the world and occasionally single types reach titers of thousands of plaque-forming units per ml of native water. The viruses are icosahedral in shape with a spike structure located at one of the vertices. They contain an internal membrane that is required for infectivity. The viral genomes are 290 to 370 kb in size, which encode up to 16 tRNAs and 330 to ~415 proteins, including many not previously seen in viruses. Examples include genes encoding DNA restriction and modification enzymes, hyaluronan and chitin biosynthetic enzymes, polyamine biosynthetic enzymes, ion channel and transport proteins, and enzymes involved in the glycan synthesis of the virus major capsid glycoproteins. The proteins encoded by many of these viruses are often the smallest or among the smallest proteins of their class. Consequently, some of the viral proteins are the subject of intensive biochemical and structural investigation. 
    more » « less
  3. Many chloroviruses replicate in Chlorella variabilis algal strains that are ex-endosymbionts isolated from the protozoan Paramecium bursaria, including the NC64A and Syngen 2-3 strains. We noticed that indigenous water samples produced a higher number of plaque-forming viruses on C. variabilis Syngen 2-3 lawns than on C. variabilis NC64A lawns. These observed differences led to the discovery of viruses that replicate exclusively in Syngen 2-3 cells, named Only Syngen (OSy) viruses. Here, we demonstrate that OSy viruses initiate infection in the restricted host NC64A by synthesizing some early virus gene products and that approximately 20% of the cells produce a small number of empty virus capsids. However, the infected cells did not produce infectious viruses because the cells were unable to replicate the viral genome. This is interesting because all previous attempts to isolate host cells resistant to chlorovirus infection were due to changes in the host receptor for the virus. 
    more » « less
  4. null (Ed.)
    Viruses rely on their host’s translation machinery for the synthesis of their own proteins. Problems belie viral translation when the host has a codon usage bias (CUB) that is different from an infecting virus due to differences in the GC content between the host and virus genomes. Here, we examine the hypothesis that chloroviruses adapted to host CUB by acquisition and selection of tRNAs that at least partially favor their own CUB. The genomes of 41 chloroviruses comprising three clades, each infecting a different algal host, have been sequenced, assembled and annotated. All 41 viruses not only encode tRNAs, but their tRNA genes are located in clusters. While differences were observed between clades and even within clades, seven tRNA genes were common to all three clades of chloroviruses, including the tRNAArg gene, which was found in all 41 chloroviruses. By comparing the codon usage of one chlorovirus algal host, in which the genome has been sequenced and annotated (67% GC content), to that of two of its viruses (40% GC content), we found that the viruses were able to at least partially overcome the host’s CUB by encoding tRNAs that recognize AU-rich codons. Evidence presented herein supports the hypothesis that a chlorovirus tRNA cluster was present in the most recent common ancestor (MRCA) prior to divergence into three clades. In addition, the MRCA encoded a putative isoleucine lysidine synthase (TilS) that remains in 39/41 chloroviruses examined herein, suggesting a strong evolutionary pressure to retain the gene. TilS alters the anticodon of tRNAMet that normally recognizes AUG to then recognize AUA, a codon for isoleucine. This is advantageous to the chloroviruses because the AUA codon is 12–13 times more common in the chloroviruses than their host, further helping the chloroviruses to overcome CUB. Among large DNA viruses infecting eukaryotes, the presence of tRNA genes and tRNA clusters appear to be most common in the Phycodnaviridae and, to a lesser extent, in the Mimiviridae. 
    more » « less
  5. null (Ed.)
    Paramecium bursaria chlorella virus-1 (PBCV-1) is a large double-stranded DNA (dsDNA) virus that infects the unicellular green alga Chlorella variabilis NC64A. Unlike many other viruses, PBCV-1 encodes most, if not all, of the enzymes involved in the synthesis of the glycans attached to its major capsid protein. Importantly, these glycans differ from those reported from the three domains of life in terms of structure and asparagine location in the sequon of the protein. Previous data collected from 20 PBCV-1 spontaneous mutants (or antigenic variants) suggested that the a064r gene encodes a glycosyltransferase (GT) with three domains, each with a different function. Here, we demonstrate that: domain 1 is a β- l -rhamnosyltransferase; domain 2 is an α- l -rhamnosyltransferase resembling only bacterial proteins of unknown function, and domain 3 is a methyltransferase that methylates the C-2 hydroxyl group of the terminal α- l -rhamnose (Rha) unit. We also establish that methylation of the C-3 hydroxyl group of the terminal α- l -Rha is achieved by another virus-encoded protein A061L, which requires an O-2 methylated substrate. This study, thus, identifies two of the glycosyltransferase activities involved in the synthesis of the N -glycan of the viral major capsid protein in PBCV-1 and establishes that a single protein A064R possesses the three activities needed to synthetize the 2-OMe-α- l -Rha-(1→2)-β- l -Rha fragment. Remarkably, this fragment can be attached to any xylose unit. 
    more » « less