skip to main content


Title: Diversity of tRNA Clusters in the Chloroviruses
Viruses rely on their host’s translation machinery for the synthesis of their own proteins. Problems belie viral translation when the host has a codon usage bias (CUB) that is different from an infecting virus due to differences in the GC content between the host and virus genomes. Here, we examine the hypothesis that chloroviruses adapted to host CUB by acquisition and selection of tRNAs that at least partially favor their own CUB. The genomes of 41 chloroviruses comprising three clades, each infecting a different algal host, have been sequenced, assembled and annotated. All 41 viruses not only encode tRNAs, but their tRNA genes are located in clusters. While differences were observed between clades and even within clades, seven tRNA genes were common to all three clades of chloroviruses, including the tRNAArg gene, which was found in all 41 chloroviruses. By comparing the codon usage of one chlorovirus algal host, in which the genome has been sequenced and annotated (67% GC content), to that of two of its viruses (40% GC content), we found that the viruses were able to at least partially overcome the host’s CUB by encoding tRNAs that recognize AU-rich codons. Evidence presented herein supports the hypothesis that a chlorovirus tRNA cluster was present in the most recent common ancestor (MRCA) prior to divergence into three clades. In addition, the MRCA encoded a putative isoleucine lysidine synthase (TilS) that remains in 39/41 chloroviruses examined herein, suggesting a strong evolutionary pressure to retain the gene. TilS alters the anticodon of tRNAMet that normally recognizes AUG to then recognize AUA, a codon for isoleucine. This is advantageous to the chloroviruses because the AUA codon is 12–13 times more common in the chloroviruses than their host, further helping the chloroviruses to overcome CUB. Among large DNA viruses infecting eukaryotes, the presence of tRNA genes and tRNA clusters appear to be most common in the Phycodnaviridae and, to a lesser extent, in the Mimiviridae.  more » « less
Award ID(s):
1736030
NSF-PAR ID:
10216978
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Viruses
Volume:
12
Issue:
10
ISSN:
1999-4915
Page Range / eLocation ID:
1173
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    Paramecium bursaria chlorella virus MA-1D is a chlorovirus that infects Chlorella variabilis strain NC64A, a symbiont of the protozoan Paramecium bursaria. MA-1D has a 339-kb genome encoding ca. 366 proteins and 11 tRNAs. Like other chloroviruses, its major capsid protein (MCP) is decorated with N-glycans, whose structures have been solved in this work by using nuclear magnetic spectroscopy and matrix-assisted laser desorption ionization-time of flight mass spectrometry along with MS/MS experiments. This analysis identified three N-linked oligosaccharides that differ in the nonstoichiometric presence of three monosaccharides, with the largest oligosaccharide composed of eight residues organized in a highly branched fashion. The N-glycans described here share several features with those of the other chloroviruses except that they lack a distal xylose unit that was believed to be part of a conserved core region for all the chloroviruses. Examination of the MA-1D genome detected a gene with strong homology to the putative xylosyltransferase in the reference chlorovirus PBCV-1 and in virus NY-2A, albeit mutated with a premature stop codon. This discovery means that we need to reconsider the essential features of the common core glycan region in the chloroviruses.

     
    more » « less
  2. Parrish, Colin R. (Ed.)
    ABSTRACT Chloroviruses (family Phycodnaviridae ) are large double-stranded DNA (dsDNA) viruses that infect unicellular green algae present in inland waters. These viruses have been isolated using three main chlorella-like green algal host cells, traditionally called NC64A, SAG, and Pbi, revealing extensive genetic diversity. In this study, we performed a functional genomic analysis on 36 chloroviruses that infected the three different hosts. Phylogenetic reconstruction based on the DNA polymerase B family gene clustered the chloroviruses into three distinct clades. The viral pan-genome consists of 1,345 clusters of orthologous groups of genes (COGs), with 126 COGs conserved in all viruses. Totals of 368, 268, and 265 COGs are found exclusively in viruses that infect NC64A, SAG, and Pbi algal hosts, respectively. Two-thirds of the COGs have no known function, constituting the “dark pan-genome” of chloroviruses, and further studies focusing on these genes may identify important novelties. The proportions of functionally characterized COGs composing the pan-genome and the core-genome are similar, but those related to transcription and RNA processing, protein metabolism, and virion morphogenesis are at least 4-fold more represented in the core genome. Bipartite network construction evidencing the COG sharing among host-specific viruses identified 270 COGs shared by at least one virus from each of the different host groups. Finally, our results reveal an open pan-genome for chloroviruses and a well-established core genome, indicating that the isolation of new chloroviruses can be a valuable source of genetic discovery. IMPORTANCE Chloroviruses are large dsDNA viruses that infect unicellular green algae distributed worldwide in freshwater environments. They comprise a genetically diverse group of viruses; however, a comprehensive investigation of the genomic evolution of these viruses is still missing. Here, we performed a functional pan-genome analysis comprising 36 chloroviruses associated with three different algal hosts in the family Chlorellaceae , referred to as zoochlorellae because of their endosymbiotic lifestyle. We identified a set of 126 highly conserved genes, most of which are related to essential functions in the viral replicative cycle. Several genes are unique to distinct isolates, resulting in an open pan-genome for chloroviruses. This profile is associated with generalist organisms, and new insights into the evolution and ecology of chloroviruses are presented. Ultimately, our results highlight the potential for genetic diversity in new isolates. 
    more » « less
  3. Many chloroviruses replicate in Chlorella variabilis algal strains that are ex-endosymbionts isolated from the protozoan Paramecium bursaria, including the NC64A and Syngen 2-3 strains. We noticed that indigenous water samples produced a higher number of plaque-forming viruses on C. variabilis Syngen 2-3 lawns than on C. variabilis NC64A lawns. These observed differences led to the discovery of viruses that replicate exclusively in Syngen 2-3 cells, named Only Syngen (OSy) viruses. Here, we demonstrate that OSy viruses initiate infection in the restricted host NC64A by synthesizing some early virus gene products and that approximately 20% of the cells produce a small number of empty virus capsids. However, the infected cells did not produce infectious viruses because the cells were unable to replicate the viral genome. This is interesting because all previous attempts to isolate host cells resistant to chlorovirus infection were due to changes in the host receptor for the virus. 
    more » « less
  4. Plastid genomes (plastomes) vary enormously in size and gene content among the many lineages of nonphotosynthetic plants, but key lineages remain unexplored. We therefore investigated plastome sequence and expression in the holoparasitic and morphologically bizarre Balanophoraceae. The twoBalanophoraplastomes examined are remarkable, exhibiting features rarely if ever seen before in plastomes or in any other genomes. At 15.5 kb in size and with only 19 genes, they are among the most reduced plastomes known. They have no tRNA genes for protein synthesis, a trait found in only three other plastid lineages, and thusBalanophoraplastids must import all tRNAs needed for translation.Balanophoraplastomes are exceptionally compact, with numerous overlapping genes, highly reduced spacers, loss of allcis-spliced introns, and shrunken protein genes. With A+T contents of 87.8% and 88.4%, theBalanophoragenomes are the most AT-rich genomes known save for a single mitochondrial genome that is merely bloated with AT-rich spacer DNA. Most plastid protein genes inBalanophoraconsist of ≥90% AT, with several between 95% and 98% AT, resulting in the most biased codon usage in any genome described to date. A potential consequence of its radical compositional evolution is the novel genetic code used byBalanophoraplastids, in which TAG has been reassigned from stop to tryptophan. Despite its many exceptional properties, theBalanophoraplastome must be functional because all examined genes are transcribed, its only intron is correctlytrans-spliced, and its protein genes, although highly divergent, are evolving under various degrees of selective constraint.

     
    more » « less
  5. Abstract

    Synonymous codons are not used at equal frequency throughout the genome, a phenomenon termed codon usage bias (CUB). It is often assumed that interspecific variation in the intensity ofCUBis related to species differences in effective population sizes (Ne), with selection onCUBoperating less efficiently in species with smallNe. Here, we specifically ask whether variation inNepredicts differences inCUBin mammals and report two main findings. First, across 41 mammalian genomes,CUBwas not correlated with two indirect proxies ofNe(body mass and generation time), even though there was statistically significant evidence of selection shapingCUBacross all species. Interestingly, autosomal genes showed higher codon usage bias compared to X‐linked genes, and high‐recombination genes showed higher codon usage bias compared to low recombination genes, suggesting intraspecific variation inNepredicts variation inCUB. Second, across six mammalian species with genetic estimates ofNe(human, chimpanzee, rabbit, and three mouse species:Mus musculus, M. domesticus,andM. castaneus),NeandCUBwere weakly and inconsistently correlated. At least in mammals, interspecific divergence inNedoes not strongly predict variation inCUB. One hypothesis is that each species responds to a unique distribution of selection coefficients, confounding any straightforward link betweenNeandCUB.

     
    more » « less