skip to main content

This content will become publicly available on April 21, 2023

Title: Novel Viral DNA Polymerases From Metagenomes Suggest Genomic Sources of Strand-Displacing Biochemical Phenotypes
Viruses are the most abundant and diverse biological entities on the planet and constitute a significant proportion of Earth’s genetic diversity. Most of this diversity is not represented by isolated viral-host systems and has only been observed through sequencing of viral metagenomes (viromes) from environmental samples. Viromes provide snapshots of viral genetic potential, and a wealth of information on viral community ecology. These data also provide opportunities for exploring the biochemistry of novel viral enzymes. The in vitro biochemical characteristics of novel viral DNA polymerases were explored, testing hypothesized differences in polymerase biochemistry according to protein sequence phylogeny. Forty-eight viral DNA Polymerase I (PolA) proteins from estuarine viromes, hot spring metagenomes, and reference viruses, encompassing a broad representation of currently known diversity, were synthesized, expressed, and purified. Novel functionality was shown in multiple PolAs. Intriguingly, some of the estuarine viral polymerases demonstrated moderate to strong innate DNA strand displacement activity at high enzyme concentration. Strand-displacing polymerases have important technological applications where isothermal reactions are desirable. Bioinformatic investigation of genes neighboring these strand displacing polymerases found associations with SNF2 helicase-associated proteins. The specific function of SNF2 family enzymes is unknown for prokaryotes and viruses. In eukaryotes, SNF2 enzymes have chromatin remodeling more » functions but do not separate nucleic acid strands. This suggests the strand separation function may be fulfilled by the DNA polymerase for viruses carrying SNF2 helicase-associated proteins. Biochemical data elucidated from this study expands understanding of the biology and ecological behavior of unknown viruses. Moreover, given the numerous biotechnological applications of viral DNA polymerases, novel viral polymerases discovered within viromes may be a rich source of biological material for further in vitro DNA amplification advancements. « less
; ; ; ; ; ; ; ; ;
Award ID(s):
2025567 1736030
Publication Date:
Journal Name:
Frontiers in Microbiology
Sponsoring Org:
National Science Foundation
More Like this
  1. Wang, Aiming (Ed.)
    Positive-strand (+)RNA viruses take advantage of the host cells by subverting a long list of host protein factors and transport vesicles and cellular organelles to build membranous viral replication organelles (VROs) that support robust RNA replication. How RNA viruses accomplish major recruitment tasks of a large number of cellular proteins are intensively studied. In case of tomato bushy stunt virus (TBSV), a single viral replication protein, named p33, carries out most of the recruitment duties. Yet, it is currently unknown how the viral p33 replication protein, which is membrane associated, is capable of the rapid and efficient recruitment of numerous cytosolic host proteins to facilitate the formation of large VROs. In this paper, we show that, TBSV p33 molecules do not recruit each cytosolic host factor one-by-one into VROs, but p33 targets a cytosolic protein interaction hub, namely Rpn11, which interacts with numerous other cytosolic proteins. The highly conserved Rpn11, called POH1 in humans, is the metalloprotease subunit of the proteasome, which couples deubiquitination and degradation of proteasome substrates. However, TBSV takes advantage of a noncanonical function of Rpn11 by exploiting Rpn11’s interaction with highly abundant cytosolic proteins and the actin network. We provide supporting evidence that the co-opted Rpn11more »in coordination with the subverted actin network is used for delivering cytosolic proteins, such as glycolytic and fermentation enzymes, which are readily subverted into VROs to produce ATP locally in support of VRO formation, viral replicase complex assembly and viral RNA replication. Using several approaches, including knockdown of Rpn11 level, sequestering Rpn11 from the cytosol into the nucleus in plants or temperature-sensitive mutation in Rpn11 in yeast, we show the inhibition of recruitment of glycolytic and fermentation enzymes into VROs. The Rpn11-assisted recruitment of the cytosolic enzymes by p33, however, also requires the combined and coordinated role of the subverted actin network. Accordingly, stabilization of the actin filaments by expression of the Legionella VipA effector in yeast and plant, or via a mutation of ACT1 in yeast resulted in more efficient and rapid recruitment of Rpn11 and the selected glycolytic and fermentation enzymes into VROs. On the contrary, destruction of the actin filaments via expression of the Legionella RavK effector led to poor recruitment of Rpn11 and glycolytic and fermentation enzymes. Finally, we confirmed the key roles of Rpn11 and the actin filaments in situ ATP production within TBSV VROs via using a FRET-based ATP-biosensor. The novel emerging theme is that TBSV targets Rpn11 cytosolic protein interaction hub driven by the p33 replication protein and aided by the subverted actin filaments to deliver several co-opted cytosolic pro-viral factors for robust replication within VROs.« less
  2. Abstract Background Microbes and their viruses are hidden engines driving Earth’s ecosystems from the oceans and soils to humans and bioreactors. Though gene marker approaches can now be complemented by genome-resolved studies of inter-(macrodiversity) and intra-(microdiversity) population variation, analytical tools to do so remain scattered or under-developed. Results Here, we introduce MetaPop, an open-source bioinformatic pipeline that provides a single interface to analyze and visualize microbial and viral community metagenomes at both the macro - and microdiversity levels. Macrodiversity estimates include population abundances and α- and β-diversity. Microdiversity calculations include identification of single nucleotide polymorphisms, novel codon-constrained linkage of SNPs, nucleotide diversity ( π and θ ), and selective pressures (pN/pS and Tajima’s D ) within and fixation indices ( F ST ) between populations. MetaPop will also identify genes with distinct codon usage. Following rigorous validation, we applied MetaPop to the gut viromes of autistic children that underwent fecal microbiota transfers and their neurotypical peers. The macrodiversity results confirmed our prior findings for viral populations (microbial shotgun metagenomes were not available) that diversity did not significantly differ between autistic and neurotypical children. However, by also quantifying microdiversity, MetaPop revealed lower average viral nucleotide diversity ( π ) in autisticmore »children. Analysis of the percentage of genomes detected under positive selection was also lower among autistic children, suggesting that higher viral π in neurotypical children may be beneficial because it allows populations to better “bet hedge” in changing environments. Further, comparisons of microdiversity pre- and post-FMT in autistic children revealed that the delivery FMT method (oral versus rectal) may influence viral activity and engraftment of microdiverse viral populations, with children who received their FMT rectally having higher microdiversity post-FMT. Overall, these results show that analyses at the macro level alone can miss important biological differences. Conclusions These findings suggest that standardized population and genetic variation analyses will be invaluable for maximizing biological inference, and MetaPop provides a convenient tool package to explore the dual impact of macro - and microdiversity across microbial communities.« less
  3. López, Susana (Ed.)
    ABSTRACT The rotavirus polymerase VP1 mediates all stages of viral RNA synthesis within the confines of subviral particles and while associated with the core shell protein VP2. Transcription (positive-strand RNA [+RNA] synthesis) by VP1 occurs within double-layered particles (DLPs), while genome replication (double-stranded RNA [dsRNA] synthesis) by VP1 occurs within assembly intermediates. VP2 is critical for VP1 enzymatic activity; yet, the mechanism by which the core shell protein triggers polymerase function remains poorly understood. Structural analyses of transcriptionally competent DLPs show that VP1 is located beneath the VP2 core shell and sits slightly off-center from each of the icosahedral 5-fold axes. In this position, the polymerase is contacted by the core shell at 5 distinct surface-exposed sites, comprising VP1 residues 264 to 267, 547 to 550, 614 to 620, 968 to 980, and 1022 to 1025. Here, we sought to test the functional significance of these VP2 contact sites on VP1 with regard to polymerase activity. We engineered 19 recombinant VP1 (rVP1) proteins that contained single- or multipoint alanine mutations within each individual contact site and assayed them for the capacity to synthesize dsRNA in vitro in the presence of rVP2. Three rVP1 mutants (E265A/L267A, R614A, and D971A/S978A/I980A) exhibited diminishedmore »in vitro dsRNA synthesis. Despite their loss-of-function phenotypes, the mutants did not show major structural changes in silico, and they maintained their overall capacity to bind rVP2 in vitro via their nonmutated contact sites. These results move us toward a mechanistic understanding of rotavirus replication and identify precise VP2-binding sites on the polymerase surface that are critical for its enzymatic activation. IMPORTANCE Rotaviruses are important pathogens that cause severe gastroenteritis in the young of many animals. The viral polymerase VP1 mediates all stages of viral RNA synthesis, and it requires the core shell protein VP2 for its enzymatic activity. Yet, there are several gaps in knowledge about how VP2 engages and activates VP1. Here, we probed the functional significance of 5 distinct VP2 contact sites on VP1 that were revealed through previous structural studies. Specifically, we engineered alanine amino acid substitutions within each of the 5 VP1 regions and assayed the mutant polymerases for the capacity to synthesize RNA in the presence of VP2 in a test tube. Our results identified residues within 3 of the VP2 contact sites that are critical for robust polymerase activity. These results are important because they enhance the understanding of a key step of the rotavirus replication cycle.« less
  4. Background

    Viruses strongly influence microbial population dynamics and ecosystem functions. However, our ability to quantitatively evaluate those viral impacts is limited to the few cultivated viruses and double-stranded DNA (dsDNA) viral genomes captured in quantitative viral metagenomes (viromes). This leaves the ecology of non-dsDNA viruses nearly unknown, including single-stranded DNA (ssDNA) viruses that have been frequently observed in viromes, but not quantified due to amplification biases in sequencing library preparations (Multiple Displacement Amplification, Linker Amplification or Tagmentation).


    Here we designed mock viral communities including both ssDNA and dsDNA viruses to evaluate the capability of a sequencing library preparation approach including an Adaptase step prior to Linker Amplification for quantitative amplification of both dsDNA and ssDNA templates. We then surveyed aquatic samples to provide first estimates of the abundance of ssDNA viruses.


    Mock community experiments confirmed the biased nature of existing library preparation methods for ssDNA templates (either largely enriched or selected against) and showed that the protocol using Adaptase plus Linker Amplification yielded viromes that were ±1.8-fold quantitative for ssDNA and dsDNA viruses. Application of this protocol to community virus DNA from three freshwater and three marine samples revealed that ssDNA viruses as a whole represent only a minor fraction (<5%)more »of DNA virus communities, though individual ssDNA genomes, both eukaryote-infecting Circular Rep-Encoding Single-Stranded DNA (CRESS-DNA) viruses and bacteriophages from theMicroviridaefamily, can be among the most abundant viral genomes in a sample.


    Together these findings provide empirical data for a new virome library preparation protocol, and a first estimate of ssDNA virus abundance in aquatic systems.

    « less
  5. ABSTRACT Viral infection exerts selection pressure on marine microbes, as virus-induced cell lysis causes 20 to 50% of cell mortality, resulting in fluxes of biomass into oceanic dissolved organic matter. Archaeal and bacterial populations can defend against viral infection using the clustered regularly interspaced short palindromic repeat (CRISPR)-associated (Cas) system, which relies on specific matching between a spacer sequence and a viral gene. If a CRISPR spacer match to any gene within a viral genome is equally effective in preventing lysis, no viral genes should be preferentially matched by CRISPR spacers. However, if there are differences in effectiveness, certain viral genes may demonstrate a greater frequency of CRISPR spacer matches. Indeed, homology search analyses of bacterioplankton CRISPR spacer sequences against virioplankton sequences revealed preferential matching of replication proteins, nucleic acid binding proteins, and viral structural proteins. Positive selection pressure for effective viral defense is one parsimonious explanation for these observations. CRISPR spacers from virioplankton metagenomes preferentially matched methyltransferase and phage integrase genes within virioplankton sequences. These virioplankton CRISPR spacers may assist infected host cells in defending against competing phage. Analyses also revealed that half of the spacer-matched viral genes were unknown, some genes matched several spacers, and some spacers matchedmore »multiple genes, a many-to-many relationship. Thus, CRISPR spacer matching may be an evolutionary algorithm, agnostically identifying those genes under stringent selection pressure for sustaining viral infection and lysis. Investigating this subset of viral genes could reveal those genetic mechanisms essential to virus-host interactions and provide new technologies for optimizing CRISPR defense in beneficial microbes. IMPORTANCE The CRISPR-Cas system is one means by which bacterial and archaeal populations defend against viral infection which causes 20 to 50% of cell mortality in the ocean. We tested the hypothesis that certain viral genes are preferentially targeted for the initial attack of the CRISPR-Cas system on a viral genome. Using CASC, a pipeline for CRISPR spacer discovery, and metagenome data from oceanic microbes and viruses, we found a clear subset of viral genes with high match frequencies to CRISPR spacers. Moreover, we observed a many-to-many relationship of spacers and viral genes. These high-match viral genes were involved in nucleotide metabolism, DNA methylation, and viral structure. It is possible that CRISPR spacer matching is an evolutionary algorithm pointing to those viral genes most important to sustaining infection and lysis. Studying these genes may advance the understanding of virus-host interactions in nature and provide new technologies for leveraging CRISPR-Cas systems in beneficial microbes.« less