Abstract Background Viruses, the majority of which are uncultivated, are among the most abundant biological entities on Earth. From altering microbial physiology to driving community dynamics, viruses are fundamental members of microbiomes. While the number of studies leveraging viral metagenomics (viromics) for studying uncultivated viruses is growing, standards for viromics research are lacking. Viromics can utilize computational discovery of viruses from total metagenomes of all community members (hereafter metagenomes) or use physical separation of virus-specific fractions (hereafter viromes). However, differences in the recovery and interpretation of viruses from metagenomes and viromes obtained from the same samples remain understudied. Results Here, we compare viral communities from paired viromes and metagenomes obtained from 60 diverse samples across human gut, soil, freshwater, and marine ecosystems. Overall, viral communities obtained from viromes had greater species richness and total viral genome abundances than those obtained from metagenomes, although there were some exceptions. Despite this, metagenomes still contained many viral genomes not detected in viromes. We also found notable differences in the predicted lytic state of viruses detected in viromes vs metagenomes at the time of sequencing. Other forms of variation observed include genome presence/absence, genome quality, and encoded protein content between viromes and metagenomes, but the magnitude of these differences varied by environment. Conclusions Overall, our results show that the choice of method can lead to differing interpretations of viral community ecology. We suggest that the choice of whether to target a metagenome or virome to study viral communities should be dependent on the environmental context and ecological questions being asked. 
However, our overall recommendation to researchers investigating viral ecology and evolution is to pair both approaches to maximize their respective benefits.
Novel Viral DNA Polymerases From Metagenomes Suggest Genomic Sources of Strand-Displacing Biochemical Phenotypes
Viruses are the most abundant and diverse biological entities on the planet and constitute a significant proportion of Earth’s genetic diversity. Most of this diversity is not represented by isolated virus-host systems and has only been observed through sequencing of viral metagenomes (viromes) from environmental samples. Viromes provide snapshots of viral genetic potential and a wealth of information on viral community ecology. These data also provide opportunities for exploring the biochemistry of novel viral enzymes. The in vitro biochemical characteristics of novel viral DNA polymerases were explored, testing hypothesized differences in polymerase biochemistry according to protein sequence phylogeny. Forty-eight viral DNA Polymerase I (PolA) proteins from estuarine viromes, hot spring metagenomes, and reference viruses, encompassing a broad representation of currently known diversity, were synthesized, expressed, and purified. Novel functionality was shown in multiple PolAs. Intriguingly, some of the estuarine viral polymerases demonstrated moderate to strong innate DNA strand displacement activity at high enzyme concentration. Strand-displacing polymerases have important technological applications where isothermal reactions are desirable. Bioinformatic investigation of genes neighboring these strand-displacing polymerases found associations with SNF2 helicase-associated proteins. The specific function of SNF2 family enzymes is unknown for prokaryotes and viruses. In eukaryotes, SNF2 enzymes have chromatin remodeling functions but do not separate nucleic acid strands. This suggests the strand separation function may be fulfilled by the DNA polymerase for viruses carrying SNF2 helicase-associated proteins. The biochemical data from this study expand understanding of the biology and ecological behavior of unknown viruses. 
Moreover, given the numerous biotechnological applications of viral DNA polymerases, novel viral polymerases discovered within viromes may be a rich source of biological material for further in vitro DNA amplification advancements.
- PAR ID: 10369175
- Date Published:
- Journal Name: Frontiers in Microbiology
- Volume: 13
- ISSN: 1664-302X
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
-
Abstract Ring-shaped sliding clamp proteins are essential components of the replication machinery, the replisome, across all domains of life. In bacteria, DNA polymerases bind the sliding clamp, DnaN, through conserved short peptide sequences called clamp-binding motifs. Clamp binding increases the processivity and rate of DNA synthesis and is generally required for polymerase activity. The current understanding of clamp-polymerase interactions was elucidated in the model bacterium Escherichia coli, which has a single replicative polymerase, Pol III. However, many bacteria have two essential replicative polymerases, such as PolC and DnaE in Bacillus subtilis. PolC performs the bulk of DNA synthesis whereas the error-prone DnaE only synthesizes short stretches of DNA on the lagging strand. How the clamp interacts with the two polymerases and coordinates their activity is unknown. We investigated this question by combining in vivo single-molecule fluorescence microscopy with biochemical and microbiological assays. We found that PolC-DnaN binding is essential for replication, although weakening the interaction is tolerated with only minimal effects. In contrast, the DnaE-DnaN interaction is dispensable for replication. Altering the clamp-binding strength of DnaE produces only subtle effects on DnaE cellular localization and dynamics, but it has a substantial impact on mutagenesis. Our results support a model in which DnaE acts distributively during replication but can be stabilized on the DNA template by clamp binding. This study provides new insights into the coordination of multiple replicative polymerases in bacteria and the role of the clamp in polymerase processivity, fidelity, and exchange.
-
Metagenomics has enabled sequencing of viral communities from a myriad of different environments. Viral metagenomic studies routinely uncover sequences with no recognizable homology to known coding regions or genomes. Nevertheless, complete viral genomes have been constructed directly from complex community metagenomes, often through tedious manual curation. To address this, we developed the software tool virMine to identify viral genomes from raw reads representative of viral or mixed (viral and bacterial) communities. virMine automates sequence read quality control, assembly, and annotation. Researchers can easily refine their search for a specific study system and/or feature(s) of interest. In contrast to other viral genome detection tools that often rely on the recognition of viral signature sequences, virMine is not restricted by the insufficient representation of viral diversity in public data repositories. Rather, viral genomes are identified through an iterative approach, first omitting non-viral sequences. Thus, both relatives of previously characterized viruses and novel species can be detected, including both eukaryotic viruses and bacteriophages. Here we present virMine and its analysis of synthetic communities as well as metagenomic data sets from three distinctly different environments: the gut microbiota, the urinary microbiota, and freshwater viromes. Several new viral genomes were identified and annotated, thus contributing to our understanding of viral genetic diversity in these three environments.
-
Chloroviruses are large dsDNA, plaque-forming viruses that infect certain chlorella-like green algae; the algae are normally mutualistic endosymbionts of protists and metazoans and are often referred to as zoochlorellae. The viruses are ubiquitous in inland aqueous environments throughout the world and occasionally single types reach titers of thousands of plaque-forming units per ml of native water. The viruses are icosahedral in shape with a spike structure located at one of the vertices. They contain an internal membrane that is required for infectivity. The viral genomes are 290 to 370 kb in size, which encode up to 16 tRNAs and 330 to ~415 proteins, including many not previously seen in viruses. Examples include genes encoding DNA restriction and modification enzymes, hyaluronan and chitin biosynthetic enzymes, polyamine biosynthetic enzymes, ion channel and transport proteins, and enzymes involved in the glycan synthesis of the virus major capsid glycoproteins. The proteins encoded by many of these viruses are often the smallest or among the smallest proteins of their class. Consequently, some of the viral proteins are the subject of intensive biochemical and structural investigation.
-
Abstract Background Microbes and their viruses are hidden engines driving Earth’s ecosystems from the oceans and soils to humans and bioreactors. Though gene marker approaches can now be complemented by genome-resolved studies of inter- (macrodiversity) and intra- (microdiversity) population variation, analytical tools to do so remain scattered or under-developed. Results Here, we introduce MetaPop, an open-source bioinformatic pipeline that provides a single interface to analyze and visualize microbial and viral community metagenomes at both the macro- and microdiversity levels. Macrodiversity estimates include population abundances and α- and β-diversity. Microdiversity calculations include identification of single nucleotide polymorphisms, novel codon-constrained linkage of SNPs, nucleotide diversity (π and θ), and selective pressures (pN/pS and Tajima’s D) within and fixation indices (F_ST) between populations. MetaPop will also identify genes with distinct codon usage. Following rigorous validation, we applied MetaPop to the gut viromes of autistic children that underwent fecal microbiota transfers and their neurotypical peers. The macrodiversity results confirmed our prior findings for viral populations (microbial shotgun metagenomes were not available) that diversity did not significantly differ between autistic and neurotypical children. However, by also quantifying microdiversity, MetaPop revealed lower average viral nucleotide diversity (π) in autistic children. The percentage of genomes detected under positive selection was also lower among autistic children, suggesting that higher viral π in neurotypical children may be beneficial because it allows populations to better “bet hedge” in changing environments. 
Further, comparisons of microdiversity pre- and post-FMT in autistic children revealed that the FMT delivery method (oral versus rectal) may influence viral activity and engraftment of microdiverse viral populations, with children who received their FMT rectally having higher microdiversity post-FMT. Overall, these results show that analyses at the macro level alone can miss important biological differences. Conclusions These findings suggest that standardized population and genetic variation analyses will be invaluable for maximizing biological inference, and MetaPop provides a convenient tool package to explore the dual impact of macro- and microdiversity across microbial communities.

