Title: Targeted assemblies of cas1 suggest CRISPR-Cas’s response to soil warming
Abstract

There is increasing interest in using the clustered regularly interspaced short palindromic repeats and CRISPR-associated protein (CRISPR-Cas) system to reveal potential virus–host dynamics. The universal and most conserved Cas protein, cas1, is an ideal marker to elucidate CRISPR-Cas ecology. We constructed eight Hidden Markov Models (HMMs) and assembled cas1 directly from metagenomes with a targeted-gene assembler, Xander, to improve detection capacity and resolve the diverse CRISPR-Cas systems. The eight HMMs were first validated by recovering all 17 cas1 subtypes from a simulated metagenome generated from 91 prokaryotic genomes across 11 phyla. We then challenged the targeted method with 48 metagenomes from a tallgrass prairie in Central Oklahoma, recovering 3394 cas1 sequences. Among those, 88 were near full length, 5 times more than in de novo assemblies from the Oklahoma metagenomes. To validate the host assignment by cas1, the targeted-assembled cas1 sequences were mapped to the de novo assembled contigs. The phylum assignments of all mapped contigs, which were made independently of the CRISPR-Cas genes on the same contigs, were consistent with the host taxonomies predicted by the mapped cas1. We then investigated whether 8 years of soil warming altered cas1 prevalence within the communities. A shift in microbial abundances was observed during the year with the biggest temperature differential (mean 4.16 °C above ambient). Over the next 3 years, cas1 prevalence increased, even in the phyla with decreased microbial abundances, suggesting increasing virus–host interactions in response to soil warming. This targeted method provides an alternative means to effectively mine cas1 from metagenomes and uncover the host communities.

 
Award ID(s):
1759892
NSF-PAR ID:
10154405
Author(s) / Creator(s):
; ; ; ; ; ; ; ;
Publisher / Repository:
Oxford University Press
Date Published:
Journal Name:
The ISME Journal
Volume:
14
Issue:
7
ISSN:
1751-7362
Format(s):
Medium: X Size: p. 1651-1662
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract

    The Pastaza‐Marañón Foreland Basin (PMFB) holds the most extensive tropical peatland area in South America. PMFB peatlands store ~7.07 Gt of organic carbon interacting with multiple microbial heterotrophic, methanogenic, and other aerobic/anaerobic respirations. Little is understood about the contribution of distinct microbial community members inhabiting tropical peatlands. Here, we studied the metagenomes of three geochemically distinct peatlands spanning minerotrophic, mixed, and ombrotrophic conditions. Using gene‐ and genome‐centric approaches, we evaluate the functional potential of the underlying microbial communities. Abundance analyses show significant differences in C, N, P, and S acquisition genes. Furthermore, community interactions mediated by toxin–antitoxin and CRISPR‐Cas systems were enriched in oligotrophic soils, suggesting that non‐metabolic interactions may exert additional controls in low‐nutrient environments. Additionally, we reconstructed 519 metagenome‐assembled genomes spanning 28 phyla. Our analyses detail key differences across the geochemical gradient in the predicted microbial populations involved in degradation of organic matter, and the cycling of N and S. Notably, we observed differences in the nitric oxide (NO) reduction strategies between sites with high and low N₂O fluxes and found phyla putatively capable of both NO and sulfate reduction. Our findings detail how gene abundances and microbial populations are influenced by geochemical differences in tropical peatlands.

     
  2. Background

    Metagenomics has transformed our understanding of microbial diversity across ecosystems, with recent advances enabling de novo assembly of genomes from metagenomes. These metagenome-assembled genomes are critical to provide ecological, evolutionary, and metabolic context for all the microbes and viruses yet to be cultivated. Metagenomes can now be generated from nanogram to subnanogram amounts of DNA. However, these libraries require several rounds of PCR amplification before sequencing, and recent data suggest these typically yield smaller and more fragmented assemblies than regular metagenomes.

    Methods

    Here we evaluate de novo assembly methods of 169 PCR-amplified metagenomes, including 25 for which an unamplified counterpart is available, to optimize specific assembly approaches for PCR-amplified libraries. We first evaluated coverage bias by mapping reads from PCR-amplified metagenomes onto reference contigs obtained from unamplified metagenomes of the same samples. Then, we compared different assembly pipelines in terms of assembly size (number of bp in contigs ≥ 10 kb) and error rates to evaluate which are the best suited for PCR-amplified metagenomes.

    Results

    Read mapping analyses revealed that the depth of coverage within individual genomes is significantly more uneven in PCR-amplified datasets versus unamplified metagenomes, with regions of high depth of coverage enriched in short inserts. This enrichment scales with the number of PCR cycles performed, and is presumably due to preferential amplification of short inserts. Standard assembly pipelines are confounded by this type of coverage unevenness, so we evaluated other assembly options to mitigate these issues. We found that a pipeline combining read deduplication and an assembly algorithm originally designed to recover genomes from libraries generated after whole genome amplification (single-cell SPAdes) frequently improved assembly of contigs ≥10 kb by 10 to 100-fold for low input metagenomes.

    Conclusions

    PCR-amplified metagenomes have enabled scientists to explore communities traditionally challenging to describe, including some with extremely low biomass or from which DNA is particularly difficult to extract. Here we show that a modified assembly pipeline can lead to an improved de novo genome assembly from PCR-amplified datasets, and enables a better genome recovery from low input metagenomes.

     
  3. ABSTRACT

    Theory, simulation, and experimental evolution demonstrate that diversified CRISPR-Cas immunity to lytic viruses can lead to stochastic virus extinction due to a limited number of susceptible hosts available to each potential new protospacer escape mutation. Under such conditions, theory predicts that to evade extinction, viruses evolve toward decreased virulence and promote vertical transmission and persistence in infected hosts. To better understand the evolution of host-virus interactions in microbial populations with active CRISPR-Cas immunity, we studied the interaction between CRISPR-immune Sulfolobus islandicus cells and immune-deficient strains that are infected by the chronic virus SSV9. We demonstrate that Sulfolobus islandicus cells infected with SSV9, and with other related SSVs, kill uninfected, immune strains through an antagonistic mechanism that is a protein and is independent of infectious virus. Cells that are infected with SSV9 are protected from killing and persist in the population. We hypothesize that this infection acts as a form of mutualism between the host and the virus by removing competitors in the population and ensuring continued vertical transmission of the virus within populations with diversified CRISPR-Cas immunity.

    IMPORTANCE

    Multiple studies, especially those focusing on the role of lytic viruses in key model systems, have shown the importance of viruses in shaping microbial populations. However, it has become increasingly clear that viruses with a long host-virus interaction, such as those with a chronic lifestyle, can be important drivers of evolution and have large impacts on host ecology. In this work, we describe one such interaction with the acidic crenarchaeon Sulfolobus islandicus and its chronic virus Sulfolobus spindle-shaped virus 9. Our work expands the view in which this symbiosis between host and virus evolved, describing a killing phenotype which we hypothesize has evolved in part due to the high prevalence and diversity of CRISPR-Cas immunity seen in natural populations. We explore the implications of this phenotype in population dynamics and host ecology, as well as the implications of mutualism between this virus-host pair.
  4. Abstract

    Background

    Winter carbon loss in northern ecosystems is estimated to be greater than the average growing season carbon uptake and is primarily driven by microbial decomposers. Viruses modulate microbial carbon cycling via induced mortality and metabolic controls, but it is unknown whether viruses are active under winter conditions (anoxic conditions and sub-freezing temperatures).

    Results

    We used stable isotope probing (SIP) targeted metagenomics to reveal the genomic potential of active soil microbial populations under simulated winter conditions, with an emphasis on viruses and virus-host dynamics. Arctic peat soils from the Bonanza Creek Long-Term Ecological Research site in Alaska were incubated under sub-freezing anoxic conditions with H₂¹⁸O or natural abundance water for 184 and 370 days. We sequenced 23 SIP-metagenomes and measured carbon dioxide (CO₂) efflux throughout the experiment. We identified 46 bacterial populations (spanning 9 phyla) and 243 viral populations that actively took up ¹⁸O in soil and respired CO₂ throughout the incubation. Active bacterial populations represented only a small portion of the detected microbial community and were capable of fermentation and organic matter degradation. In contrast, active viral populations represented a large portion of the detected viral community and one third were linked to active bacterial populations. We identified 86 auxiliary metabolic genes and other environmentally relevant genes. The majority of these genes were carried by active viral populations and had diverse functions such as carbon utilization and scavenging that could provide their host with a fitness advantage for utilizing much-needed carbon sources or acquiring essential nutrients.

    Conclusions

    Overall, there was a stark difference in the identity and function of the active bacterial and viral community compared to the unlabeled community that would have been overlooked with a non-targeted standard metagenomic analysis. Our results illustrate that substantial active virus-host interactions occur in sub-freezing anoxic conditions and highlight viruses as a major community-structuring agent that likely modulates carbon loss in peat soils during winter, which may be pivotal for understanding the future fate of arctic soils' vast carbon stocks.

     
  5. ABSTRACT

    Viral infection exerts selection pressure on marine microbes, as virus-induced cell lysis causes 20 to 50% of cell mortality, resulting in fluxes of biomass into oceanic dissolved organic matter. Archaeal and bacterial populations can defend against viral infection using the clustered regularly interspaced short palindromic repeat (CRISPR)-associated (Cas) system, which relies on specific matching between a spacer sequence and a viral gene. If a CRISPR spacer match to any gene within a viral genome is equally effective in preventing lysis, no viral genes should be preferentially matched by CRISPR spacers. However, if there are differences in effectiveness, certain viral genes may demonstrate a greater frequency of CRISPR spacer matches. Indeed, homology search analyses of bacterioplankton CRISPR spacer sequences against virioplankton sequences revealed preferential matching of replication proteins, nucleic acid binding proteins, and viral structural proteins. Positive selection pressure for effective viral defense is one parsimonious explanation for these observations. CRISPR spacers from virioplankton metagenomes preferentially matched methyltransferase and phage integrase genes within virioplankton sequences. These virioplankton CRISPR spacers may assist infected host cells in defending against competing phage. Analyses also revealed that half of the spacer-matched viral genes were unknown, some genes matched several spacers, and some spacers matched multiple genes, a many-to-many relationship. Thus, CRISPR spacer matching may be an evolutionary algorithm, agnostically identifying those genes under stringent selection pressure for sustaining viral infection and lysis. Investigating this subset of viral genes could reveal those genetic mechanisms essential to virus-host interactions and provide new technologies for optimizing CRISPR defense in beneficial microbes.

    IMPORTANCE

    The CRISPR-Cas system is one means by which bacterial and archaeal populations defend against viral infection which causes 20 to 50% of cell mortality in the ocean. We tested the hypothesis that certain viral genes are preferentially targeted for the initial attack of the CRISPR-Cas system on a viral genome. Using CASC, a pipeline for CRISPR spacer discovery, and metagenome data from oceanic microbes and viruses, we found a clear subset of viral genes with high match frequencies to CRISPR spacers. Moreover, we observed a many-to-many relationship of spacers and viral genes. These high-match viral genes were involved in nucleotide metabolism, DNA methylation, and viral structure. It is possible that CRISPR spacer matching is an evolutionary algorithm pointing to those viral genes most important to sustaining infection and lysis. Studying these genes may advance the understanding of virus-host interactions in nature and provide new technologies for leveraging CRISPR-Cas systems in beneficial microbes.