skip to main content


Title: inStrain profiles population microdiversity from metagenomic data and sensitively detects shared microbial strains
Coexisting microbial cells of the same species often exhibit genetic variation that can affect phenotypes ranging from nutrient preference to pathogenicity. Here we present inStrain, a program that uses metagenomic paired reads to profile intra-population genetic diversity (microdiversity) across whole genomes and compares microbial populations in a microdiversity-aware man- ner, greatly increasing the accuracy of genomic comparisons when benchmarked against existing methods. We use inStrain to profile >1,000 fecal metagenomes from newborn premature infants and find that siblings share significantly more strains than unrelated infants, although identical twins share no more strains than fraternal siblings. Infants born by cesarean section har- bor Klebsiella with significantly higher nucleotide diversity than infants delivered vaginally, potentially reflecting acquisition from hospital rather than maternal microbiomes. Genomic loci that show diversity in individual infants include variants found between other infants, possibly reflecting inoculation from diverse hospital-associated sources. inStrain can be applied to any metagenomic dataset for microdiversity analysis and rigorous strain comparison.  more » « less
Award ID(s):
1656009
NSF-PAR ID:
10229801
Author(s) / Creator(s):
; ; ; ; ;
Date Published:
Journal Name:
Nature Biotechnology
ISSN:
1087-0156
Page Range / eLocation ID:
1-10
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Senko, John M. (Ed.)

    In commercial large-scale aquaria, controlling levels of nitrogenous compounds is essential for macrofauna health. Naturally occurring bacteria are capable of transforming toxic nitrogen species into their more benign counterparts and play important roles in maintaining aquaria health. Nitrification, the microbially-mediated transformation of ammonium and nitrite to nitrate, is a common and encouraged process for management of both commercial and home aquaria. A potentially competing microbial process that transforms ammonium and nitrite to dinitrogen gas (anaerobic ammonium oxidation [anammox]) is mediated by some bacteria within the phylum Planctomycetes. Anammox has been harnessed for nitrogen removal during wastewater treatment, as the nitrogenous end product is released into the atmosphere rather than in aqueous discharge. Whether anammox bacteria could be similarly utilized in commercial aquaria is an open question. As a first step in assessing the viability of this practice, we (i) characterized microbial communities from water and sand filtration systems for four habitats at the Tennessee Aquarium and (ii) examined the abundance and anammox potential of Planctomycetes using culture-independent approaches. 16S rRNA gene amplicon sequencing revealed distinct, yet stable, microbial communities and the presence of Planctomycetes (~1–15% of library reads) in all sampled habitats. Preliminary metagenomic analyses identified the genetic potential for multiple complete nitrogen metabolism pathways. However, no known genes diagnostic for the anammox reaction were found in this survey. To better understand the diversity of this group of bacteria in these systems, a targeted Planctomycete-specific 16S rRNA gene-based PCR approach was used. This effort recovered amplicons that share <95% 16S rRNA gene sequence identity to previously characterized Planctomycetes, suggesting novel strains within this phylum reside within aquaria.

     
    more » « less
  2. ABSTRACT Diversification can generate genomic and phenotypic strain-level diversity within microbial species. This microdiversity is widely recognized in populations, but the community-level consequences of microbial strain-level diversity are poorly characterized. Using the cheese rind model system, we tested whether strain diversity across microbiomes from distinct geographic regions impacts assembly dynamics and functional outputs. We first isolated the same three bacterial species ( Staphylococcus equorum , Brevibacterium auranticum , and Brachybacterium alimentarium ) from nine cheeses produced in different regions of the United States and Europe to construct nine synthetic microbial communities consisting of distinct strains of the same three bacterial species. Comparative genomics identified distinct phylogenetic clusters and significant variation in genome content across the nine synthetic communities. When we assembled each synthetic community with initially identical compositions, community structure diverged over time, resulting in communities with different dominant taxa. The taxonomically identical communities showed differing responses to abiotic (high salt) and biotic (the fungus Penicillium ) perturbations, with some communities showing no response and others substantially shifting in composition. Functional differences were also observed across the nine communities, with significant variation in pigment production (light yellow to orange) and in composition of volatile organic compound profiles emitted from the rinds (nutty to sulfury). IMPORTANCE Our work demonstrated that the specific microbial strains used to construct a microbiome could impact the species composition, perturbation responses, and functional outputs of that system. These findings suggest that 16S rRNA gene taxonomic profiles alone may have limited potential to predict the dynamics of microbial communities because they usually do not capture strain-level diversity. Observations from our synthetic communities also suggest that strain-level diversity has the potential to drive variability in the aesthetics and quality of surface-ripened cheeses. 
    more » « less
  3. The extent and ecological significance of intraspecific functional diversity within marine microbial populations is still poorly understood, and it remains unclear if such strain-level microdiversity will affect fitness and persistence in a rapidly changing ocean environment. In this study, we cultured 11 sympatric strains of the ubiquitous marine picocyanobacteriumSynechococcusisolated from a Narragansett Bay (RI) phytoplankton community thermal selection experiment. Thermal performance curves revealed selection at cool and warm temperatures had subdivided the initial population into thermotypes with pronounced differences in maximum growth temperatures. Curiously, the genomes of all 11 isolates were almost identical (average nucleotide identities of >99.99%, with >99% of the genome aligning) and no differences in gene content or single nucleotide variants were associated with either cool or warm temperature phenotypes. Despite a very high level of genomic similarity, sequenced epigenomes for two strains showed differences in methylation on genes associated with photosynthesis. These corresponded to measured differences in photophysiology, suggesting a potential pathway for future mechanistic research into thermal microdiversity. Our study demonstrates that present-day marine microbial populations can harbor cryptic but environmentally relevant thermotypes which may increase their resilience to future rising temperatures.

     
    more » « less
  4. Abstract Background Metagenomic data can be used to profile high-importance genes within microbiomes. However, current metagenomic workflows produce data that suffer from low sensitivity and an inability to accurately reconstruct partial or full genomes, particularly those in low abundance. These limitations preclude colocalization analysis, i.e., characterizing the genomic context of genes and functions within a metagenomic sample. Genomic context is especially crucial for functions associated with horizontal gene transfer (HGT) via mobile genetic elements (MGEs), for example antimicrobial resistance (AMR). To overcome this current limitation of metagenomics, we present a method for comprehensive and accurate reconstruction of antimicrobial resistance genes (ARGs) and MGEs from metagenomic DNA, termed t arget- e nriched l ong-read seq uencing (TELSeq). Results Using technical replicates of diverse sample types, we compared TELSeq performance to that of non-enriched PacBio and short-read Illumina sequencing. TELSeq achieved much higher ARG recovery (>1,000-fold) and sensitivity than the other methods across diverse metagenomes, revealing an extensive resistome profile comprising many low-abundance ARGs, including some with public health importance. Using the long reads generated by TELSeq, we identified numerous MGEs and cargo genes flanking the low-abundance ARGs, indicating that these ARGs could be transferred across bacterial taxa via HGT. Conclusions TELSeq can provide a nuanced view of the genomic context of microbial resistomes and thus has wide-ranging applications in public, animal, and human health, as well as environmental surveillance and monitoring of AMR. Thus, this technique represents a fundamental advancement for microbiome research and application. 
    more » « less
  5. Pettigrew, Melinda M. (Ed.)
    ABSTRACT Viral genome sequencing has guided our understanding of the spread and extent of genetic diversity of SARS-CoV-2 during the COVID-19 pandemic. SARS-CoV-2 viral genomes are usually sequenced from nasopharyngeal swabs of individual patients to track viral spread. Recently, RT-qPCR of municipal wastewater has been used to quantify the abundance of SARS-CoV-2 in several regions globally. However, metatranscriptomic sequencing of wastewater can be used to profile the viral genetic diversity across infected communities. Here, we sequenced RNA directly from sewage collected by municipal utility districts in the San Francisco Bay Area to generate complete and nearly complete SARS-CoV-2 genomes. The major consensus SARS-CoV-2 genotypes detected in the sewage were identical to clinical genomes from the region. Using a pipeline for single nucleotide variant calling in a metagenomic context, we characterized minor SARS-CoV-2 alleles in the wastewater and detected viral genotypes which were also found within clinical genomes throughout California. Observed wastewater variants were more similar to local California patient-derived genotypes than they were to those from other regions within the United States or globally. Additional variants detected in wastewater have only been identified in genomes from patients sampled outside California, indicating that wastewater sequencing can provide evidence for recent introductions of viral lineages before they are detected by local clinical sequencing. These results demonstrate that epidemiological surveillance through wastewater sequencing can aid in tracking exact viral strains in an epidemic context. 
    more » « less