Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Abstract Understanding how populations respond to disturbances represents a major goal for microbial ecology. While several hypotheses have been advanced to explain microbial community compositional changes in response to disturbance, appropriate data to test these hypotheses is scarce, due to the challenges in delineating rare vs. abundant taxa and generalists vs. specialists, a prerequisite for testing the theories. Here, we operationally define these two key concepts by employing the patterns of coverage of a (target) genome by a metagenome to identify rare populations, and by borrowing the proportional similarity index from macroecology to identify generalists. We applied these concepts to time-series (field) metagenomes from the Piver’s Island Coastal Observatory to establish that coastal microbial communities are resilient to major perturbations such as tropical cyclones and (uncommon) cold or warm temperature events, in part due to the response of rare populations. Therefore, these results provide support for the insurance hypothesis [i.e. the rare biosphere has the buffering capacity to mitigate the effects of disturbance]. Additionally, generalists appear to contribute proportionally more than specialists to community adaptation to perturbations like warming, supporting the disturbance-specialization hypothesis [i.e. disturbance favors generalists]. Several of these findings were also observed in replicated laboratory mesocosms that aimed to simulate disturbances such as a rain-driven washout of microbial cells and a labile organic matter release from a phytoplankton bloom. Taken together, our results advance understanding of the mechanisms governing microbial population dynamics under changing environmental conditions and have implications for ecosystem modeling.more » « less
-
Abstract Genome search and/or classification typically involves finding the best-match database (reference) genomes and has become increasingly challenging due to the growing number of available database genomes and the fact that traditional methods do not scale well with large databases. By combining k-mer hashing-based probabilistic data structures (i.e. ProbMinHash, SuperMinHash, Densified MinHash and SetSketch) to estimate genomic distance, with a graph based nearest neighbor search algorithm (Hierarchical Navigable Small World Graphs, or HNSW), we created a new data structure and developed an associated computer program, GSearch, that is orders of magnitude faster than alternative tools while maintaining high accuracy and low memory usage. For example, GSearch can search 8000 query genomes against all available microbial or viral genomes for their best matches (n = ∼318 000 or ∼3 000 000, respectively) within a few minutes on a personal laptop, using ∼6 GB of memory (2.5 GB via SetSketch). Notably, GSearch has an O(log(N)) time complexity and will scale well with billions of genomes based on a database splitting strategy. Further, GSearch implements a three-step search strategy depending on the degree of novelty of the query genomes to maximize specificity and sensitivity. Therefore, GSearch solves a major bottleneck of microbiome studies that require genome search and/or classification.more » « less
An official website of the United States government
