NSF PAR Search | NSF Public Access Repository

Quantifying the contribution of the rare biosphere to natural disturbances

https://doi.org/10.1093/ismejo/wraf129

Zhao, Jianshu; Brandt, Genevieve; Gronniger, Jessica L.; Wang, Zhao; Li, Jiaqian; Hunt, Dana E.; Rodriguez-R, Luis M.; Hatt, Janet K.; Konstantinidis, Konstantinos T. (June 2025, The ISME Journal)

Abstract Understanding how populations respond to disturbances represents a major goal for microbial ecology. While several hypotheses have been advanced to explain microbial community compositional changes in response to disturbance, appropriate data to test these hypotheses is scarce, due to the challenges in delineating rare vs. abundant taxa and generalists vs. specialists, a prerequisite for testing the theories. Here, we operationally define these two key concepts by employing the patterns of coverage of a (target) genome by a metagenome to identify rare populations, and by borrowing the proportional similarity index from macroecology to identify generalists. We applied these concepts to time-series (field) metagenomes from the Piver’s Island Coastal Observatory to establish that coastal microbial communities are resilient to major perturbations such as tropical cyclones and (uncommon) cold or warm temperature events, in part due to the response of rare populations. Therefore, these results provide support for the insurance hypothesis [i.e. the rare biosphere has the buffering capacity to mitigate the effects of disturbance]. Additionally, generalists appear to contribute proportionally more than specialists to community adaptation to perturbations like warming, supporting the disturbance-specialization hypothesis [i.e. disturbance favors generalists]. Several of these findings were also observed in replicated laboratory mesocosms that aimed to simulate disturbances such as a rain-driven washout of microbial cells and a labile organic matter release from a phytoplankton bloom. Taken together, our results advance understanding of the mechanisms governing microbial population dynamics under changing environmental conditions and have implications for ecosystem modeling.
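The abstract names the proportional similarity index without stating its formula. As a rough illustration only, the sketch below assumes the common macroecological form PS = 1 - 0.5 * sum(|p_i - q_i|), where p_i is the share of a population's abundance found in sample i and q_i is the share of some reference quantity (here, hypothetically, total community abundance) in that sample. The function name, the inputs, and the choice of reference quantity are assumptions for illustration and are not taken from the paper.

```python
def proportional_similarity(usage, availability):
    """Proportional similarity index, PS = 1 - 0.5 * sum(|p_i - q_i|).

    usage:        abundance of one population in each sample (hypothetical input)
    availability: reference quantity in each sample (hypothetical input)
    Values near 1 suggest a generalist; low values suggest a specialist.
    """
    if len(usage) != len(availability):
        raise ValueError("usage and availability must have the same length")
    total_usage = sum(usage)
    total_availability = sum(availability)
    if total_usage <= 0 or total_availability <= 0:
        raise ValueError("both inputs must have a positive total")
    # Normalize both vectors to proportions that sum to 1.
    p = [u / total_usage for u in usage]
    q = [a / total_availability for a in availability]
    return 1.0 - 0.5 * sum(abs(pi - qi) for pi, qi in zip(p, q))


if __name__ == "__main__":
    even = proportional_similarity([1, 1, 1, 1], [1, 1, 1, 1])
    narrow = proportional_similarity([4, 0, 0, 0], [1, 1, 1, 1])
    print(f"evenly distributed population: {even:.2f}")
    print(f"population confined to one sample: {narrow:.2f}")
```

Under this form, a population distributed exactly like the reference scores 1, and one confined to a single sample scores that sample's share of the reference.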
Promiscuous and genome-wide recombination underlies the sequence-discrete species of the SAR11 lineage in the deep ocean

https://doi.org/10.1093/ismejo/wraf072

Zhao, Jianshu; Pachiadaki, Maria; Conrad, Roth E.; Hatt, Janet K.; Bristow, Laura A.; Rodriguez-R, Luis M.; Rossello-Mora, Ramon; Stewart, Frank J.; Konstantinidis, Konstantinos T. (April 2025, The ISME Journal)

Abstract Surveys of microbial communities (metagenomics) or isolate genomes have revealed sequence-discrete species. That is, members of the same species show >95% average nucleotide identity (ANI) of shared genes among themselves vs. <83% ANI to members of other species while genome pairs showing between 83% and 95% ANI are comparatively rare. In these surveys, aquatic bacteria of the ubiquitous SAR11 clade (Class Alphaproteobacteria) are an outlier and often do not exhibit discrete species boundaries, suggesting the potential for alternate modes of genetic differentiation. To explore evolution in SAR11, we analyzed high-quality, single-cell amplified genomes, and companion metagenomes from an oxygen minimum zone in the Eastern Tropical Pacific Ocean, where the SAR11 make up ~20% of the total microbial community. Our results show that SAR11 do form several sequence-discrete species, but their ANI range of discreteness is shifted to lower identities between 86% and 91%, with intra-species ANI ranging between 91% and 100%. Measuring recent gene exchange among these genomes based on a recently developed methodology revealed higher frequency of homologous recombination within compared to between species that affects sequence evolution at least twice as much as diversifying point mutation across the genome. Recombination in SAR11 appears to be more promiscuous compared to other prokaryotic species, likely due to the deletion of universal genes involved in the mismatch repair, and has facilitated the spread of adaptive mutations within the species (gene sweeps), further promoting the high intraspecies diversity observed. Collectively, these results implicate rampant, genome-wide homologous recombination as the mechanism of cohesion for distinct SAR11 species.
GSearch: ultra-fast and scalable genome search by combining K-mer hashing with hierarchical navigable small world graphs

https://doi.org/10.1093/nar/gkae609

Zhao, Jianshu; Both, Jean Pierre; Rodriguez-R, Luis M.; Konstantinidis, Konstantinos T. (July 2024, Nucleic Acids Research)

Abstract Genome search and/or classification typically involves finding the best-match database (reference) genomes and has become increasingly challenging due to the growing number of available database genomes and the fact that traditional methods do not scale well with large databases. By combining k-mer hashing-based probabilistic data structures (i.e. ProbMinHash, SuperMinHash, Densified MinHash and SetSketch) to estimate genomic distance, with a graph based nearest neighbor search algorithm (Hierarchical Navigable Small World Graphs, or HNSW), we created a new data structure and developed an associated computer program, GSearch, that is orders of magnitude faster than alternative tools while maintaining high accuracy and low memory usage. For example, GSearch can search 8000 query genomes against all available microbial or viral genomes for their best matches (n = ∼318 000 or ∼3 000 000, respectively) within a few minutes on a personal laptop, using ∼6 GB of memory (2.5 GB via SetSketch). Notably, GSearch has an O(log(N)) time complexity and will scale well with billions of genomes based on a database splitting strategy. Further, GSearch implements a three-step search strategy depending on the degree of novelty of the query genomes to maximize specificity and sensitivity. Therefore, GSearch solves a major bottleneck of microbiome studies that require genome search and/or classification.
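To make the k-mer hashing idea in this abstract concrete, here is a minimal sketch of classic MinHash used to estimate the Jaccard similarity between the k-mer sets of two sequences. It is not GSearch's implementation: GSearch uses the variants listed above (ProbMinHash, SuperMinHash, Densified MinHash, SetSketch) together with an HNSW graph index, neither of which is reproduced here. The function names, the k-mer length, and the number of hash functions are illustrative assumptions.

```python
import hashlib


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def minhash_signature(kmer_set, num_hashes=128):
    """Classic MinHash: keep the minimum hash value under each salted hash."""
    if not kmer_set:
        raise ValueError("k-mer set is empty; sequence shorter than k?")
    signature = []
    for seed in range(num_hashes):
        salt = seed.to_bytes(8, "little")  # a distinct salt acts as a distinct hash function
        signature.append(min(
            int.from_bytes(
                hashlib.blake2b(km.encode(), digest_size=8, salt=salt).digest(),
                "little",
            )
            for km in kmer_set
        ))
    return signature


def estimate_jaccard(sig_a, sig_b):
    """Fraction of signature positions that agree estimates Jaccard similarity."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


def exact_jaccard(set_a, set_b):
    """Exact Jaccard similarity, for comparison with the estimate."""
    return len(set_a & set_b) / len(set_a | set_b)


if __name__ == "__main__":
    # Toy sequences, invented for illustration only.
    a = kmers("ACGTACGTTGCAACGTTAGCATCGATCGGATC", 5)
    b = kmers("ACGTACGTTGCAACGTTAGCTTCGATCGGATC", 5)
    est = estimate_jaccard(minhash_signature(a), minhash_signature(b))
    print(f"exact={exact_jaccard(a, b):.3f} estimated={est:.3f}")
```

The estimate converges on the exact value as the number of hash functions grows; the signatures are what a nearest-neighbor index would compare instead of the full k-mer sets.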
Global diversity and distribution of antibiotic resistance genes in human wastewater treatment systems

https://doi.org/10.1038/s41467-025-59019-3

Zhu, Congmin; Wu, Linwei; Ning, Daliang; Tian, Renmao; Gao, Shuhong; Zhang, Bing; Zhao, Jianshu; Zhang, Ya; Xiao, Naijia; Wang, Yajiao; et al. (April 2025, Nature Communications)
