skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Award ID contains: 2125065

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract MotivationHigh-throughput sequencing (HTS) is a modern sequencing technology used to profile microbiomes by sequencing thousands of short genomic fragments from the microorganisms within a given sample. This technology presents a unique opportunity for artificial intelligence to comprehend the underlying functional relationships of microbial communities. However, due to the unstructured nature of HTS data, nearly all computational models are limited to processing DNA sequences individually. This limitation causes them to miss out on key interactions between microorganisms, significantly hindering our understanding of how these interactions influence the microbial communities as a whole. Furthermore, most computational methods rely on post-processing of samples which could inadvertently introduce unintentional protocol-specific bias. ResultsAddressing these concerns, we present SetBERT, a robust pre-training methodology for creating generalized deep learning models for processing HTS data to produce contextualized embeddings and be fine-tuned for downstream tasks with explainable predictions. By leveraging sequence interactions, we show that SetBERT significantly outperforms other models in taxonomic classification with genus-level classification accuracy of 95%. Furthermore, we demonstrate that SetBERT is able to accurately explain its predictions autonomously by confirming the biological-relevance of taxa identified by the model. Availability and implementationAll source code is available at https://github.com/DLii-Research/setbert. SetBERT may be used through the q2-deepdna QIIME 2 plugin whose source code is available at https://github.com/DLii-Research/q2-deepdna. 
    more » « less
  2. Abstract The host microbiome is integral to metabolism, immune function, and pathogen resistance. Yet, reliance on relative abundance in microbiome studies introduces compositional biases that obscure ecological interpretation, while the absence of robust tools for absolute abundance quantification has limited biological discovery. Here, we apply absolute abundance profiling to uncover host-specific microbial patterns across herpetofauna orders that are masked in relative abundance data. Relative- and absolute abundance-derived bacterial and fungal microbiomes exhibit divergent profiles shaped by compositional bias and multifactorial effects. Absolute abundance identified key genera, Lactococcus, Parabacteroides, and Cetobacterium in salamanders, and Basidiobolus and Mortierella in lizards, turtles, snakes, and tortoises, that consistently emerged as core taxa, revealing host-associated patterns previously obscured by compositional constraints. In closely related Desmognathus species, where environmental and phylogenetic variation was minimized, absolute abundance enabled finer resolution of microbiome dynamics and significantly reduced false discovery rates. Absolute abundance-based network analyses further revealed distinct keystone taxa between the relative and absolute abundance datasets. Despite low redundancy, Basidiobolus exhibited high network betweenness, efficiency, and degree, suggesting its role as a key connector between microbial modules and a contributor to overall network robustness. This predicted structural role aligns with Burt’s structural hole theory, which suggests that nodes linking otherwise disconnected modules occupy influential network positions. These findings underscore the value of absolute abundance in resolving microbial dynamics and supporting meaningful interpretation of host-microbiome associations. This advance is made possible by DspikeIn, a flexible wet-lab and computational framework that enhances ecological resolution and cross-study comparability. 
    more » « less
  3. McMahon, Katherine (Ed.)
    This work significantly advances our understanding of biodiversity and microbial interactions in herptile microbiomes, the role that fungi play as a structural and functional members of herptile gut microbiomes, and the chemical functions that structure microbiome phenotypes. We also provide an important observational system of how the gut microbiome represents a unique environment that selects for novel metabolic functions through horizontal gene transfer between fungi and bacteria. Such studies are needed to better understand the complexity of gut microbiomes in nature and will inform conservation strategies for threatened species of herpetofauna. 
    more » « less