NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The Naïve Bayes classifier++ for metagenomic taxonomic classification—query evaluation

https://doi.org/10.1093/bioinformatics/btae743

Duan, Haozhe_Neil; Hearne, Gavin; Polikar, Robi; Rosen, Gail_L; Kendziorski, ed., Christina (December 2024, Bioinformatics)

Abstract MotivationThis study examines the query performance of the NBC++ (Incremental Naive Bayes Classifier) program for variations in canonicality, k-mer size, databases, and input sample data size. We demonstrate that both NBC++ and Kraken2 are influenced by database depth, with macro measures improving as depth increases. However, fully capturing the diversity of life, especially viruses, remains a challenge. ResultsNBC++ can competitively profile the superkingdom content of metagenomic samples using a small training database. NBC++ spends less time training and can use a fraction of the memory than Kraken2 but at the cost of long querying time. Major NBC++ enhancements include accommodating canonical k-mer storage (leading to significant storage savings) and adaptable and optimized memory allocation that accelerates query analysis and enables the software to be run on nearly any system. Additionally, the output now includes log-likelihood values for each training genome, providing users with valuable confidence information. Availability and implementationSource code and Dockerfile are available at http://github.com/EESI/Naive_Bayes.
more » « less
MetaMutationalSigs: comparison of mutational signature refitting results made easy

https://doi.org/10.1093/bioinformatics/btac091

Pandey, Palash; Arora, Sanjeevani; Rosen, Gail L.; Marschall, ed., Tobias (February 2022, Bioinformatics)

Abstract MotivationThe analysis of mutational signatures is becoming increasingly common in cancer genetics, with emerging implications in cancer evolution, classification, treatment decision and prognosis. Recently, several packages have been developed for mutational signature analysis, with each using different methodology and yielding significantly different results. Because of the non-trivial differences in tools’ refitting results, researchers may desire to survey and compare the available tools, in order to objectively evaluate the results for their specific research question, such as which mutational signatures are prevalent in different cancer types. ResultsDue to the need for effective comparison of refitting mutational signatures, we introduce a user-friendly software that can aggregate and visually present results from different refitting packages. Availability and implementationMetaMutationalSigs is implemented using R and python and is available for installation using Docker and available at: https://github.com/EESI/MetaMutationalSigs.
more » « less
Physiological and evolutionary contexts of a new symbiotic species from the nitrogen-recycling gut community of turtle ants

https://doi.org/10.1038/s41396-023-01490-1

Béchade, Benoît; Cabuslay, Christian_S; Hu, Yi; Mendonca, Caroll_M; Hassanpour, Bahareh; Lin, Jonathan_Y; Su, Yangzhou; Fiers, Valerie_J; Anandarajan, Dharman; Lu, Richard; et al (August 2023, The ISME Journal)

Abstract While genome sequencing has expanded our knowledge of symbiosis, role assignment within multi-species microbiomes remains challenging due to genomic redundancy and the uncertainties of in vivo impacts. We address such questions, here, for a specialized nitrogen (N) recycling microbiome of turtle ants, describing a new genus and species of gut symbiont—Ischyrobacter davidsoniae (Betaproteobacteria: Burkholderiales: Alcaligenaceae)—and its in vivo physiological context. A re-analysis of amplicon sequencing data, with precisely assigned Ischyrobacter reads, revealed a seemingly ubiquitous distribution across the turtle ant genus Cephalotes, suggesting ≥50 million years since domestication. Through new genome sequencing, we also show that divergent I. davidsoniae lineages are conserved in their uricolytic and urea-generating capacities. With phylogenetically refined definitions of Ischyrobacter and separately domesticated Burkholderiales symbionts, our FISH microscopy revealed a distinct niche for I. davidsoniae, with dense populations at the anterior ileum. Being positioned at the site of host N-waste delivery, in vivo metatranscriptomics and metabolomics further implicate I. davidsoniae within a symbiont-autonomous N-recycling pathway. While encoding much of this pathway, I. davidsoniae expressed only a subset of the requisite steps in mature adult workers, including the penultimate step deriving urea from allantoate. The remaining steps were expressed by other specialized gut symbionts. Collectively, this assemblage converts inosine, made from midgut symbionts, into urea and ammonia in the hindgut. With urea supporting host amino acid budgets and cuticle synthesis, and with the ancient nature of other active N-recyclers discovered here, I. davidsoniae emerges as a central player in a conserved and impactful, multipartite symbiosis.
more » « less
Enhancing nucleotide sequence representations in genomic analysis with contrastive optimization

https://doi.org/10.1038/s42003-025-07902-6

Refahi, Mohammadsaleh; Sokhansanj, Bahrad_A; Mell, Joshua_C; Brown, James_R; Yoo, Hyunwoo; Hearne, Gavin; Rosen, Gail_L (March 2025, Communications Biology)
Can Large Language Models Classify and Generate Antimicrobial Resistance Genes?

https://doi.org/10.18653/v1/2025.bionlp-1.21

Yoo, Hyunwoo; Shin, Haebin; Rosen, Gail (January 2025, Association for Computational Linguistics)

Full Text Available
Normalized Compression Distance for DNA Classification

https://doi.org/10.1145/3698587.3701490

Hearne, Gavin LA; Refahi, Mohammad S; Duan, Haozhe Neil; Brown, James R; Rosen, Gail L (November 2024, ACM)

Full Text Available
The Role and Applications of Artificial Intelligence in the Treatment of Chronic Pain

https://doi.org/10.1007/s11916-024-01264-0

Meier, Tiffany A; Refahi, Mohammad S; Hearne, Gavin; Restifo, Daniele S; Munoz-Acuna, Ricardo; Rosen, Gail L; Woloszynek, Stephen (August 2024, Current Pain and Headache Reports)

Full Text Available
Streamlining Computational Fragment-Based Drug Discovery through Evolutionary Optimization Informed by Ligand-Based Virtual Prescreening

https://doi.org/10.1021/acs.jcim.4c00234

Chandraghatgi, Rohan; Ji, Hai-Feng; Rosen, Gail_L; Sokhansanj, Bahrad_A (May 2024, Journal of Chemical Information and Modeling)
Fragment databases from screened ligands for drug discovery (FDSL-DD)

https://doi.org/10.1016/j.jmgm.2023.108669

Wilson, Jerica; Sokhansanj, Bahrad A.; Chong, Wei Chuen; Chandraghatgi, Rohan; Rosen, Gail L.; Ji, Hai-Feng (March 2024, Journal of Molecular Graphics and Modelling)

Full Text Available
Microbiome preterm birth DREAM challenge: Crowdsourcing machine learning approaches to advance preterm birth research

https://doi.org/10.1016/j.xcrm.2023.101350

Golob, Jonathan L; Oskotsky, Tomiko T; Tang, Alice S; Roldan, Alennie; Chung, Verena; Ha, Connie WY; Wong, Ronald J; Flynn, Kaitlin J; Parraga-Leo, Antonio; Wibrand, Camilla; et al (January 2024, Cell Reports Medicine)

Full Text Available

« Prev Next »

Search for: All records