skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: KARGA: Multi-platform Toolkit for k-mer-based Antibiotic Resistance Gene Analysis of High-throughput Sequencing Data
High-throughput sequencing is widely used for strain detection and characterization of antibiotic resistance in microbial metagenomic samples. Current analytical tools use curated antibiotic resistance gene (ARG) databases to classify individual sequencing reads or assembled contigs. However, identifying ARGs from raw read data can be time consuming (especially if assembly or alignment is required) and challenging, due to genome rearrangements and mutations. Here, we present the k-mer-based antibiotic gene resistance analyzer (KARGA), a multi-platform Java toolkit for identifying ARGs from metagenomic short read data. KARGA does not perform alignment; it uses an efficient double-lookup strategy, statistical filtering on false positives, and provides individual read classification as well as covering of the database resistome. On simulated data, KARGA’s antibiotic resistance class recall is 99.89% for error/mutation rates within 10%, and of 83.37% for error/mutation rates between 10% and 25%, while it is 99.92% on ARGs with rearrangements. On empirical data, KARGA provides higher hit score (≥1.5-fold) than AMRPlusPlus, DeepARG, and MetaMARC. KARGA has also faster runtimes than all other tools (2x faster than AMRPlusPlus, 7x than DeepARG, and over 100x than MetaMARC). KARGA is available under the MIT license at https://github.com/DataIntellSystLab/KARGA.  more » « less
Award ID(s):
2013998
PAR ID:
10301296
Author(s) / Creator(s):
;
Date Published:
Journal Name:
021 IEEE EMBS International Conference on Biomedical and Health Informatics (BHI)
Page Range / eLocation ID:
1 to 4
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Antibiotic resistance (AR) presents a global health challenge, necessitating an improved understanding of the ecology, evolution, and dissemination of antibiotic resistance genes (ARGs). Several tools, databases, and algorithms are now available to facilitate the identification of ARGs in metagenomic sequencing data; however, direct annotation of short-read data provides limited contextual information. Knowledge of whether an ARG is carried in the chromosome or on a specific mobile genetic element (MGE) is critical to understanding mobility, persistence, and potential for co-selection. Here we developed ARGContextProfiler, a pipeline designed to extract and visualize ARG genomic contexts. By leveraging the assembly graph for genomic neighborhood extraction and validating contexts through read mapping, ARGContextProfiler minimizes chimeric errors that are a common artifact of assembly outputs. Testing on real, synthetic, and semi-synthetic data, including long-read sequencing data from environmental samples, demonstrated that ARGContextProfiler offers superior accuracy, precision, and sensitivity compared to conventional assembly-based methods. ARGContextProfiler thus provides a powerful tool for uncovering the genomic context of ARGs in metagenomic sequencing data, which can be of value to both fundamental and applied research aimed at understanding and stemming the spread of AR. The source code of ARGContextProfiler is publicly available atGitHub. 
    more » « less
  2. Characterization of antibiotic resistance genes (ARGs) from high-throughput sequencing data of metagenomics and cultured bacterial samples is a challenging task, with the need to account for both computational (e.g., string algorithms) and biological (e.g., gene transfers, rearrangements) aspects. Curated ARG databases exist together with assorted ARG classification approaches (e.g., database alignment, machine learning). Besides ARGs that naturally occur in bacterial strains or are acquired through mobile elements, there are chromosomal genes that can render a bacterium resistant to antibiotics through point mutations, i.e., ARG variants (ARGVs). While ARG repositories also collect ARGVs, there are only a few tools that are able to identify ARGVs from metagenomics and high throughput sequencing data, with a number of limitations (e.g., pre-assembly,a posterioriverification of mutations, or specification of species). In this work we present thek-mer, i.e., strings of fixed lengthk, ARGV analyzer – KARGVA – an open-source, multi-platform tool that provides: (i) anad hoc, large ARGV database derived from multiple sources; (ii) input capability for various types of high-throughput sequencing data; (iii) a three-way, hash-based,k-mer search setup to process data efficiently, linkingk-mers to ARGVs,k-mers to point mutations, and ARGVs tok-mers, respectively; (iv) a statistical filter on sequence classification to reduce type I and II errors. On semi-synthetic data, KARGVA provides very high accuracy even in presence of high sequencing errors or mutations (99.2 and 86.6% accuracy within 1 and 5% base change rates, respectively), and genome rearrangements (98.2% accuracy), with robust performance onad hocfalse positive sets. On data from the worldwide MetaSUB consortium, comprising 3,700+ metagenomics experiments, KARGVA identifies more ARGVs than Resistance Gene Identifier (4.8x) and PointFinder (6.8x), yet all predictions are below the expected false positive estimates. The prevalence of ARGVs is correlated to ARGs but ecological characteristics do not explain well ARGV variance. KARGVA is publicly available athttps://github.com/DataIntellSystLab/KARGVAunder MIT license. 
    more » « less
  3. Elkins, Christopher A. (Ed.)
    ABSTRACT Low- and middle-income countries (LMICs) bear the largest mortality burden of antibiotic-resistant infections. Small-scale animal production and free-roaming domestic animals are common in many LMICs, yet data on zoonotic exchange of gut bacteria and antibiotic resistance genes (ARGs) in low-income communities are sparse. Differences between rural and urban communities with regard to population density, antibiotic use, and cohabitation with animals likely influence the frequency of transmission of gut bacterial communities and ARGs between humans and animals. Here, we determined the similarity in gut microbiomes, using 16S rRNA gene amplicon sequencing, and resistomes, using long-read metagenomics, between humans, chickens, and goats in a rural community compared to an urban community in Bangladesh. Gut microbiomes were more similar between humans and chickens in the rural (where cohabitation is more common) than the urban community, but there was no difference for humans and goats in the rural versus the urban community. Human and goat resistomes were more similar in the urban community, and ARG abundance was higher in urban animals than rural animals. We identified substantial overlap of ARG alleles in humans and animals in both settings. Humans and chickens had more overlapping ARG alleles than humans and goats. All fecal hosts from the urban community and rural humans carried ARGs on chromosomal contigs classified as potentially pathogenic bacteria, including Escherichia coli , Campylobacter jejuni , Clostridioides difficile , and Klebsiella pneumoniae . These findings provide insight into the breadth of ARGs circulating within human and animal populations in a rural compared to urban community in Bangladesh. IMPORTANCE While the development of antibiotic resistance in animal gut microbiomes and subsequent transmission to humans has been demonstrated in intensive farming environments and high-income countries, evidence of zoonotic exchange of antibiotic resistance in LMIC communities is lacking. This research provides genomic evidence of overlap of antibiotic resistance genes between humans and animals, especially in urban communities, and highlights chickens as important reservoirs of antibiotic resistance. Chicken and human gut microbiomes were more similar in rural Bangladesh, where cohabitation is more common. Incorporation of long-read metagenomics enabled characterization of bacterial hosts of resistance genes, which has not been possible in previous culture-independent studies using only short-read sequencing. These findings highlight the importance of developing strategies for combatting antibiotic resistance that account for chickens being reservoirs of ARGs in community environments, especially in urban areas. 
    more » « less
  4. Antibiotic resistance is a continually rising threat to global health. A primary driver of the evolution of new strains of resistant pathogens is the horizontal gene transfer (HGT) of antibiotic resistance genes (ARGs). However, identifying and quantifying ARGs subject to HGT remains a significant challenge. Here, we introduce HT-ARGfinder (horizontally transferred ARG finder), a pipeline that detects and enumerates horizontally transferred ARGs in metagenomic data while also estimating the directionality of transfer. To demonstrate the pipeline, we applied it to an array of publicly-available wastewater metagenomes, including hospital sewage. We compare the horizontally transferred ARGs detected across various sample types and estimate their directionality of transfer among donors and recipients. This study introduces a comprehensive tool to track mobile ARGs in wastewater and other aquatic environments. 
    more » « less
  5. null (Ed.)
    Wastewater treatment plants (WWTPs) receive a confluence of sewage containing antimicrobials, antibiotic resistant bacteria, antibiotic resistance genes (ARGs), and pathogens and thus are a key point of interest for antibiotic resistance surveillance. WWTP monitoring has the potential to inform with respect to the antibiotic resistance status of the community served as well as the potential for ARGs to escape treatment. However, there is lack of agreement regarding suitable sampling frequencies and monitoring targets to facilitate comparison within and among individual WWTPs. The objective of this study was to comprehensively evaluate patterns in metagenomic-derived indicators of antibiotic resistance through various stages of treatment at a conventional WWTP for the purpose of informing local monitoring approaches that are also informative for global comparison. Relative abundance of total ARGs decreased by ∼50% from the influent to the effluent, with each sampling location defined by a unique resistome (i.e., total ARG) composition. However, 90% of the ARGs found in the effluent were also detected in the influent, while the effluent ARG-pathogen taxonomic linkage patterns identified in assembled metagenomes were more similar to patterns in regional clinical surveillance data than the patterns identified in the influent. Analysis of core and discriminatory resistomes and general ARG trends across the eight sampling events (i.e., tendency to be removed, increase, decrease, or be found in the effluent only), along with quantification of ARGs of clinical concern, aided in identifying candidate ARGs for surveillance. Relative resistome risk characterization further provided a comprehensive metric for predicting the relative mobility of ARGs and likelihood of being carried in pathogens and can help to prioritize where to focus future monitoring and mitigation. Most antibiotics that were subject to regional resistance testing were also found in the WWTP, with the total antibiotic load decreasing by ∼40–50%, but no strong correlations were found between antibiotics and corresponding ARGs. Overall, this study provides insight into how metagenomic data can be collected and analyzed for surveillance of antibiotic resistance at WWTPs, suggesting that effluent is a beneficial monitoring point with relevance both to the local clinical condition and for assessing efficacy of wastewater treatment in reducing risk of disseminating antibiotic resistance. 
    more » « less