skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


This content will become publicly available on February 1, 2026

Title: Rapid whole genome characterization of antimicrobial-resistant pathogens using long-read sequencing to identify potential healthcare transmission
Abstract Objective:Whole genome sequencing (WGS) can help identify transmission of pathogens causing healthcare-associated infections (HAIs). However, the current gold standard of short-read, Illumina-based WGS is labor and time intensive. Given recent improvements in long-read Oxford Nanopore Technologies (ONT) sequencing, we sought to establish a low resource approach providing accurate WGS-pathogen comparison within a time frame allowing for infection prevention and control (IPC) interventions. Methods:WGS was prospectively performed on pathogens at increased risk of potential healthcare transmission using the ONT MinION sequencer with R10.4.1 flow cells and Dorado basecaller. Potential transmission was assessed via Ridom SeqSphere+ for core genome multilocus sequence typing and MINTyper for reference-based core genome single nucleotide polymorphisms using previously published cutoff values. The accuracy of our ONT pipeline was determined relative to Illumina. Results:Over a six-month period, 242 bacterial isolates from 216 patients were sequenced by a single operator. Compared to the Illumina gold standard, our ONT pipeline achieved a mean identity score of Q60 for assembled genomes, even with a coverage rate as low as 40×. The mean time from initiating DNA extraction to complete analysis was 2 days (IQR 2–3.25 days). We identified five potential transmission clusters comprising 21 isolates (8.7% of sequenced strains). Integrating ONT with epidemiological data, >70% (15/21) of putative transmission cluster isolates originated from patients with potential healthcare transmission links. Conclusions:Via a stand-alone ONT pipeline, we detected potentially transmitted HAI pathogens rapidly and accurately, aligning closely with epidemiological data. Our low-resource method has the potential to assist in IPC efforts.  more » « less
Award ID(s):
2239114
PAR ID:
10586628
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ; ; ;
Publisher / Repository:
Cambridge University Press
Date Published:
Journal Name:
Infection Control & Hospital Epidemiology
Volume:
46
Issue:
2
ISSN:
0899-823X
Page Range / eLocation ID:
129 to 135
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. IntroductionBacteria are frequently isolated from surfaces in cleanrooms, where astromaterials are curated, at NASA’s Lyndon B. Johnson Space Center (JSC).Bacillusspecies are of particular interest because endospores can endure extreme conditions. Current monitoring programs at JSC rely on culturing microbes from swabs of surfaces followed by identification by 16S rRNA sequencing and the VITEK 2 Compact bacterial identification system. These methods have limited power to resolveBacillusspecies. Whole genome sequencing (WGS) is the current standard for bacterial identification but is expensive and time-consuming. Matrix-assisted laser desorption - time of flight mass spectrometry (MALDI-TOF MS), provides a rapid, low-cost, method of identifying bacterial isolates and has a higher resolution than 16S rRNA sequencing, particularly forBacillusspecies; however, few studies have compared this method to WGS for identification ofBacillusspecies isolated from cleanrooms. MethodsTo address this, we selected 15 isolates for analysis with WGS and MALDI-TOF MS. Hybrid next-generation (Illumina) and 3rd-generation (nanopore) sequencing were used to draft genomes. Mass spectra, generated with MALDI-TOF MS, were processed with custom scripts to identify clusters of closely related isolates. ResultsMALDI-TOF MS and WGS identified 13/15 and 9/14 at the species level, respectively, and clusters of species generated from MALDI-TOF MS showed good agreement, in terms of congruence of partitioning, with phylotypes generated with WGS. Pairs of strains that were > 94% similar to each other, in terms of average amino acid identity (AAI) predicted by WGS, consistently showed cosine similarities of mass spectra >0.8. The only discordance was for a pair of isolates that were classified asPaenibacillusspecies. This pair showed relatively high similarity (0.85) in terms of MALDI-TOF MS but only 85% similarity in terms of AAI. In addition, some strains isolated from cleanrooms at the JSC appeared closely related to strains isolated from spacecraft assembly cleanrooms. DiscussionSince MALDI-TOF MS costs less than whole genome sequencing and offers a throughput of hundreds of isolates per hour, this approach appears to offer a cost-efficient option for identifyingBacillusspecies, and related microbes, isolated during routine monitoring of cleanrooms and similar built environments. 
    more » « less
  2. Abstract Changes in telomere length are increasingly used to indicate species' response to environmental stress across diverse taxa. Despite this broad use, few studies have explored telomere length in plants. Thus, evaluation of new approaches for measuring telomeres in plants is needed. Rapid advances in sequencing approaches and bioinformatic tools now allow estimation of telomere content from whole‐genome sequencing (WGS) data, a proxy for telomere length. While telomere content has been quantified extensively using quantitative polymerase chain reaction (qPCR) and WGS in humans, no study to date has compared the effectiveness of WGS in estimating telomere length in plants relative to qPCR approaches. In this study, we use 100Populusclones re‐sequenced using short‐read Illumina sequencing to quantify telomere length comparing three different bioinformatic approaches (Computel, K‐seek and TRIP) in addition to qPCR. Overall, telomere length estimates varied across different bioinformatic approaches, but were highly correlated across methods for individual genotypes. A positive correlation was observed between WGS estimates and qPCR, however, Computel estimates exhibited the greatest correlation. Computel incorporates genome coverage into telomere length calculations, suggesting that genome coverage is likely important to telomere length quantification when using WGS data. Overall, telomere estimates from WGS provided greater precision and accuracy of telomere length estimates relative to qPCR. The findings suggest WGS is a promising approach for assessing telomere length and, as the field of telomere ecology evolves, may provide added value to assaying response to biotic and abiotic environments for plants needed to accelerate plant breeding and conservation management. 
    more » « less
  3. Abstract Although plastid genome (plastome) structure is highly conserved across most seed plants, investigations during the past two decades have revealed several disparately related lineages that experienced substantial rearrangements. Most plastomes contain a large inverted repeat and two single‐copy regions, and a few dispersed repeats; however, the plastomes of some taxa harbour long repeat sequences (>300 bp). These long repeats make it challenging to assemble complete plastomes using short‐read data, leading to misassemblies and consensus sequences with spurious rearrangements. Single‐molecule, long‐read sequencing has the potential to overcome these challenges, yet there is no consensus on the most effective method for accurately assembling plastomes using long‐read data. We generated a pipeline,plastidGenomeAssemblyUsingLong‐read data (ptGAUL), to address the problem of plastome assembly using long‐read data from Oxford Nanopore Technologies (ONT) or Pacific Biosciences platforms. We demonstrated the efficacy of the ptGAUL pipeline using 16 published long‐read data sets. We showed that ptGAUL quickly produces accurate and unbiased assemblies using only ~50× coverage of plastome data. Additionally, we deployed ptGAUL to assemble four newJuncus(Juncaceae) plastomes using ONT long reads. Our results revealed many long repeats and rearrangements inJuncusplastomes compared with basal lineages of Poales. The ptGAUL pipeline is available on GitHub:https://github.com/Bean061/ptgaul. 
    more » « less
  4. High-throughput short-read sequencing has taken on a central role in research and diagnostics. Hundreds of different assays take advantage of Illumina short-read sequencers, the predominant short-read sequencing technology available today. Although other short-read sequencing technologies exist, the ubiquity of Illumina sequencers in sequencing core facilities and the high capital costs of these technologies have limited their adoption. Among a new generation of sequencing technologies, Oxford Nanopore Technologies (ONT) holds a unique position because the ONT MinION, an error-prone long-read sequencer, is associated with little to no capital cost. Here we show that we can make short-read Illumina libraries compatible with the ONT MinION by using the rolling circle to concatemeric consensus (R2C2) method to circularize and amplify the short library molecules. This results in longer DNA molecules containing tandem repeats of the original short library molecules. This longer DNA is ideally suited for the ONT MinION, and after sequencing, the tandem repeats in the resulting raw reads can be converted into high-accuracy consensus reads with similar error rates to that of the Illumina MiSeq. We highlight this capability by producing and benchmarking RNA-seq, ChIP-seq, and regular and target-enriched Tn5 libraries. We also explore the use of this approach for rapid evaluation of sequencing library metrics by implementing a real-time analysis workflow. 
    more » « less
  5. Control of hospital-associated Enterococcus faecium infection is a strenuous task due to the difficulty of identifying transmission routes and the persistence of this nosocomial pathogen despite the implementation of infection control measures that have been successful with other important nosocomial pathogens. This study provides a comprehensive analysis of over 100 E. faecium isolates collected from 66 cancer patients at the University of Arkansas for Medical Sciences (UAMS) between June 2018 and May 2019. In the top-down approach used in this study, we employed, in addition to the 106 E. faecium UAMS isolates, a filtered set of 2,167 E. faecium strains from the GenBank database to assess the current population structure of E. faecium species and, consequently, to identify the lineages associated with our clinical isolates. We then evaluated the antibiotic resistance and virulence profiles of hospital-associated strains from the species pool, focusing on antibiotics of last resort, to establish an updated classification of high-risk and multidrug-resistant nosocomial clones. Further investigation of the clinical isolates collected from UAMS patients using whole-genome sequencing analytical methodologies (core genome multilocus sequence typing [cgMLST], core single nucleotide polymorphism [coreSNP] analysis, and phylogenomics), with the addition of patient epidemiological data, revealed a polyclonal outbreak of three sequence types occurring simultaneously in different patient wards. The integration of genomic and epidemiological data collected from the patients increased our understanding of the relationships and transmission dynamics of the E. faecium isolates. Our study provides new insights into genomic surveillance of E. faecium to assist in monitoring and further limiting the spread of multidrug-resistant E. faecium. 
    more » « less