Identifying Taxonomic Units in Metagenomic DNA Streams

Zheng, V; Sariyuce, AE; Zola, J

doi:10.1101/2020.08.21.261313

Citation Details

Identifying Taxonomic Units in Metagenomic DNA Streams

With the emergence of portable DNA sequencers, such as Oxford Nanopore Technology MinION, metagenomic DNA sequencing can be performed in real-time and directly in the field. However, because metagenomic DNA analysis is computationally and memory intensive, and the current methods are designed for batch processing, the current metagenomic tools are not well suited for mobile devices. In this paper, we propose a new memory-efficient method to identify Operational Taxonomic Units (OTUs) in metagenomic DNA streams. Our method is based on finding connected components in overlap graphs constructed over a real-time stream of long DNA reads as produced by MinION platform. We propose an efficient algorithm to maintain connected components when an overlap graph is streamed, and show how redundant information can be removed from the stream by transitive closures. Through experiments on simulated and real-world metagenomic data, we demonstrate that the resulting solution is able to recover OTUs with high precision while remaining suitable for mobile computing devices. more »

Award ID(s):: 1910193

PAR ID:: 10195903

Author(s) / Creator(s):: Zheng, V; Sariyuce, AE; Zola, J

Date Published:: 2020-01-01

Journal Name:: BIOKDD - 19th International Workshop on Data Mining in Bioinformatics

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1101/2020.08.21.261313

More Like this