Practical dynamic de Bruijn graphs

Crawford, Victoria G.; Kuhnle, Alan; Boucher, Christina; Chikhi, Rayan; Gagie, Travis (ORCID:000000033689327X); Hancock, ed., John

doi:10.1093/bioinformatics/bty500

Citation Details

Practical dynamic de Bruijn graphs

Abstract Motivation

The de Bruijn graph is fundamental to the analysis of next generation sequencing data and so, as datasets of DNA reads grow rapidly, it becomes more important to represent de Bruijn graphs compactly while still supporting fast assembly. Previous implementations of compact de Bruijn graphs have not supported node or edge deletion, however, which is important for pruning spurious elements from the graph.

Results

Belazzougui et al. (2016b) recently proposed a compact and fully dynamic representation, which supports exact membership queries and insertions and deletions of both nodes and edges. In this paper, we give a practical implementation of their data structure, supporting exact membership queries and fully dynamic edge operations, as well as limited support for dynamic node operations. We demonstrate experimentally that its performance is comparable to that of state-of-the-art implementations based on Bloom filters.

Availability and implementation

Our source-code is publicly available at https://github.com/csirac/dynamicDBG under an open-source license.

NSF-PAR ID:: 10393427

Author(s) / Creator(s):: Crawford, Victoria G.; Kuhnle, Alan; Boucher, Christina; Chikhi, Rayan; Gagie, Travis; Hancock, ed., John

Publisher / Repository:: Oxford University Press

Date Published:: 2018-06-22

Journal Name:: Bioinformatics

Volume:: 34

Issue:: 24

ISSN:: 1367-4803

Page Range / eLocation ID:: p. 4189-4195

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1093/bioinformatics/bty500

More Like this