Persistent Homology on Streaming Data

Moitra, Anindya; Malott, Nicholas O.; Wilsey, Philip A.

doi:10.1109/ICDMW51313.2020.00090

This paper introduces a framework for computing persistent homology, a principal tool in Topological Data Analysis, on potentially unbounded and evolving data streams. The framework is organized into online and offline components. The online component maintains a summary of the data that preserves the topological structure of the stream. The offline component computes persistence intervals from the data captured by the summary. The framework is applied to detect horizontal or reticulate genomic exchanges that occur during the evolution of species and cannot be identified by phylogenetic inference or traditional data mining. The method effectively detects reticulate evolution that occurs through reassortment and recombination in large streams of genomic sequences of influenza and HIV viruses.
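The online/offline split described in the abstract can be illustrated with a minimal sketch. This record does not specify the paper's actual summary structure or persistence algorithm, so everything below is an assumption made for illustration: the hypothetical OnlineSummary class keeps one running centroid per neighborhood of a fixed radius, and the hypothetical h0_intervals function computes only 0-dimensional persistence intervals (connected components) of a Vietoris-Rips filtration over those centroids, using a union-find pass over edges sorted by length. Death values are reported at the pairwise distance at which two components merge.

```python
import math
from itertools import combinations


class OnlineSummary:
    """Hypothetical online component: a bounded set of running centroids.

    This is an assumed stand-in for the paper's summary, not its method.
    """

    def __init__(self, radius):
        self.radius = radius   # assumed absorption radius for a centroid
        self.centroids = []    # one coordinate tuple per centroid
        self.counts = []       # number of stream points absorbed by each

    def insert(self, point):
        # Find the nearest existing centroid.
        best, best_d = None, math.inf
        for i, c in enumerate(self.centroids):
            d = math.dist(c, point)
            if d < best_d:
                best, best_d = i, d
        if best is not None and best_d <= self.radius:
            # Absorb the point: incremental mean update.
            n = self.counts[best] + 1
            old = self.centroids[best]
            self.centroids[best] = tuple(
                ci + (pi - ci) / n for ci, pi in zip(old, point)
            )
            self.counts[best] = n
        else:
            # Too far from every centroid: start a new one.
            self.centroids.append(tuple(point))
            self.counts.append(1)


def h0_intervals(points):
    """Hypothetical offline component: 0-dimensional persistence intervals.

    Every component is born at scale 0; a component dies at the length of
    the edge that first merges it into another one.
    """
    n = len(points)
    parent = list(range(n))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    edges = sorted(
        (math.dist(points[i], points[j]), i, j)
        for i, j in combinations(range(n), 2)
    )
    intervals = []
    for d, i, j in edges:
        ri, rj = find(i), find(j)
        if ri != rj:
            parent[ri] = rj
            intervals.append((0.0, d))
    if n:
        # One component never dies.
        intervals.append((0.0, math.inf))
    return intervals
```

In this sketch the stream is consumed one point at a time through insert, and h0_intervals is called on summary.centroids whenever intervals are needed. Higher-dimensional intervals, which matter for detecting reticulate events, would need a full persistent homology library and are outside the scope of this illustration.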

Award ID(s): 1440420; 1909096

PAR ID: 10350969

Author(s) / Creator(s): Moitra, Anindya; Malott, Nicholas O.; Wilsey, Philip A.

Date Published: 2020-11-01

Journal Name: 8th Workshop on Data Mining in Biomedical Informatics and Healthcare

Page Range / eLocation ID: 636 to 643

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.1109/ICDMW51313.2020.00090
