A Clustering-based framework for Classifying Data Streams

Yan, Xuyang; Homaifar, Abdollah; Sarkar, Mrinmoy; Girma, Abenezer; Tunstel, Edward

doi:10.24963/ijcai.2021/448

Citation Details

A Clustering-based framework for Classifying Data Streams

The non-stationary nature of data streams strongly challenges traditional machine learning techniques. Although some solutions have been proposed to extend traditional machine learning techniques for handling data streams, these approaches either require an initial label set or rely on specialized design parameters. The overlap among classes and the labeling of data streams constitute other major challenges for classifying data streams. In this paper, we proposed a clustering-based data stream classification framework to handle non-stationary data streams without utilizing an initial label set. A density-based stream clustering procedure is used to capture novel concepts with a dynamic threshold and an effective active label querying strategy is introduced to continuously learn the new concepts from the data streams. The sub-cluster structure of each cluster is explored to handle the overlap among classes. Experimental results and quantitative comparison studies reveal that the proposed method provides statistically better or comparable performance than the existing methods. more »

Award ID(s):: 2000320

PAR ID:: 10328297

Author(s) / Creator(s):: Yan, Xuyang; Homaifar, Abdollah; Sarkar, Mrinmoy; Girma, Abenezer; Tunstel, Edward

Date Published:: 2021-07-01

Journal Name:: International Joint Conferences on Artificial Intelligence Organization

Page Range / eLocation ID:: 3257 to 3263

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript
Conference Paper:
https://doi.org/10.24963/ijcai.2021/448

More Like this