Semi-supervised Drifted Stream Learning with Short Lookback

Ren, Weijieying; Wang, Pengyang; Li, Xiaolin; Hughes, Charles E.; Fu, Yanjie

doi:10.1145/3534678.3539297

Citation Details

In many scenarios, 1) data streams are generated in real time; 2) labeled data are expensive, and only limited labels are available at the beginning; 3) real-world data are not always i.i.d., and the data drift gradually over time; and 4) storage for historical streams is limited. This learning setting limits the applicability and availability of many machine learning (ML) algorithms. We generalize the learning task under this setting as the semi-supervised drifted stream learning with short lookback (SDSL) problem. SDSL poses two under-addressed challenges for existing methods in semi-supervised learning and continuous learning: 1) robust pseudo-labeling under gradual shifts and 2) anti-forgetting adaptation with short lookback. To tackle these challenges, we propose a principled and generic generation-replay framework. To achieve robust pseudo-labeling, we develop a novel pseudo-label classification model that leverages supervised knowledge of previously labeled data, unsupervised knowledge of new data, and structure knowledge of invariant label semantics. To achieve adaptive anti-forgetting model replay, we view the anti-forgetting adaptation task as a flat region search problem. We propose a novel minimax game-based replay objective function to solve the flat region search problem and develop an effective optimization solver. Experimental results demonstrate the effectiveness of the proposed method.
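The setting described in the abstract (few initial labels, gradual drift, limited storage) can be illustrated with a toy sketch. This is not the authors' generation-replay framework or their minimax replay objective. It is only a minimal illustration, under simplifying assumptions, of confidence-thresholded pseudo-labeling combined with a bounded "short lookback" buffer. The names `CentroidModel`, `process_batch`, and `threshold`, and the 1-D two-class data, are all hypothetical.

```python
from collections import deque
import math


class CentroidModel:
    """Toy 1-D classifier (an assumption for illustration): one centroid per class."""

    def __init__(self, centroids):
        self.centroids = dict(centroids)  # label -> centroid position

    def predict_proba(self, x):
        # Softmax over negative squared distances to each class centroid.
        scores = {c: -(x - m) ** 2 for c, m in self.centroids.items()}
        top = max(scores.values())
        exp = {c: math.exp(s - top) for c, s in scores.items()}
        z = sum(exp.values())
        return {c: v / z for c, v in exp.items()}

    def fit(self, samples):
        # Recompute each centroid from the (x, label) pairs currently stored.
        for c in self.centroids:
            xs = [x for x, y in samples if y == c]
            if xs:
                self.centroids[c] = sum(xs) / len(xs)


def process_batch(model, buffer, batch, threshold=0.9):
    """Pseudo-label confident points, keep them in the bounded buffer, refit."""
    accepted = 0
    for x in batch:
        proba = model.predict_proba(x)
        label = max(proba, key=proba.get)
        if proba[label] >= threshold:
            buffer.append((x, label))  # a full deque silently drops its oldest item
            accepted += 1
    model.fit(list(buffer))
    return accepted


# Two seed labels stand in for the "limited labels at the beginning".
model = CentroidModel({0: 0.0, 1: 4.0})
buffer = deque([(0.0, 0), (4.0, 1)], maxlen=6)  # short lookback: at most 6 items

accepted_per_batch = []
for shift in (0.0, 0.5, 1.0):  # simulated gradual drift of the inputs
    batch = [x + shift for x in (0.0, 0.2, 4.0, 3.8, 2.0)]
    accepted_per_batch.append(process_batch(model, buffer, batch))
```

In the first batch, the point that lies exactly between the two centroids receives probability 0.5 for each class and is rejected by the threshold. Because the buffer is bounded, the original seed labels are eventually evicted, which is one way to see why pseudo-label errors and forgetting become the central difficulties in this setting.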

Award ID(s): 2114808

PAR ID: 10408453

Author(s) / Creator(s): Ren, Weijieying; Wang, Pengyang; Li, Xiaolin; Hughes, Charles E.; Fu, Yanjie

Editor(s): Aidong Zhang; Huzefa Rangwala

Date Published: 2022-08-14

Journal Name: KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Page Range / eLocation ID: 1504 to 1513

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.1145/3534678.3539297