Online k-means clustering on arbitrary data streams

Bhattacharjee, R.; Imola, J.; Moshkovitz, M.; Dasgupta, S.


We consider k-means clustering in an online setting where each new data point is assigned to its closest cluster center and incurs a loss equal to the squared distance to that center, after which the algorithm is allowed to update its centers. The goal over a data stream X is to achieve a total loss that is not too much larger than L(X, OPT), the best possible loss using k fixed centers in hindsight. We give the first algorithm to achieve polynomial space and time complexity in the online setting.
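The online protocol described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the update step below is the classical sequential (running-mean) k-means rule, swapped in purely as a placeholder, and the names `sq_dist`, `run_online`, and `fixed_centers_loss` are hypothetical. The sketch also assumes the initial centers are supplied by the caller. `fixed_centers_loss` evaluates the loss of one given set of fixed centers; it does not compute OPT.

```python
from typing import List, Sequence, Tuple

Point = Sequence[float]


def sq_dist(a: Point, b: Point) -> float:
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def run_online(stream: Sequence[Point],
               init_centers: Sequence[Point]) -> Tuple[float, List[List[float]]]:
    """Online protocol: assign, pay the loss, then update the centers.

    The update is a placeholder (sequential k-means), NOT the paper's method.
    """
    centers = [list(c) for c in init_centers]
    counts = [1] * len(centers)  # each initial center counts as one point
    total_loss = 0.0
    for x in stream:
        # 1. Assign the new point to its closest current center.
        j = min(range(len(centers)), key=lambda i: sq_dist(x, centers[i]))
        # 2. Incur the squared distance to that center as loss.
        total_loss += sq_dist(x, centers[j])
        # 3. Only now may the centers change (placeholder running-mean step).
        counts[j] += 1
        eta = 1.0 / counts[j]
        centers[j] = [c + eta * (xi - c) for c, xi in zip(centers[j], x)]
    return total_loss, centers


def fixed_centers_loss(stream: Sequence[Point],
                       centers: Sequence[Point]) -> float:
    """Loss of one fixed set of centers over the whole stream."""
    return sum(min(sq_dist(x, c) for c in centers) for x in stream)


if __name__ == "__main__":
    stream = [(0.0,), (10.0,), (1.0,), (9.0,)]
    online_loss, final_centers = run_online(stream, [(0.0,), (10.0,)])
    benchmark = fixed_centers_loss(stream, [(0.5,), (9.5,)])
    print(online_loss, benchmark, final_centers)
```

The key point of the design is the ordering inside the loop: the loss is charged against the centers as they stood before the point arrived, which is what separates the online objective from the offline loss of fixed centers chosen in hindsight.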

Award ID(s): 2211386

PAR ID: 10466919

Author(s) / Creator(s): Bhattacharjee, R.; Imola, J.; Moshkovitz, M.; Dasgupta, S.

Publisher / Repository: Proceedings of Machine Learning Research

Date Published: 2023-04-01

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper: The DOI is not currently available.
