Efficient Centroid-Linkage Clustering

Bateni, MH; Dhulipala, L; Fletcher, W; Gowda, K; Hershkowitz, DE; Jayaram, R; Lacki, J

Citation Details

We give an algorithm for Centroid-Linkage Hierarchical Agglomerative Clustering (HAC), which computes a $$c$$-approximate clustering in roughly $$n^{1+O(1/c^2)}$$ time. We obtain our result by combining a new Centroid-Linkage HAC algorithm with a novel fully dynamic data structure for nearest neighbor search which works under adaptive updates. We also evaluate our algorithm empirically. By leveraging a state-of-the-art nearest-neighbor search library, we obtain a fast and accurate Centroid-Linkage HAC algorithm. Compared to an existing state-of-the-art exact baseline, our implementation maintains the clustering quality while delivering up to a $$36\times$$ speedup due to performing fewer distance comparisons. more »

Award ID(s):: 2403236

PAR ID:: 10627074

Author(s) / Creator(s):: Bateni, MH; Dhulipala, L; Fletcher, W; Gowda, K; Hershkowitz, DE; Jayaram, R; Lacki, J

Publisher / Repository:: NeurIPS 2024

Date Published:: 2024-12-02

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this