Scalable KNN Graph Construction for Heterogeneous Architectures

Ruys, William  (ORCID:000000015702022X); Ghafouri, Ali  (ORCID:0000000333454685); Chen, Chao  (ORCID:0000000253853651); Biros, George  (ORCID:0000000200333994)

doi:10.1145/3733610

Citation Details

Scalable KNN Graph Construction for Heterogeneous Architectures

Constructing k-nearest neighbor (kNN) graphs is a fundamental component in many machine learning and scientific computing applications. Despite its prevalence, efficiently building all-nearest-neighbor graphs at scale on distributed heterogeneous HPC systems remains challenging, especially for large sparse non-integer datasets. We introduce optimizations for algorithms based on forests of random projection trees. Our novel GPU kernels for batched, within leaf, exact searches achieve 1.18× speedup over sparse reference kernels with less peak memory, and up to 19× speedup over CPU for memory-intensive problems. Our library,PyRKNN, implements distributed randomized projection forests for approximate kNN search. Optimizations to reduce and hide communication overhead allow us to achieve 5× speedup, in per iteration performance, relative to GOFMM (another projection tree, MPI-based kNN library), for a 64M 128d dataset on 1,024 processes. On a single-node we achieve speedup over FAISS-GPU for dense datasets and up to 10× speedup over CPU-only libraries.PyRKNNuniquely supports distributed memory kNN graph construction for both dense and sparse coordinates on CPU and GPU accelerators. more »

Award ID(s):: 2204226

PAR ID:: 10642508

Author(s) / Creator(s):: Ruys, William ; Ghafouri, Ali ; Chen, Chao ; Biros, George

Publisher / Repository:: Association for Computing Machinery

Date Published:: 2025-09-30

Journal Name:: ACM Transactions on Parallel Computing

Volume:: 12

Issue:: 3

ISSN:: 2329-4949

Page Range / eLocation ID:: 1 to 35

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript
Journal Article:
https://doi.org/10.1145/3733610

More Like this