Bolt: Fast Inference for Random Forests

Romero, Eduardo; Stewart, Christopher; Li, Angela; Hale, Kyle; Morris, Nathaniel

doi:10.1145/3528535.3531519

Citation Details

Bolt: Fast Inference for Random Forests

Random forests use ensembles of decision trees to boost accuracy for machine learning tasks. However, large ensembles slow down inference on platforms that process each tree in an ensemble individually. We present Bolt, a platform that restructures whole random forests, not just individual trees, to speed up inference. Conceptually, Bolt maps every path in each tree to a lookup table which, if cache were large enough, would allow inference with just one memory access. When the size of the lookup table exceeds cache capacity, Bolt employs a novel combination of lossless compression, parameter selection, and bloom filters to shrink the table while preserving fast inference. We compared inference speed in Bolt to three state-of-the-art platforms: Python Scikit-Learn, Ranger, and Forest Packing. We evaluated these platforms using datasets with vision, natural language processing and categorical applications. We observed that on ensembles of shallow decision trees Bolt can run 2-14X faster than competing platforms and that Bolt's speedups persist as the number of decision trees in an ensemble increases. more »

Award ID(s):: 1763612 2028958

PAR ID:: 10382585

Author(s) / Creator(s):: Romero, Eduardo; Stewart, Christopher; Li, Angela; Hale, Kyle; Morris, Nathaniel

Date Published:: 2022-10-24

Journal Name:: Proceedings of the 23rd ACM/IFIP International Middleware Conference

Page Range / eLocation ID:: 94 to 106

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3528535.3531519

More Like this