Unbiased Measurement of Feature Importance in Tree-Based Methods

Zhou, Zhengze; Hooker, Giles

doi:10.1145/3429445

Citation Details

Unbiased Measurement of Feature Importance in Tree-Based Methods

We propose a modification that corrects for split-improvement variable importance measures in Random Forests and other tree-based methods. These methods have been shown to be biased towards increasing the importance of features with more potential splits. We show that by appropriately incorporating split-improvement as measured on out of sample data, this bias can be corrected yielding better summaries and screening tools. more »

Award ID(s):: 1712554

PAR ID:: 10298502

Author(s) / Creator(s):: Zhou, Zhengze; Hooker, Giles

Date Published:: 2021-04-01

Journal Name:: ACM Transactions on Knowledge Discovery from Data

Volume:: 15

Issue:: 2

ISSN:: 1556-4681

Page Range / eLocation ID:: 1 to 21

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1145/3429445

More Like this