Data Mining with Algorithmic Transparency

Yan Zhou, Yasmeen Alufaisan

doi:10.1007/978-3-319-93034-3_11

Citation Details

In this paper, we investigate whether decision trees can be used to interpret a black-box classifier without knowing the learning algorithm and the training data. Decision trees are known for their transparency and high expressivity. However, they are also notorious for their instability and tendency to grow excessively large. We present a classifier reverse engineering model that outputs a decision tree to interpret the black-box classifier. There are two major challenges. One is to build such a decision tree with controlled stability and size, and the other is that probing the black-box classifier is limited for security and economic reasons. Our model addresses the two issues by simultaneously minimizing sampling cost and classifier complexity. We present our empirical results on four real datasets, and demonstrate that our reverse engineering learning model can effectively approximate and simplify the black-box classifier.
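The general idea in the abstract, fitting a small decision tree to the answers of a classifier that can only be queried, can be illustrated with a minimal sketch. This is not the authors' model: the paper minimizes sampling cost and classifier complexity jointly, whereas the sketch below simply fixes a probe budget and a depth cap. The `black_box` function, the budget of 200 probes, the depth limit of 4, and the uniform sampling scheme are all hypothetical choices made for illustration.

```python
import random

def black_box(x):
    # Hypothetical stand-in for the opaque classifier; only its outputs are used.
    return 1 if x[0] + 0.5 * x[1] > 0.8 else 0

def gini(labels):
    # Gini impurity of a list of 0/1 labels.
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)

def majority(labels):
    # Most common label (ties go to class 1).
    return 1 if 2 * sum(labels) >= len(labels) else 0

def fit_tree(points, labels, max_depth):
    # Stop when the depth budget is spent or the node is pure.
    if max_depth == 0 or len(set(labels)) <= 1:
        return majority(labels)
    best = None
    for f in range(len(points[0])):
        for t in sorted({p[f] for p in points}):
            left = [y for p, y in zip(points, labels) if p[f] <= t]
            right = [y for p, y in zip(points, labels) if p[f] > t]
            if not left or not right:
                continue
            # Weighted impurity of the two children; lower is better.
            score = len(left) * gini(left) + len(right) * gini(right)
            if best is None or score < best[0]:
                best = (score, f, t)
    if best is None:
        return majority(labels)
    _, f, t = best
    lo = [(p, y) for p, y in zip(points, labels) if p[f] <= t]
    hi = [(p, y) for p, y in zip(points, labels) if p[f] > t]
    return (f, t,
            fit_tree([p for p, _ in lo], [y for _, y in lo], max_depth - 1),
            fit_tree([p for p, _ in hi], [y for _, y in hi], max_depth - 1))

def predict(tree, x):
    # Internal nodes are (feature, threshold, left, right); leaves are labels.
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if x[f] <= t else right
    return tree

def count_leaves(tree):
    if not isinstance(tree, tuple):
        return 1
    return count_leaves(tree[2]) + count_leaves(tree[3])

rng = random.Random(0)
budget = 200  # assumed cap on the number of black-box queries
probes = [(rng.random(), rng.random()) for _ in range(budget)]
answers = [black_box(p) for p in probes]
surrogate = fit_tree(probes, answers, max_depth=4)  # depth cap bounds tree size

# Fidelity: agreement between surrogate and black box on fresh points.
held_out = [(rng.random(), rng.random()) for _ in range(1000)]
fidelity = sum(predict(surrogate, x) == black_box(x) for x in held_out) / len(held_out)
print(f"leaves={count_leaves(surrogate)} fidelity={fidelity:.2f}")
```

Raising the probe budget or the depth cap would typically trade higher fidelity for more queries or a larger tree, which is the tension the abstract describes.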

Award ID(s): 1633331

PAR ID: 10073924

Author(s) / Creator(s): Yan Zhou, Yasmeen Alufaisan

Date Published: 2018-01-01

Journal Name: PAKDD 2018: Advances in Knowledge Discovery and Data Mining

Page Range / eLocation ID: 130-142

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.1007/978-3-319-93034-3_11
