Title: Equi-explanation Maps: Concise and Informative Global Summary Explanations
We propose equi-explanation maps to study the variation in model logic across the input space. These global, model-agnostic structures partition the hyper-space of explanation features into regions of similar model logic. Equi-explanation maps act as a concise summary of instance explanations and can give lay users an at-a-glance understanding of the basis on which the classifier makes its decisions. We thus propose the task of generating $\epsilon$-equi-explanation maps: a partitioning of the input space into subspaces such that the standard deviation of the explanation vectors in each subspace does not exceed $\epsilon$. We adapt existing local and subspace explainability techniques, such as LIME and MUSE, to generate equi-explanation maps on two binary classification datasets using four classification models, and we evaluate the quality of their partitioning. We find that these techniques produce a sub-optimal number of subspaces (making the maps harder to interpret) and incur considerable running time. We then propose E-map, a new divide-and-conquer algorithm for producing $\epsilon$-equi-explanation maps. For a fixed value of $\epsilon$, E-map reduces both the number of subspaces (and hence increases interpretability) and the running time compared to the adapted systems. Finally, given a classifier decision boundary, we investigate how to choose an optimal value of the parameter $\epsilon$. We believe good explanation representation methods can increase the trustworthiness and understanding of machine learning models for critical real-world tasks.
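The abstract does not spell out E-map's internals, so the following is only a minimal sketch of the $\epsilon$-equi-explanation criterion under stated assumptions: a greedy divide-and-conquer that splits a region at the median of its widest input dimension until the explanation vectors inside satisfy the $\epsilon$ bound. The function name e_map_partition, the axis-aligned splitting rule, and the max_depth guard are illustrative choices, not the authors' algorithm; the explanation vectors could come from any local explainer such as LIME.

```python
import numpy as np

def e_map_partition(points, explanations, eps, depth=0, max_depth=20):
    """Greedy divide-and-conquer partitioning: split a region until the
    explanation vectors inside it have standard deviation <= eps in every
    dimension (the epsilon-equi-explanation criterion).

    points:       (n, d_in)  input instances falling in this region
    explanations: (n, d_exp) their explanation vectors (e.g., LIME weights)
    Returns a list of (points, explanations) pairs, one per subspace.
    """
    if (len(points) < 2
            or depth >= max_depth
            or np.all(explanations.std(axis=0) <= eps)):
        return [(points, explanations)]
    # Split at the median of the input dimension with the widest spread.
    dim = int(np.argmax(points.std(axis=0)))
    mask = points[:, dim] <= np.median(points[:, dim])
    if mask.all() or not mask.any():  # degenerate split; stop here
        return [(points, explanations)]
    return (e_map_partition(points[mask], explanations[mask], eps, depth + 1, max_depth)
            + e_map_partition(points[~mask], explanations[~mask], eps, depth + 1, max_depth))
```

Under these assumptions, each returned pair corresponds to one subspace of the map, which could be summarized by the mean of its explanation vectors; fewer returned subspaces means a more interpretable map.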
Award ID(s):
1813662 2039449
NSF-PAR ID:
10357768
Author(s) / Creator(s):
Date Published:
Journal Name:
Proceedings of the 2022 ACM FAccT* Conference
Page Range / eLocation ID:
464 to 472
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. This work proposes a new computational framework for learning a structured generative model for real-world datasets. In particular, we propose to learn a Closed-loop Transcription between a multi-class, multi-dimensional data distribution and a Linear discriminative representation (CTRL) in the feature space, where the representation consists of multiple independent multi-dimensional linear subspaces. We argue that the sought optimal encoding and decoding mappings can be formulated as a two-player minimax game between the encoder and decoder over the learned representation. A natural utility function for this game is the so-called rate reduction, a simple information-theoretic measure of distance between mixtures of subspace-like Gaussians in the feature space. Our formulation draws inspiration from closed-loop error feedback in control systems and avoids the expensive evaluation and minimization of approximated distances between arbitrary distributions in either the data space or the feature space. To a large extent, this new formulation unifies the concepts and benefits of auto-encoding and GANs and naturally extends them to learning a representation that is both discriminative and generative for multi-class, multi-dimensional real-world data. Our extensive experiments on many benchmark imagery datasets demonstrate the tremendous potential of this new closed-loop formulation: under fair comparison, the visual quality of the learned decoder and the classification performance of the encoder are competitive with, and arguably better than, existing methods based on GANs, VAEs, or a combination of both. Unlike in existing generative models, the learned features of the multiple classes are structured rather than hidden: different classes are explicitly mapped onto corresponding independent principal subspaces in the feature space, and diverse visual attributes within each class are modeled by the independent principal components within each subspace.
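The rate-reduction utility named above has a closed form in the rate-reduction (MCR²) literature; the NumPy sketch below computes it for labeled features stacked column-wise. The function names and the eps default are illustrative assumptions, not the authors' training code; in CTRL this quantity is maximized by the encoder and minimized by the decoder.

```python
import numpy as np

def coding_rate(Z, eps=0.5):
    """R(Z) = 1/2 logdet(I + d/(n*eps^2) * Z Z^T) for a d x n feature matrix Z."""
    d, n = Z.shape
    return 0.5 * np.linalg.slogdet(np.eye(d) + (d / (n * eps**2)) * Z @ Z.T)[1]

def rate_reduction(Z, labels, eps=0.5):
    """Delta R: rate of the whole feature set minus the weighted
    class-conditional rates; large when classes occupy distinct subspaces."""
    whole = coding_rate(Z, eps)
    parts = 0.0
    for c in np.unique(labels):
        Zc = Z[:, labels == c]
        parts += (Zc.shape[1] / Z.shape[1]) * coding_rate(Zc, eps)
    return whole - parts
```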
  2. One of the most common types of models used to understand neuron behavior is based on the Hodgkin–Huxley ion channel formulation (HH model). A major challenge in inferring parameters of HH models is non-uniqueness: many different sets of ion channel parameter values produce similar outputs for the same input stimulus. This results in an objective function with multiple modes (i.e., multiple local minima), which poses challenges for parameter estimation with many algorithmic optimization techniques. HH models additionally have severe non-linearities, further complicating algorithmic parameter inference. To address these challenges with a tractable method in high-dimensional parameter spaces, we propose using a particular Markov chain Monte Carlo (MCMC) algorithm, which has the advantage of inferring parameters in a Bayesian framework well suited to multimodal solutions of inverse problems. We introduce and demonstrate the method on a three-channel HH model, and then focus on the inference of nine parameters in an eight-channel HH model, which we analyze in detail. We explore how the MCMC algorithm can uncover complex relationships between inferred parameters using five injected current levels. The method yields a nine-dimensional posterior distribution, which we analyze visually with solution maps, or landscapes, of the possible parameter sets. These solution maps reveal new complex structures of the multimodal posteriors, allow selection of locally and globally optimal value sets, and visually expose parameter sensitivities and regions of higher model robustness. We envision such solution maps, when the MCMC algorithm is applied to experimental data, enabling experimentalists to improve the design of future experiments, increase scientific productivity, and refine model structure and ideation.
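The abstract names "a particular MCMC algorithm" without specifying it here, so the sketch below uses plain random-walk Metropolis on a toy bimodal log-posterior standing in for the HH-model likelihood. Everything in it (the two-mode density, step size, dimensionality) is a hypothetical illustration of how such a sampler exposes multimodal posteriors, not the paper's setup.

```python
import numpy as np

rng = np.random.default_rng(0)

def log_posterior(theta):
    # Toy stand-in for the HH likelihood: two equally good parameter
    # sets (around +1 and -1), mimicking the non-uniqueness described above.
    return np.logaddexp(-0.5 * np.sum((theta - 1.0)**2) / 0.1,
                        -0.5 * np.sum((theta + 1.0)**2) / 0.1)

def metropolis(theta0, n_steps=20000, step=0.3):
    """Random-walk Metropolis sampler over the parameter vector."""
    theta = np.asarray(theta0, dtype=float)
    lp = log_posterior(theta)
    samples = np.empty((n_steps, theta.size))
    for i in range(n_steps):
        prop = theta + step * rng.standard_normal(theta.size)
        lp_prop = log_posterior(prop)
        if np.log(rng.uniform()) < lp_prop - lp:  # accept/reject
            theta, lp = prop, lp_prop
        samples[i] = theta
    return samples

samples = metropolis(np.zeros(2))  # 2-D toy; the paper infers nine parameters
```

Histogramming pairs of columns of `samples` would produce miniature versions of the solution maps described above, with one cluster per posterior mode.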
  3. As machine learning methods see greater adoption and implementation in high-stakes applications such as medical image diagnosis, the need for model interpretability and explanation has become more critical. Classical approaches that assess feature importance (e.g., saliency maps) do not explain how and why a particular region of an image is relevant to the prediction. We propose a method that explains the outcome of a black-box classifier by gradually exaggerating the semantic effect of a given class. Given a query input to a classifier, our method produces a progressive set of plausible variations of that query which gradually shift the posterior probability from the original class to its negation. These counterfactually generated samples preserve features unrelated to the classification decision, so a user can employ our method as a "tuning knob" to traverse the data manifold while crossing the decision boundary. Our method is model-agnostic and requires only the output value and the gradient of the predictor with respect to its input.
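The abstract notes that the method needs only the predictor's output value and input gradient; the sketch below shows the bare gradient-following loop that requirement permits. It is not the authors' method: their samples stay on a data manifold and preserve class-irrelevant features, whereas this raw input-space walk (PyTorch, with an assumed predictor returning a scalar posterior) merely illustrates pushing a query's posterior toward its negation.

```python
import torch

def exaggerate(x, predictor, steps=10, lr=0.05):
    """Follow the predictor's input gradient to gradually push the posterior
    for the query's class toward its negation, recording each intermediate
    sample (the "tuning knob" positions)."""
    x = x.clone().detach()
    path = [x.clone()]
    for _ in range(steps):
        x.requires_grad_(True)
        p = predictor(x)   # scalar posterior P(class | x); an assumption
        p.backward()       # populates x.grad = dp/dx
        with torch.no_grad():
            x = x - lr * x.grad.sign()  # step that lowers the posterior
        path.append(x.clone())
    return path

# Hypothetical usage with a binary model returning a logit:
# path = exaggerate(query, lambda z: torch.sigmoid(model(z)).squeeze())
```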
  4. Deep neural networks (DNNs) have become increasingly popular, and DNN testing is critical to guaranteeing their correctness, measured in this work as accuracy. However, DNN testing suffers from a serious efficiency problem: determining the accuracy of a DNN on a testing set requires labeling every test input, which is costly because labeling is done manually, often involves multiple persons (sometimes with domain-specific knowledge), and the testing set is large-scale. To relieve this problem, we propose a novel and practical approach, called PACE (Practical ACcuracy Estimation), which selects a small set of test inputs that can precisely estimate the accuracy of the whole testing set. In this way, labeling costs are largely reduced by labeling only this small set of selected test inputs. Beyond precise accuracy estimation, PACE is also designed to be interpretable, deterministic, and as efficient as possible. PACE first uses clustering to interpretably divide test inputs with different testing capabilities (i.e., those testing different functionalities of a DNN model) into different groups. It then applies the MMD-critic algorithm, a state-of-the-art example-based explanation algorithm, to select prototypes (i.e., the most representative test inputs) from each group according to the group sizes, which reduces the impact of noise from clustering. Meanwhile, PACE borrows the idea of adaptive random testing to select test inputs from the minority space (i.e., test inputs not clustered into any group) to achieve high diversity within the required number of test inputs. The two parallel selection processes (from the groups and from the minority space) compose the final small set of selected test inputs. We conducted an extensive study evaluating PACE on a comprehensive benchmark (24 pairs of DNN models and testing sets) covering different types of models (classification and regression, high- and low-accuracy, CNN and RNN) and different types of test inputs (original, mutated, and automatically generated). The results demonstrate that PACE precisely estimates the accuracy of the whole testing set with only 1.181%∼2.302% deviation on average, significantly outperforming state-of-the-art approaches.
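As a rough illustration of PACE's first two steps, the sketch below clusters test inputs by feature vectors and picks per-cluster representatives proportionally to group size. The nearest-to-centroid rule is a stand-in for MMD-critic, and the adaptive-random-testing pass over the minority space is omitted; names and defaults (pace_style_select, n_clusters) are assumptions, not the tool's API.

```python
import numpy as np
from sklearn.cluster import KMeans

def pace_style_select(features, budget, n_clusters=8, seed=0):
    """Cluster test inputs by their feature vectors (e.g., penultimate-layer
    activations) and select representatives from each cluster proportionally
    to its size; labeling only these inputs estimates whole-set accuracy."""
    km = KMeans(n_clusters=n_clusters, random_state=seed, n_init=10).fit(features)
    selected = []
    for c in range(n_clusters):
        idx = np.flatnonzero(km.labels_ == c)
        k = max(1, round(budget * len(idx) / len(features)))
        # Stand-in for MMD-critic prototypes: the points nearest the centroid.
        d = np.linalg.norm(features[idx] - km.cluster_centers_[c], axis=1)
        selected.extend(idx[np.argsort(d)[:k]])
    return np.array(selected)

# Estimated accuracy from the selected inputs only (is_correct is 0/1):
# acc_hat = is_correct[pace_style_select(features, budget=100)].mean()
```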
  5. Convolutional neural networks (CNNs) have become a key asset in most fields of AI. Despite their successful performance, CNNs suffer from a major drawback: they fail to capture the hierarchy of spatial relations among different parts of an entity. As a remedy, Hinton proposed the idea of capsules. In this paper, we propose the SubSpace Capsule Network (SCN), which exploits the idea of capsule networks to model possible variations in the appearance or implicitly defined properties of an entity through a group of capsule subspaces, instead of simply grouping neurons to create capsules. A capsule is created by projecting an input feature vector from a lower layer onto the capsule subspace using a learnable transformation; this transformation measures the degree of alignment of the input with the properties modeled by the capsule subspace. We show that SCN is a general capsule network that can be applied to both discriminative and generative models without incurring computational overhead compared to CNNs at test time. The effectiveness of SCN is evaluated through a comprehensive set of experiments on supervised image classification, semi-supervised image classification, and high-resolution image generation using the generative adversarial network (GAN) framework; SCN significantly improves the performance of the baseline models on all three tasks.
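A minimal sketch of the capsule-subspace projection described above, assuming a learnable basis per capsule that is orthonormalized with a batched QR (the paper's exact normalization may differ): the projection's coordinates form the capsule, and their norm reads as the degree of alignment with that capsule's subspace.

```python
import torch
import torch.nn as nn

class CapsuleSubspaceProjection(nn.Module):
    """Project an input feature vector onto each of n_caps learnable
    caps_dim-dimensional subspaces; the norm of each projection measures
    the input's alignment with that capsule's subspace."""
    def __init__(self, in_dim, n_caps, caps_dim):
        super().__init__()
        # One learnable basis W_k of shape (in_dim, caps_dim) per capsule.
        self.W = nn.Parameter(torch.randn(n_caps, in_dim, caps_dim) * 0.01)

    def forward(self, x):                         # x: (batch, in_dim)
        # Orthonormalize each basis so the map is a true projection.
        Q, _ = torch.linalg.qr(self.W)            # (n_caps, in_dim, caps_dim)
        coords = torch.einsum('bi,kid->bkd', x, Q)  # subspace coordinates
        return coords, coords.norm(dim=-1)        # capsules and alignments
```

Under this reading, a downstream layer can treat the per-capsule norms like class or attribute activations while the coordinates carry the within-subspace variation.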