While a vast collection of explainable AI (XAI) algorithms has been developed in recent years, these algorithms have been criticized for significant gaps with how humans produce and consume explanations. As a result, current XAI techniques are often found to be hard to use and lacking in effectiveness. In this work, we attempt to close these gaps by making AI explanations selective, a fundamental property of human explanations, by selectively presenting a subset of model reasoning that aligns with the recipient's preferences. We propose a general framework for generating selective explanations by leveraging human input on a small dataset. This framework opens up a rich design space that accounts for different selectivity goals, types of input, and more. As a showcase, we use a decision-support task to explore selective explanations based on what the decision-maker would consider relevant to the decision task. We conducted two experimental studies to examine three paradigms based on our proposed framework: in Study 1, we asked participants to provide critique-based or open-ended input to generate selective explanations (self-input); in Study 2, we showed participants selective explanations based on input from a panel of similar users (annotator input). Our experiments demonstrate the promise of selective explanations in reducing over-reliance on AI and improving collaborative decision making and subjective perceptions of the AI system, but they also paint a nuanced picture that attributes some of these positive effects to the opportunity to provide one's own input to augment AI explanations. Overall, our work proposes a novel XAI framework inspired by human communication behaviors and demonstrates its potential to encourage future work to make AI explanations more human-compatible.
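To make the selection idea concrete, here is a minimal sketch (hypothetical names and a simplified input format, not the framework's actual implementation): per-feature attributions from any explainer are filtered down to the subset that a small panel of human annotators marked as relevant.

```python
import numpy as np

def learn_relevance(annotations):
    """Average relevance votes per feature from a small annotated set.

    annotations: array of shape (n_annotators, n_features) with values in {0, 1},
    where 1 means the annotator marked the feature as relevant to the task.
    (Hypothetical input format; other forms of human input are possible.)
    """
    return np.asarray(annotations, dtype=float).mean(axis=0)

def selective_explanation(attributions, relevance, threshold=0.5):
    """Keep only the attributions for features that human input deems relevant."""
    attributions = np.asarray(attributions, dtype=float)
    keep = relevance >= threshold
    return np.where(keep, attributions, 0.0), keep

# Toy usage: 3 annotators, 5 features, attributions from any saliency method.
votes = [[1, 0, 1, 0, 0], [1, 0, 1, 1, 0], [1, 0, 0, 1, 0]]
attr = [0.40, -0.10, 0.25, 0.05, -0.30]
selected, mask = selective_explanation(attr, learn_relevance(votes))
print(selected)  # only features 0, 2, and 3 survive the selection
```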
Pathwise Explanation of ReLU Neural Networks
Neural networks have demonstrated a wide range of successes, but their "black box" nature raises concerns about transparency and reliability. Previous research on ReLU networks has sought to unwrap these networks into linear models based on the activation states of all hidden units. In this paper, we introduce a novel approach that considers subsets of the hidden units involved in the decision-making path. This pathwise explanation provides a clearer and more consistent understanding of the relationship between the input and the decision-making process. Our method also offers flexibility in adjusting the range of explanations within the input, i.e., from an overall attribution of the input to particular components within it. Furthermore, it allows the explanation for a given input to be decomposed into more detailed explanations. Our experiments demonstrate that the proposed method outperforms existing methods both quantitatively and qualitatively.
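As background for the unwrapping idea mentioned above, the sketch below (assumed shapes and names; it illustrates the prior activation-state view rather than the pathwise method itself) extracts the exact local linear model of a plain ReLU MLP at a given input, i.e., the weights and bias that reproduce the network's output for all inputs sharing that input's activation pattern.

```python
import numpy as np

def local_linear_model(weights, biases, x):
    """For a plain ReLU MLP, return (W_eff, b_eff) such that
    model(x) == W_eff @ x + b_eff within x's activation region."""
    W_eff = np.eye(len(x))
    b_eff = np.zeros(len(x))
    h = np.asarray(x, dtype=float)
    for i, (W, b) in enumerate(zip(weights, biases)):
        pre = W @ h + b
        # ReLU on every layer except the final linear output layer.
        mask = (pre > 0).astype(float) if i < len(weights) - 1 else np.ones_like(pre)
        W_eff = (W * mask[:, None]) @ W_eff
        b_eff = mask * (W @ b_eff + b)
        h = mask * pre
    return W_eff, b_eff

# Toy check on a random 2-layer ReLU network.
rng = np.random.default_rng(0)
Ws = [rng.normal(size=(8, 4)), rng.normal(size=(3, 8))]
bs = [rng.normal(size=8), rng.normal(size=3)]
x = rng.normal(size=4)
W_eff, b_eff = local_linear_model(Ws, bs, x)
hidden = np.maximum(Ws[0] @ x + bs[0], 0)
assert np.allclose(W_eff @ x + b_eff, Ws[1] @ hidden + bs[1])
```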
- Award ID(s): 2006747
- PAR ID: 10564709
- Publisher / Repository: PMLR (Proceedings of Machine Learning Research)
- Date Published:
- Volume: 238
- Page Range / eLocation ID: 4645-4653
- ISSN: 2640-3498
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
Model explainability is essential for the creation of trustworthy Machine Learning models in healthcare. An ideal explanation resembles the decision-making process of a domain expert and is expressed using concepts or terminology that is meaningful to the clinicians. To provide such an explanation, we first associate the hidden units of the classifier to clinically relevant concepts. We take advantage of radiology reports accompanying the chest X-ray images to define concepts. We discover sparse associations between concepts and hidden units using a linear sparse logistic regression. To ensure that the identified units truly influence the classifier's outcome, we adopt tools from the Causal Inference literature and, more specifically, mediation analysis through counterfactual interventions. Finally, we construct a low-depth decision tree to translate all the discovered concepts into a straightforward decision rule, expressed to the radiologist. We evaluated our approach on a large chest X-ray dataset, where our model produces a global explanation consistent with clinical knowledge.
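As a rough illustration of the concept-association step described above, a minimal sketch using L1-regularized logistic regression on toy data (hypothetical variable names; not the authors' pipeline):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical data: hidden-unit activations of the X-ray classifier and a
# binary concept label per image (e.g., a finding mentioned in the report).
rng = np.random.default_rng(0)
activations = rng.normal(size=(500, 256))  # (n_images, n_hidden_units)
concept = (activations[:, 7] + activations[:, 42] > 0).astype(int)  # toy label

# The L1 penalty yields a sparse association: most hidden units get zero weight.
assoc = LogisticRegression(penalty="l1", solver="liblinear", C=0.1)
assoc.fit(activations, concept)
linked_units = np.flatnonzero(assoc.coef_[0])
print("hidden units associated with the concept:", linked_units)
```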
AI systems have been known to amplify biases in real-world data. Explanations may help human-AI teams address these biases for fairer decision-making. Typically, explanations focus on salient input features. If a model is biased against some protected group, explanations may include features that demonstrate this bias, but when biases are realized through proxy features, the relationship between this proxy feature and the protected one may be less clear to a human. In this work, we study the effect of the presence of protected and proxy features on participants’ perception of model fairness and their ability to improve demographic parity over an AI alone. Further, we examine how different treatments—explanations, model bias disclosure and proxy correlation disclosure—affect fairness perception and parity. We find that explanations help people detect direct but not indirect biases. Additionally, regardless of bias type, explanations tend to increase agreement with model biases. Disclosures can help mitigate this effect for indirect biases, improving both unfairness recognition and decision-making fairness. We hope that our findings can help guide further research into advancing explanations in support of fair human-AI decision-making.
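For reference, demographic parity, the criterion participants were asked to improve, can be summarized as the gap in positive-decision rates between groups; a minimal sketch with hypothetical inputs:

```python
import numpy as np

def demographic_parity_gap(decisions, group):
    """Absolute difference in positive-decision rates between two groups.

    decisions: binary array of final (human or AI) decisions.
    group: binary array indicating protected-group membership.
    """
    decisions = np.asarray(decisions)
    group = np.asarray(group)
    return abs(decisions[group == 0].mean() - decisions[group == 1].mean())

# Toy usage: a gap of 0 would mean perfect demographic parity.
print(demographic_parity_gap([1, 0, 1, 1, 0, 0], [0, 0, 0, 1, 1, 1]))
```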
Many interpretable AI approaches have been proposed to provide plausible explanations for a model’s decision-making. However, configuring an explainable model that effectively communicates among computational modules has received less attention. A recently proposed shared global workspace theory showed that networks of distributed modules can benefit from sharing information with a bottle-necked memory because the communication constraints encourage specialization, compositionality, and synchronization among the modules. Inspired by this, we propose Concept-Centric Transformers, a simple yet effective configuration of the shared global workspace for interpretability, consisting of: i) an object-centric-based memory module for extracting semantic concepts from input features, ii) a cross-attention mechanism between the learned concepts and input embeddings, and iii) standard classification and explanation losses to allow human analysts to directly assess an explanation for the model’s classification reasoning. We test our approach against other existing concept-based methods on classification tasks for various datasets, including CIFAR100, CUB-200-2011, and ImageNet, and we show that our model not only achieves better classification accuracy than all baselines across all problems but also generates more consistent concept-based explanations of classification output.
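A minimal PyTorch sketch of the cross-attention ingredient described above, with learned concept slots attending over input embeddings (shapes, names, and hyperparameters are assumptions, not the authors' code):

```python
import torch
import torch.nn as nn

class ConceptCrossAttention(nn.Module):
    """Learned concept slots read from input token embeddings via cross-attention."""
    def __init__(self, n_concepts=10, d_model=64, n_heads=4):
        super().__init__()
        self.concepts = nn.Parameter(torch.randn(n_concepts, d_model))  # concept memory
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, tokens):  # tokens: (batch, seq, d_model)
        batch = tokens.size(0)
        queries = self.concepts.unsqueeze(0).expand(batch, -1, -1)
        # Each concept slot attends over the input embeddings; the attention map
        # itself can be read as a concept-based explanation of the prediction.
        out, attn_map = self.attn(queries, tokens, tokens, need_weights=True)
        return out, attn_map

# Toy usage on a batch of 2 sequences of 49 patch embeddings.
module = ConceptCrossAttention()
concept_repr, explanation = module(torch.randn(2, 49, 64))
print(concept_repr.shape, explanation.shape)  # (2, 10, 64) and (2, 10, 49)
```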
Model compression is significant for the wide adoption of Recurrent Neural Networks (RNNs) in both user devices possessing limited resources and business clusters requiring quick responses to large-scale service requests. This work aims to learn structurally-sparse Long Short-Term Memory (LSTM) by reducing the sizes of basic structures within LSTM units, including input updates, gates, hidden states, cell states and outputs. Independently reducing the sizes of basic structures can result in inconsistent dimensions among them, and consequently, end up with invalid LSTM units. To overcome the problem, we propose Intrinsic Sparse Structures (ISS) in LSTMs. Removing a component of ISS will simultaneously decrease the sizes of all basic structures by one and thereby always maintain the dimension consistency. By learning ISS within LSTM units, the obtained LSTMs remain regular while having much smaller basic structures. Based on group Lasso regularization, our method achieves a 10.59× speedup without losing any perplexity on the Penn TreeBank language modeling dataset. It is also successfully evaluated through a compact model with only 2.69M weights for machine Question Answering on the SQuAD dataset. Our approach is successfully extended to non-LSTM RNNs, like Recurrent Highway Networks (RHNs). Our source code is available.
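A rough sketch of the group Lasso idea underlying ISS: weights belonging to one structural group are penalized by their joint L2 norm, so entire components (rather than individual weights) are driven to zero together. The grouping below is deliberately simplified and does not reproduce the exact ISS construction.

```python
import torch

def group_lasso_penalty(weight, group_size):
    """Sum of L2 norms over column-block groups of a weight matrix.

    Each block stands in for one structural component; penalizing the group
    norm pushes whole components toward zero rather than single weights.
    """
    groups = weight.view(weight.size(0), -1, group_size)  # (rows, n_groups, group_size)
    return torch.sqrt((groups ** 2).sum(dim=(0, 2))).sum()

# Toy usage: add the penalty to the training loss for an LSTM's input-to-hidden weights.
lstm = torch.nn.LSTM(input_size=32, hidden_size=64)
task_loss = torch.tensor(0.0)  # placeholder for the real training loss
reg = 1e-4 * group_lasso_penalty(lstm.weight_ih_l0, group_size=8)
loss = task_loss + reg
loss.backward()
```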

