Improving Interpretability via Explicit Word Interaction Graph Layer

Sekhon, Arshdeep; Chen, Hanjie; Shrivastava, Aman; Wang, Zhe; Ji, Yangfeng; Qi, Yanjun

doi:10.1609/aaai.v37i11.26586

Citation Details

Improving Interpretability via Explicit Word Interaction Graph Layer

Recent NLP literature has seen growing interest in improving model interpretability. Along this direction, we propose a trainable neural network layer that learns a global interaction graph between words and then selects more informative words using the learned word interactions. Our layer, we call WIGRAPH, can plug into any neural network-based NLP text classifiers right after its word embedding layer. Across multiple SOTA NLP models and various NLP datasets, we demonstrate that adding the WIGRAPH layer substantially improves NLP models' interpretability and enhances models' prediction performance at the same time. more »

Award ID(s):: 2124538

PAR ID:: 10482171

Author(s) / Creator(s):: Sekhon, Arshdeep; Chen, Hanjie; Shrivastava, Aman; Wang, Zhe; Ji, Yangfeng; Qi, Yanjun

Publisher / Repository:: AAAI

Date Published:: 2023-06-27

Journal Name:: Proceedings of the AAAI Conference on Artificial Intelligence

Volume:: 37

Issue:: 11

ISSN:: 2159-5399

Page Range / eLocation ID:: 13528 to 13537

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1609/aaai.v37i11.26586

More Like this