Title: Interpreting and Improving Deep-Learning Models with Reality Checks
Recent deep-learning models have achieved impressive predictive performance by learning complex functions of many variables, often at the cost of interpretability. This chapter covers recent work aiming to interpret models by attributing importance to features and feature groups for a single prediction. Importantly, the proposed attributions assign importance to interactions between features, in addition to features in isolation. These attributions are shown to yield insights across real-world domains, including bio-imaging, cosmology imaging, and natural-language processing. We then show how these attributions can be used to directly improve the generalization of a neural network or to distill it into a simple model. Throughout the chapter, we emphasize the use of reality checks to scrutinize the proposed interpretation techniques. (Code for all methods in this chapter is available at github.com/csinva and github.com/Yu-Group, implemented in PyTorch [54].)
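The chapter's own attribution methods (contextual decomposition and its hierarchical extension) are implemented in the repositories linked above; the sketch below is only a minimal illustration of the underlying idea of scoring a feature group jointly, using simple occlusion rather than the chapter's algorithms. The model, input tensor, and index choices in the usage comments are hypothetical.

    import torch

    def group_occlusion_score(model, x, group_idx, baseline=0.0, target=None):
        # Rough occlusion-style importance for a *group* of input features:
        # how much does the predicted score drop when the whole group is
        # replaced by a baseline value? (Not the chapter's method; a sketch.)
        model.eval()
        with torch.no_grad():
            out = model(x.unsqueeze(0)).squeeze(0)
            if target is None:
                target = out.argmax().item()
            x_masked = x.clone()
            x_masked[group_idx] = baseline        # ablate the whole group at once
            out_masked = model(x_masked.unsqueeze(0)).squeeze(0)
        return (out[target] - out_masked[target]).item()

    # Hypothetical usage: a crude interaction score for features 3-7 is the
    # joint group score minus the sum of the single-feature scores.
    # joint = group_occlusion_score(model, x, torch.arange(3, 8))
    # solo = sum(group_occlusion_score(model, x, torch.tensor([i])) for i in range(3, 8))
    # interaction = joint - solo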
Award ID(s):
2023505 2031883 1740855 2015341 0939370 1953191
PAR ID:
10343666
Journal Name:
Lecture Notes in Computer Science
ISSN:
0302-9743
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Several methods have recently been developed for computing attributions of a neural network's prediction over the input features. However, these existing approaches are noisy and not robust to small perturbations of the input. This paper uses the recently identified connection between dynamical systems and residual neural networks to show that attributions computed over neural stochastic differential equations (SDEs) are less noisy, visually sharper, and quantitatively more robust. Using dynamical systems theory, we theoretically analyze the robustness of these attributions. We also experimentally demonstrate the efficacy of our approach in providing smoother, visually sharper, and quantitatively more robust attributions by computing attributions for ImageNet images using ResNet-50, WideResNet-101, and ResNeXt-101 models. (A simplified sketch appears as the first code example after this list.)
  2. With the increasing interest in explainable attribution for deep neural networks, it is important to consider not only the importance of individual inputs, but also the model parameters themselves. Existing methods, such as Neuron Integrated Gradients [18] and Conductance [6], attempt model attribution by applying attribution methods, such as Integrated Gradients, to the inputs of each model parameter. While these methods seem to map attributions to individual parameters, they are actually aggregated feature attributions that completely ignore the parameter space and suffer from the same underlying limitations as Integrated Gradients. In this work, we compute parameter attributions by leveraging the recent family of measures proposed in Generalized Integrated Attributions, instead computing integrals over the product space of inputs and parameters. This use of the product space allows us to explain individual neurons from varying perspectives and interpret them with the same intuition as inputs. To the best of our knowledge, ours is the first method that actually utilizes the gradient landscape of the parameter space to explain each individual weight and bias. We confirm the utility of our parameter attributions by computing exploratory statistics for a wide variety of image classification datasets and by performing pruning analyses on a standard architecture, which demonstrate that our attribution measures are able to identify both important and unimportant neurons in a convolutional neural network. (A simplified sketch of a parameter-space attribution appears as the second code example after this list.)
  3. Artificial neural networks are increasingly used for geophysical modeling to extract complex nonlinear patterns from geospatial data. However, it is difficult to understand how networks make predictions, limiting trust in the model, debugging capacity, and physical insights. EXplainable Artificial Intelligence (XAI) techniques expose how models make predictions, but XAI results may be influenced by correlated features. Geospatial data typically exhibit substantial autocorrelation. With correlated input features, learning methods can produce many networks that achieve very similar performance (e.g., arising from different initializations). Since the networks capture different relationships, their attributions can vary. Correlated features may also cause inaccurate attributions because XAI methods typically evaluate isolated features, whereas networks learn multifeature patterns. Few studies have quantitatively analyzed the influence of correlated features on XAI attributions. We use a benchmark framework of synthetic data with increasingly strong correlation, for which the ground-truth attribution is known. For each dataset, we train multiple networks and compare XAI-derived attributions to the ground truth. We show that correlation may dramatically increase the variance of the derived attributions, and we investigate the cause of the high variance: is it because different trained networks learn highly different functions, or because XAI methods become less faithful in the presence of correlation? Finally, we show that applying XAI to superpixels, instead of to single grid cells, substantially decreases attribution variance. Our study is the first to quantify the effects of strong correlation on XAI, to investigate the reasons that underlie these effects, and to offer a promising way to address them. (A simplified sketch of the attribution-variance comparison appears as the third code example after this list.)
  4. 1-parameter persistent homology, a cornerstone of Topological Data Analysis (TDA), studies the evolution of topological features, such as connected components and cycles, hidden in data. It has been applied to enhance the representation power of deep-learning models, such as Graph Neural Networks (GNNs). To enrich the representation of topological features, here we propose to study 2-parameter persistence modules induced by bi-filtration functions. In order to incorporate these representations into machine learning models, we introduce a novel vector representation called the Generalized Rank Invariant Landscape (GRIL) for 2-parameter persistence modules. We show that this vector representation is 1-Lipschitz stable and differentiable with respect to the underlying filtration functions and can be easily integrated into machine learning models to augment the encoding of topological features. We present an algorithm to compute the vector representation efficiently. We also test our methods on synthetic and benchmark graph datasets and compare the results with previous vector representations of 1-parameter and 2-parameter persistence modules. Further, we augment GNNs with GRIL features and observe an increase in performance, indicating that GRIL can capture additional features that enrich GNNs. We make the complete code for the proposed method available at https://github.com/soham0209/mpml-graph. (A simplified sketch of the feature-augmentation pattern appears as the fourth code example after this list.)
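For the neural-SDE attribution result in item 1, the sketch below only mimics the smoothing effect by averaging plain input-gradient saliency over a few noisy copies of the input; it is a stand-in under our own assumptions, not the paper's neural-SDE construction, and the model and target names in the usage comment are hypothetical.

    import torch

    def noise_averaged_saliency(model, x, target, n_samples=8, sigma=0.05):
        # Average raw input gradients over noisy copies of the input.
        # Loose stand-in for smoother attributions; the paper instead computes
        # attributions over a neural SDE, where noise enters the dynamics.
        model.eval()
        grads = torch.zeros_like(x)
        for _ in range(n_samples):
            x_noisy = (x + sigma * torch.randn_like(x)).requires_grad_(True)
            score = model(x_noisy.unsqueeze(0))[0, target]
            score.backward()
            grads += x_noisy.grad.detach()
        return grads / n_samples

    # Hypothetical usage with any torchvision ImageNet classifier:
    # model = torchvision.models.resnet50(weights="IMAGENET1K_V1")
    # attribution = noise_averaged_saliency(model, image_tensor, target=predicted_class)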
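Item 2 integrates over the product space of inputs and parameters; the sketch below shows only a much simpler parameter-side analogue under our own assumptions: integrated gradients for a single named parameter tensor along a straight path from a zero baseline to its trained value, with the input held fixed. It is not the paper's Generalized Integrated Attributions measure, and the parameter name in the usage comment is hypothetical.

    import copy
    import torch

    def parameter_integrated_gradients(model, x, target, param_name, steps=32):
        # Integrated-gradients-style score for one parameter tensor: integrate
        # d(output)/d(param) while scaling the parameter from 0 to its trained
        # value. A simplified parameter-side sketch, not the paper's measure.
        model = copy.deepcopy(model).eval()
        param = dict(model.named_parameters())[param_name]
        trained_value = param.detach().clone()
        total = torch.zeros_like(trained_value)
        for alpha in torch.linspace(1.0 / steps, 1.0, steps):
            with torch.no_grad():
                param.copy_(alpha * trained_value)   # move along the straight path
            model.zero_grad()
            out = model(x.unsqueeze(0))[0, target]
            out.backward()
            total += param.grad.detach()
        return trained_value * total / steps          # (w - baseline) * mean gradient

    # Hypothetical usage for a network whose final layer is named "fc":
    # attr = parameter_integrated_gradients(net, x, target=0, param_name="fc.weight")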
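For item 3's variance analysis, the sketch below compares per-location attribution variance across networks retrained from different seeds, before and after pooling each map over coarse square patches as crude stand-ins for superpixels; the map shapes and the saliency function in the usage comment are assumptions.

    import torch
    import torch.nn.functional as F

    def attribution_variance(attr_maps, patch=1):
        # attr_maps: (n_models, H, W), one attribution map per retrained network.
        # patch:     side length of the square "superpixel" each map is averaged
        #            over before taking the variance (patch=1 keeps raw grid cells).
        if patch > 1:
            attr_maps = F.avg_pool2d(attr_maps.unsqueeze(1), patch).squeeze(1)
        return attr_maps.var(dim=0)                   # variance across models

    # Hypothetical usage with five networks trained from different seeds:
    # maps = torch.stack([saliency(net, x) for net in trained_nets])   # (5, H, W)
    # raw_var   = attribution_variance(maps, patch=1).mean()
    # super_var = attribution_variance(maps, patch=8).mean()           # typically smaller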
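Item 4's GRIL code is available at the linked repository; the sketch below only illustrates the generic augmentation pattern under our own assumptions about tensor shapes: concatenating a graph-level embedding from any GNN readout with a precomputed topological vector (such as a GRIL vector) before classification. The class name and dimensions are hypothetical.

    import torch
    import torch.nn as nn

    class TopoAugmentedClassifier(nn.Module):
        # Concatenate a graph embedding with a precomputed topological vector
        # before the classification head. The GNN encoder producing `graph_emb`
        # and the vectorization producing `topo_vec` are treated as given.
        def __init__(self, emb_dim, topo_dim, n_classes, hidden=64):
            super().__init__()
            self.head = nn.Sequential(
                nn.Linear(emb_dim + topo_dim, hidden),
                nn.ReLU(),
                nn.Linear(hidden, n_classes),
            )

        def forward(self, graph_emb, topo_vec):
            return self.head(torch.cat([graph_emb, topo_vec], dim=-1))

    # Hypothetical usage: a batch of 32 graphs with 128-d GNN embeddings and
    # 50-d GRIL-style vectors.
    # clf = TopoAugmentedClassifier(emb_dim=128, topo_dim=50, n_classes=2)
    # logits = clf(torch.randn(32, 128), torch.randn(32, 50))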