SPEX: Scaling Feature Interaction Explanations for LLMs

Kong, Justin Singh; Butler, Landon; Agarwal; Erginbas, Yigit Efe; Pedarsani, Ramtin; Yu, Bin; Ramchandran, Kannan

This content will become publicly available on May 1, 2026

Large language models (LLMs) have revolutionized machine learning due to their ability to capture complex interactions between input features. Popular post-hoc explanation methods like SHAP provide marginal feature attributions, while their extensions to interaction importances only scale to small input lengths (≈20). We propose Spectral Explainer (SPEX), a model-agnostic interaction attribution algorithm that efficiently scales to large input lengths (≈1000). SPEX exploits underlying natural sparsity among interactions, common in real-world data, and applies a sparse Fourier transform using a channel decoding algorithm to efficiently identify important interactions. We perform experiments across three difficult long-context datasets that require LLMs to utilize interactions between inputs to complete the task. For large inputs, SPEX outperforms marginal attribution methods by up to 20% in terms of faithfully reconstructing LLM outputs. Further, SPEX successfully identifies key features and interactions that strongly influence model output. For one of our datasets, HotpotQA, SPEX provides interactions that align with human annotations. Finally, we use our model-agnostic approach to generate explanations to demonstrate abstract reasoning in closed-source LLMs (GPT-4o mini) and compositional reasoning in vision-language models.
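The abstract does not spell out the algorithm, so the following is only an illustrative sketch of the underlying idea, not SPEX itself. It assumes the model output can be treated as a function of a binary mask over input features, and it computes the Boolean Fourier (Walsh-Hadamard) transform of that function by brute force over all 2^n masks, which is exactly the cost a sparse transform would avoid. The names `toy_model` and `fourier_coefficients` are hypothetical, and `toy_model` is a stand-in for a real masked-input LLM call.

```python
import itertools
import numpy as np

def fourier_coefficients(value_fn, n):
    """Brute-force Boolean Fourier transform of value_fn over all 2^n masks.

    F(k) = mean over masks m of (-1)^{<m, k>} * value_fn(m).
    This is exponential in n; it is an illustration, not a scalable method.
    """
    masks = np.array(list(itertools.product([0, 1], repeat=n)))
    values = np.array([value_fn(m) for m in masks], dtype=float)
    coeffs = {}
    for k in masks:
        signs = (-1.0) ** (masks @ k)  # character chi_k evaluated on every mask
        coeffs[tuple(int(b) for b in k)] = float(np.mean(signs * values))
    return coeffs

def toy_model(mask):
    # Hypothetical stand-in for a model output under masking:
    # one marginal effect (feature 0) and one pairwise interaction (features 1 and 3).
    return 2.0 * mask[0] + 3.0 * mask[1] * mask[3]

coeffs = fourier_coefficients(toy_model, n=5)

# Only a handful of the 32 coefficients are nonzero: the function is sparse
# in the Fourier domain, which is the structure the abstract refers to.
significant = {k: v for k, v in coeffs.items() if abs(v) > 1e-9}
for k, v in sorted(significant.items()):
    print(k, round(v, 3))
```

In this toy case only 5 of the 32 coefficients are nonzero, and the coefficient indexed by features 1 and 3 together captures their interaction. A sparse transform aims to recover such coefficients without evaluating every mask.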

Award ID(s): 2209975 2413265 2031883 2023505

PAR ID: 10635709

Author(s) / Creator(s): Kong, Justin Singh; Butler, Landon; Agarwal; Erginbas, Yigit Efe; Pedarsani, Ramtin; Yu, Bin; Ramchandran, Kannan

Publisher / Repository: ICML 2025

Date Published: 2025-05-01

ISBN: 978-3031948916

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Conference Proceeding:
The DOI is not currently available.