ADAPTIVE TEST-TIME INTERVENTION FOR CONCEPT BOTTLENECK MODELS

Shen, Matthew; Hsu, Aliyah R; Agarwal, Abhineet; Yu, Bin

Citation Details

This content will become publicly available on March 5, 2026

ADAPTIVE TEST-TIME INTERVENTION FOR CONCEPT BOTTLENECK MODELS

Concept bottleneck models (CBM) aim to improve model interpretability by predicting human level “concepts” in a bottleneck within a deep learning model architecture. However, how the predicted concepts are used in predicting the target still either remains black-box or is simplified to maintain interpretability at the cost of prediction performance. We propose to use Fast Interpretable Greedy Sum- Trees (FIGS) to obtain Binary Distillation (BD). This new method, called FIGSBD, distills a binary-augmented concept-to-target portion of the CBM into an interpretable tree-based model, while maintaining the competitive prediction performance of the CBM teacher. FIGS-BD can be used in downstream tasks to explain and decompose CBM predictions into interpretable binary-concept-interaction attributions and guide adaptive test-time intervention. Across 4 datasets, we demonstrate that our adaptive test-time intervention identifies key concepts that significantly improve performance for realistic human-in-the-loop settings that only allow for limited concept interventions. All code is made available on Github (https://github.com/mattyshen/adaptiveTTI). more »

Award ID(s):: 2209975 2413265 2023505 2031883

PAR ID:: 10635695

Author(s) / Creator(s):: Shen, Matthew; Hsu, Aliyah R; Agarwal, Abhineet; Yu, Bin

Publisher / Repository:: ICLR 2025

Date Published:: 2025-03-05

ISBN:: 979-8-3313-2085-0

Subject(s) / Keyword(s):: Adaptive test-time, interpretable deep learning, Conceptual bottleneck models

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on March 5, 2026
Conference Paper:
The DOI is not currently available.

More Like this