Symbolic Rule Extraction From Attention-Guided Sparse Representations in Vision Transformers

PADALKAR, PARTH (ORCID:0000000310150777); GUPTA, GOPAL

doi:10.1017/S1471068425100318

Citation Details

This content will become publicly available on July 1, 2026

Symbolic Rule Extraction From Attention-Guided Sparse Representations in Vision Transformers

Abstract Recentneuro-symbolic approaches have successfully extracted symbolic rule-sets from Convolutional Neural Network-based models to enhance interpretability. However, applying similar techniques to Vision Transformers (ViTs) remains challenging due to their lack of modular concept detectors and reliance on global self-attention mechanisms. We propose a framework for symbolic rule extraction from ViTs by introducing a sparse concept layer inspired by Sparse Autoencoders (SAEs). This linear layer operates on attention-weighted patch representations and learns a disentangled, binarized representation in which individual neurons activate for high-level visual concepts. To encourage interpretability, we apply a combination of L1 sparsity, entropy minimization, and supervised contrastive loss. These binarized concept activations are used as input to the FOLD-SE-M algorithm, which generates a rule-set in the form of a logic program. Our method achieves abetter classification accuracy than the standard ViT while enabling symbolic reasoning. Crucially, the extracted rule-set is not merely post-hoc but acts as a logic-based decision layer that operates directly on the sparse concept representations. The resulting programs are concise and semantically meaningful. This work is the first to extract executable logic programs from ViTs using sparse symbolic representations, providing a step forward in interpretable and verifiable neuro-symbolic AI. more »

Award ID(s):: 1910131

PAR ID:: 10649080

Author(s) / Creator(s):: PADALKAR, PARTH; GUPTA, GOPAL

Publisher / Repository:: Cambridge University Press

Date Published:: 2025-07-01

Journal Name:: Theory and Practice of Logic Programming

Volume:: 25

Issue:: 4

ISSN:: 1471-0684

Page Range / eLocation ID:: 722 to 738

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on July 1, 2026
Journal Article:
https://doi.org/10.1017/S1471068425100318

More Like this