XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs

Chen, Z; Chen, J; Singh, A K; Sra, M

Citation Details

This content will become publicly available on November 12, 2025

XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs

Large Language Models (LLMs) have achieved remarkable success in natural language tasks, yet understanding their reasoning processes re- mains a significant challenge. We address this by introducing XplainLLM, a dataset accom- panying an explanation framework designed to enhance LLM transparency and reliability. Our dataset comprises 24,204 instances where each instance interprets the LLM’s reasoning behavior using knowledge graphs (KGs) and graph attention networks (GAT), and includes explanations of LLMs such as the decoder- only Llama-3 and the encoder-only RoBERTa. XplainLLM also features a framework for gener- ating grounded explanations and the debugger- scores for multidimensional quality analysis. Our explanations include why-choose and why- not-choose components, reason-elements, and debugger-scores that collectively illuminate the LLM’s reasoning behavior. Our evaluations demonstrate XplainLLM’s potential to reduce hallucinations and improve grounded explana- tion generation in LLMs. XplainLLM is a re- source for researchers and practitioners to build trust and verify the reliability of LLM outputs. Our code and dataset are publicly available. more »

Award ID(s):: 2229876

PAR ID:: 10594777

Author(s) / Creator(s):: Chen, Z; Chen, J; Singh, A K; Sra, M

Publisher / Repository:: Proc. Empirical Methods in Natural Language Processing (EMNLP)

Date Published:: 2024-11-12

Page Range / eLocation ID:: 7578–7596

Format(s):: Medium: X

Location:: Miami, FL

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on November 12, 2025
Conference Paper:
The DOI is not currently available.

More Like this