-
Large Language Models (LLMs) have demonstrated remarkable capabilities in performing complex tasks. Moreover, recent research has shown that incorporating human-annotated rationales (e.g., Chain-of-Thought prompting) during in-context learning can significantly enhance the performance of these models, particularly on tasks that require reasoning capabilities. However, incorporating such rationales poses challenges in terms of scalability, as it requires a high degree of human involvement. In this work, we present a novel framework, Amplifying Model Performance by Leveraging In-Context Learning with Post Hoc Explanations (AMPLIFY), which addresses these challenges by automating the process of rationale generation. To this end, we leverage post hoc explanation methods, which output attribution scores (explanations) capturing the influence of each input feature on model predictions. More specifically, we construct automated natural language rationales that embed insights from post hoc explanations to provide corrective signals to LLMs. Extensive experimentation with real-world datasets demonstrates that our framework, AMPLIFY, leads to prediction accuracy improvements of about 10-25% over a wide range of tasks, including those where prior approaches that rely on human-annotated rationales, such as Chain-of-Thought prompting, fall short. Our work makes one of the first attempts at highlighting the potential of post hoc explanations as valuable tools for enhancing the effectiveness of LLMs. Furthermore, we conduct additional empirical analyses and ablation studies to demonstrate the impact of each component of AMPLIFY, which, in turn, leads to critical insights for refining in-context learning.
Free, publicly-accessible full text available December 1, 2024
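To make the described pipeline concrete, here is a minimal sketch of the AMPLIFY-style idea: score the input tokens with a post hoc explainer, turn the top-scoring tokens into a natural language rationale, and embed those rationales in the few-shot prompt. The `attribution_scores` stand-in, the rationale template, and the function names are illustrative assumptions, not the paper's implementation.

```python
# Minimal sketch of rationale construction from post hoc explanations.
# `attribution_scores` is a hypothetical stand-in for a real explainer
# (e.g., a gradient-based attribution method run on a proxy model).

def attribution_scores(tokens):
    """Stand-in: return one importance score per input token.
    In practice these scores come from a post hoc explanation method."""
    return [len(t) / 10.0 for t in tokens]  # toy heuristic, not the real method

def build_rationale(text, top_k=3):
    """Turn attribution scores into a natural language rationale string."""
    tokens = text.split()
    ranked = sorted(zip(tokens, attribution_scores(tokens)),
                    key=lambda pair: pair[1], reverse=True)
    keywords = [tok for tok, _ in ranked[:top_k]]
    return f"The key words {', '.join(keywords)} are important clues for the label."

def amplify_prompt(few_shot_examples, query):
    """Assemble an in-context prompt with a rationale embedded in each example."""
    parts = []
    for text, label in few_shot_examples:
        parts.append(f"Input: {text}\nRationale: {build_rationale(text)}\nLabel: {label}")
    parts.append(f"Input: {query}\nRationale: {build_rationale(query)}\nLabel:")
    return "\n\n".join(parts)

print(amplify_prompt([("the movie was wonderful and moving", "positive")],
                     "a dull, lifeless script"))
```

The point of the sketch is only the shape of the prompt: each in-context example carries an automatically generated rationale, so no human annotation step is needed.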
-
Explaining machine learning models with interactive natural language conversations using TalkToModel
Practitioners increasingly use machine learning (ML) models, yet models have become more complex and harder to understand. To understand complex models, researchers have proposed techniques to explain model predictions. However, practitioners struggle to use explainability methods because they do not know which explanation to choose or how to interpret its output. Here we address this challenge by proposing TalkToModel: an interactive dialogue system that explains ML models through natural language conversations. TalkToModel consists of three components: an adaptive dialogue engine that interprets natural language and generates meaningful responses; an execution component that constructs the explanations used in the conversation; and a conversational interface. In real-world evaluations, 73% of healthcare workers agreed they would use TalkToModel over existing systems for understanding a disease prediction model, and 85% of ML professionals agreed TalkToModel was easier to use, demonstrating that TalkToModel is highly effective for model explainability.
-
Machine Learning (ML) models are increasingly used to make critical decisions in real-world applications, yet they have become more complex, making them harder to understand. To this end, researchers have proposed several techniques to explain model predictions. However, practitioners struggle to use these explainability techniques because they often do not know which one to choose and how to interpret the results of the explanations. In this work, we address these challenges by introducing TalkToModel: an interactive dialogue system for explaining machine learning models through conversations. TalkToModel comprises 1) a dialogue engine that adapts to any tabular model and dataset, understands language, and generates responses, and 2) an execution component that constructs the explanations. In real-world evaluations with humans, 73% of healthcare workers (e.g., doctors and nurses) agreed they would use TalkToModel over baseline point-and-click systems for explainability in a disease prediction task, and 85% of ML professionals agreed TalkToModel was easier to use for computing explanations. Our findings demonstrate that TalkToModel is more effective for model explainability than existing systems, introducing a new category of explainability tools for practitioners.
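As a rough illustration of the two-part design described in the abstracts above, the sketch below separates a dialogue engine that maps an utterance to an explanation operation from an execution component that runs it on the model. The keyword-based parser, the operation names, and the toy tabular model are assumptions made for illustration; the actual system parses language into a learned grammar of operations rather than matching keywords.

```python
# Minimal sketch of the dialogue-engine / execution-component split.
# Everything here is a simplified stand-in for the real system.

def parse_utterance(utterance):
    """Toy intent parser: map a user utterance to an explanation operation.
    The real dialogue engine uses a language model, not keyword matching."""
    text = utterance.lower()
    if "why" in text or "explain" in text:
        return "feature_importance"
    if "what if" in text:
        return "counterfactual"
    return "predict"

def execute(operation, model, instance):
    """Execution component: compute the requested result on the model."""
    if operation == "feature_importance":
        # Stand-in for a real explainer such as a perturbation-based method.
        return {name: abs(value) for name, value in instance.items()}
    if operation == "counterfactual":
        return "smallest input change that flips the prediction (stub)"
    return model(instance)

model = lambda x: int(sum(x.values()) > 1.0)  # toy tabular classifier
instance = {"age": 0.7, "bmi": 0.5}
print(execute(parse_utterance("Why did you predict this?"), model, instance))
```

Keeping parsing and execution separate is what lets the conversational interface stay model-agnostic: the engine only has to emit operations, and the execution component handles any tabular model behind a common interface.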