I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons

Zhou, Pei; Zhu, Andrew; Hu, Jennifer; Pujara, Jay; Ren, Xiang; Callison-Burch, Chris; Choi, Yejin; Ammanabrolu, Prithviraj

doi:10.18653/v1/2023.acl-long.624

Citation Details

I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons

We propose a novel task, G4C (Goal-driven Guidance Generation in Grounded Communication), for studying goal-driven and grounded natural language interactions. Specifically, we choose Dungeons and Dragons (D&D) -- a role-playing game consisting of multiple player characters and a Dungeon Master (DM) who collaborate to achieve a set of goals that are beneficial to the players -- as a testbed for this task. Here, each of the player characters is a student, with their own personas and abilities, and the DM is the teacher, an arbitrator of the rules of the world and responsible for assisting and guiding the students towards a global goal. We propose a theory-of-mind-inspired methodology for training such a DM with reinforcement learning (RL), where a DM: (1) learns to predict how the players will react to its utterances using a dataset of D&D dialogue transcripts; and (2) uses this prediction as a reward function providing feedback on how effective these utterances are at guiding the players towards a goal. Human and automated evaluations show that a DM trained with RL to generate guidance by incorporating a theory-of-mind of the players significantly improves the players' ability to achieve goals grounded in their shared world. more »

Award ID(s):: 1928474

PAR ID:: 10463285

Author(s) / Creator(s):: Zhou, Pei; Zhu, Andrew; Hu, Jennifer; Pujara, Jay; Ren, Xiang; Callison-Burch, Chris; Choi, Yejin; Ammanabrolu, Prithviraj

Date Published:: 2023-01-01

Journal Name:: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics

Volume:: 1

Page Range / eLocation ID:: 11136 to 11155

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2023.acl-long.624

More Like this