Title: Using Multi-Encoder Fusion Strategies to Improve Personalized Response Selection
Personalized response selection systems are generally grounded in persona. However, persona and empathy are correlated, and these systems do not explore that correlation well. Also, when a contradictory or off-topic response is selected, faithfulness to the conversation context drops sharply. This paper addresses these issues by proposing a suite of fusion strategies that capture the interaction between the persona, emotion, and entailment information of the utterances. Ablation studies on the Persona-Chat dataset show that incorporating emotion and entailment improves the accuracy of response selection. We combine our fusion strategies with concept-flow encoding to train a BERT-based model that outperforms previous methods by margins larger than 2.3% on original personas and 1.9% on revised personas in terms of hits@1 (top-1 accuracy), achieving new state-of-the-art performance on the Persona-Chat dataset.
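As an illustration of the hits@1 (top-1 accuracy) metric named in the abstract, the following is a minimal sketch, not the authors' evaluation code. The function name `hits_at_k`, its parameters, and the toy scores are hypothetical; it assumes each dialogue context comes with a list of candidate-response scores and the index of the one correct candidate.

```python
from typing import Sequence


def hits_at_k(
    scores: Sequence[Sequence[float]],
    gold: Sequence[int],
    k: int = 1,
) -> float:
    """Fraction of examples whose gold candidate is among the k highest-scored.

    scores[i][j] is the (hypothetical) model score of candidate j for example i.
    gold[i] is the index of the correct candidate for example i.
    """
    if len(scores) != len(gold):
        raise ValueError("scores and gold must have the same length")
    if not scores:
        raise ValueError("at least one example is required")

    hits = 0
    for cand_scores, gold_idx in zip(scores, gold):
        # Rank candidate indices from highest to lowest score.
        ranked = sorted(
            range(len(cand_scores)),
            key=lambda j: cand_scores[j],
            reverse=True,
        )
        if gold_idx in ranked[:k]:
            hits += 1
    return hits / len(scores)


# Toy data (made up for illustration): three contexts, three candidates each.
toy_scores = [
    [0.1, 0.9, 0.3],  # gold candidate 1 is ranked first
    [0.8, 0.1, 0.1],  # gold candidate 0 is ranked first
    [0.2, 0.3, 0.5],  # gold candidate 1 is ranked second
]
toy_gold = [1, 0, 1]

hits1 = hits_at_k(toy_scores, toy_gold, k=1)
hits2 = hits_at_k(toy_scores, toy_gold, k=2)
```

With `k=1` the metric counts only examples where the correct response receives the single highest score, which is why the abstract equates hits@1 with top-1 accuracy.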
Award ID(s):
2214070
NSF-PAR ID:
10441744
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Proceedings of the 29th International Conference on Computational Linguistics (COLING)
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The concept of using automated vehicles as mobile workspaces is now emerging. Consequently, the in-vehicle environment of automated vehicles must be redesigned to support user interactions in performing work-related tasks. During the design phase, interaction designers often use personas to understand target user groups. Personas are representations of prototypical users and are constructed from user surveys and interview data. Although data-driven, large samples of user data are typically assessed qualitatively and may result in personas that are not representative of target user groups. To create representative personas, this paper demonstrates a data analytics approach to persona development for future mobile workspaces using data from the occupational information network (O*NET). O*NET consists of data on 968 occupations, each defined by 277 features. The data were reduced using dimensionality reduction and 7 personas were identified using cluster analysis. Finally, the important features of each persona were identified using logistic regression.
  2. Conversational agents designed to interact through natural language are often imbued with human-like personalities. At times, the agent might also have a distinct persona with traits such as gender, age, or a backstory. Designing such a personality or persona for conversational agents has become a common design practice. In this work, we review the emerging literature on designing agent persona or personality, and reflect on these approaches, along with the personas that are created for common conversational agents. We discuss open questions with regard to three aspects: meeting user needs, the ethics of deception, and reinforcing social stereotypes through conversational agents. We hope this work can provoke researchers and practitioners to critically reflect on their approach to designing the personality or persona of conversational agents.
  3. Organizations all over the world, both national and international, gather demographic data so that the progress of nations and peoples can be tracked. This data is often made available to the public in the form of aggregated national-level data or individual responses (microdata). Product designers likewise conduct surveys to better understand their customers and create personas. Personas are archetypes of the individuals who will use, maintain, sell, or otherwise be affected by the products created by designers. Personas help designers better understand the person the product is designed for. Unfortunately, collecting customer information and creating personas is often slow and expensive. In this paper, we introduce a new method of creating personas, leveraging publicly available databanks of both aggregated national-level data and information on individuals in the population. A computational persona generator is introduced that creates a population of personas that mirrors a real population in terms of size and statistics. Realistic individual personas are filtered from this population for use in product development.
  4. We release FOOLMETWICE (FM2 for short), a large dataset of challenging entailment pairs collected through a fun multi-player game. Gamification encourages adversarial examples, drastically lowering the number of examples that can be solved using “shortcuts” compared to other entailment datasets. Players are presented with two tasks. The first task asks the player to write a plausible claim based on the evidence from a Wikipedia page. The second one shows two plausible claims written by other players, one of which is false, and the goal is to identify it before the time runs out. Players “pay” to see clues retrieved from the evidence pool: the more evidence the player needs, the harder the claim. Game-play between motivated players leads to diverse strategies for crafting claims, such as temporal inference and diverting to unrelated evidence, and results in higher quality data for the entailment and evidence retrieval tasks. We open source the dataset and game code. 
  5. For autistic individuals, navigating social and emotional interactions can be complex, often involving disproportionately high cognitive labor in contrast to neurotypical conversation partners. Through a novel approach to speculative co-design, autistic adults explored affective imaginaries — imagined futuristic technology interventions — to probe a provocative question: What if technology could translate emotions like it can translate spoken language? The resulting speculative prototype for an image-enabled emotion translator chat application included: (1) a visual system for representing personalized emotion taxonomies, and (2) a Wizard of Oz implementation of these taxonomies in a low-fidelity chat application. Although wary of technology that purports to understand emotions, autistic participants saw value in being able to deploy visual emotion taxonomies during chats with neurotypical conversation partners. This work shows that affective technology should enable users to: (1) curate encodings of emotions used in system artifacts, (2) enhance interactive emotional understanding, and (3) have agency over how and when to use emotion features. 