Title: Embodied Multimodal Agents to Bridge the Understanding Gap
In this paper we argue that embodied multimodal agents, i.e., avatars, can play an important role in moving natural language processing toward “deep understanding.” Fully featured interactive agents model encounters between two “people,” whereas a language-only agent has little environmental or situational awareness. Multimodal agents bring new opportunities for interpreting visuals, locational information, gestures, and so on: additional axes along which to communicate. We propose that multimodal agents, by facilitating an embodied form of human-computer interaction, provide additional structure that can be used to train models that move NLP systems closer to genuine “understanding” of grounded language, and we discuss ongoing studies using existing systems.
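As a concrete illustration of the extra communicative axes an embodied agent gains, the following is a minimal Python sketch (our own illustration, not drawn from the paper; the `Signal` and `MultimodalUtterance` names and the time-overlap heuristic are assumptions) of how co-temporal speech and gesture signals could jointly constrain reference in a way a text-only channel cannot:

```python
from dataclasses import dataclass, field

@dataclass
class Signal:
    """One time-stamped observation on a single communicative channel."""
    channel: str      # e.g. "speech", "gesture", "gaze", "location"
    start: float      # seconds from interaction onset
    end: float
    content: object   # channel-specific payload (text, deixis target, ...)

@dataclass
class MultimodalUtterance:
    """A bundle of co-temporal signals that together ground one meaning."""
    signals: list = field(default_factory=list)

    def co_temporal(self, a, b, slack=0.5):
        """Pairs of signals on channels a and b that overlap in time
        (within `slack` seconds): candidate grounding constraints that a
        language-only agent, seeing just the speech channel, never gets."""
        return [(x, y)
                for x in self.signals if x.channel == a
                for y in self.signals if y.channel == b
                if x.start - slack <= y.end and y.start - slack <= x.end]

# "Put that one there" plus a pointing gesture: the temporal overlap lets
# the agent bind the demonstrative to the indicated object.
u = MultimodalUtterance([
    Signal("speech", 0.0, 0.8, "put that one there"),
    Signal("gesture", 0.2, 0.6, {"type": "point", "target": "block_3"}),
])
print(u.co_temporal("speech", "gesture"))
```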
Award ID(s):
2019805
PAR ID:
10494303
Author(s) / Creator(s):
;
Publisher / Repository:
Association for Computational Linguistics
Date Published:
Journal Name:
Proceedings of the First Workshop on Bridging Human–Computer Interaction and Natural Language Processing
Format(s):
Medium: X
Location:
Online
Sponsoring Org:
National Science Foundation
More Like this
  1. The EngageAI Institute focuses on AI-driven narrative-centered learning environments that create engaging story-based problem-solving experiences to support collaborative learning. The institute's research has three complementary strands. First, the institute creates narrative-centered learning environments that generate interactive story-based problem scenarios to elicit rich communication, encourage coordination, and spark collaborative creativity. Second, the institute creates virtual embodied conversational agent technologies with multiple modalities for communication (speech, facial expression, gesture, gaze, and posture) to support student learning. Embodied conversational agents are driven by advances in natural language understanding, natural language generation, and computer vision. Third, the institute creates an innovative multimodal learning analytics framework that analyzes parallel streams of multimodal data derived from students' conversations, gaze, facial expressions, gesture, and posture as they interact with each other, with teachers, and with embodied conversational agents. Woven throughout the institute's activities is a strong focus on ethics, with an emphasis on creating AI-augmented learning that is deeply informed by considerations of fairness, accountability, transparency, trust, and privacy. The institute emphasizes broad participation and diverse perspectives to ensure that advances in AI-augmented learning address inequities in STEM. The institute brings together a multistate network of universities, diverse K-12 school systems, science museums, and nonprofit partners. Key to all of these endeavors is an emphasis on diversity, equity, and inclusion.
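To suggest what a multimodal learning analytics pipeline of this kind might look like at its simplest, here is a toy Python sketch (not the institute's actual framework; the event format and the five-second window width are assumptions) that buckets parallel, time-stamped speech, gaze, gesture, and posture events into shared analysis windows:

```python
from collections import defaultdict

def window_streams(events, width=5.0):
    """Bucket time-stamped (t, stream, value) events into fixed-width
    windows so co-occurring speech, gaze, gesture, and posture can be
    analyzed together."""
    windows = defaultdict(lambda: defaultdict(list))
    for t, stream, value in events:
        windows[int(t // width)][stream].append(value)
    return windows

# Toy parallel streams from one collaborative episode (illustrative data).
events = [
    (1.2, "speech", "what if we move it here"),
    (1.4, "gaze", "partner"),
    (2.0, "gesture", "point:screen"),
    (7.5, "speech", "run it again"),
    (7.9, "posture", "lean_in"),
]
for w, streams in sorted(window_streams(events).items()):
    print(f"window {w}: {dict(streams)}")
```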
  2. Previous research has established that embodied modeling (role-playing agents in a system) can support learning about complexity. Separately, research has demonstrated that increasing the multimodal resources available to students can support sensemaking, particularly for students classified as English Learners. This study bridges these two bodies of research to consider how embodied models can strengthen an interconnected system of multimodal models created by a classroom. We explore how iteratively refining embodied modeling activities strengthened connections to other models, real-world phenomena, and multimodal representations. Through design-based research in a sixth grade classroom studying ecosystems, we refined embodied modeling activities initially conceived as supports for computational thinking and modeling. Across three iterative cycles, we illustrate how the conceptual and epistemic relationship between the computational and embodied model shifted, and we analyze how these shifts shaped opportunities for learning and participation by: (1) recognizing each student’s perspectives as critical for making sense of the model, (2) encouraging students to question and modify the “code” for the model, and (3) leveraging multimodal resources, including graphs, gestures, and student-generated language, for meaning-making. Through these shifts, the embodied model became a full-fledged component of the classroom’s model system and created more equitable opportunities for learning and participation. 
  3. We present a five-year retrospective on the development of the VoxWorld platform, first introduced as a multimodal platform for modeling motion language, which has evolved into a platform for rapidly building and deploying embodied agents with contextual and situational awareness, capable of interacting with humans in multiple modalities and exploring their environments. In particular, we discuss the evolution from the theoretical underpinnings of the VoxML modeling language to a platform that accommodates both neural and symbolic inputs to build agents capable of multimodal interaction and hybrid reasoning. We focus on three distinct agent implementations and the functionality needed to accommodate all of them: Diana, a virtual collaborative agent; Kirby, a mobile robot; and BabyBAW, an agent that self-guides its own exploration of the world.
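The hybrid neural/symbolic design described above can be suggested with a toy Python sketch (our illustration, not VoxWorld or VoxML code; the `perceive` and `reason` functions and the fact format are assumptions): a neural module distills raw perception into symbolic facts, and a symbolic module reasons over those facts to choose an action:

```python
def perceive(frame):
    """Stand-in for a neural perception module: distills raw sensor data
    into symbolic facts (here, hard-coded detections for illustration)."""
    return [("block", "red", (0.4, 0.1)), ("block", "blue", (0.6, 0.3))]

def reason(facts, goal):
    """Stand-in for symbolic reasoning: pick an action whose preconditions
    the perceived facts satisfy, or fall back to asking the human."""
    for _kind, color, pos in facts:
        if goal == ("grasp", color):
            return {"action": "grasp", "at": pos}
    return {"action": "ask_clarification"}

# One step of the hybrid loop: neural input in, symbolic action out.
print(reason(perceive(frame=None), goal=("grasp", "red")))
```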
  4. Education is poised for a transformative shift with the advent of neurosymbolic artificial intelligence (NAI), which will redefine how we support deeply adaptive and personalized learning experiences. The integration of Knowledge Graphs (KGs) with Large Language Models (LLMs), a significant and popular form of NAI, presents a promising avenue for advancing personalized instruction via neurosymbolic educational agents. By leveraging structured knowledge, these agents can provide individualized learning experiences that align with specific learner preferences and desired learning paths, while also mitigating biases inherent in traditional AI systems. NAI-powered education systems will be capable of interpreting complex human concepts and contexts while employing advanced problem-solving strategies, all grounded in established pedagogical frameworks. In this paper, we propose a system that leverages the unique affordances of KGs, LLMs, and pedagogical agents (embodied characters designed to enhance learning) as critical components of a hybrid NAI architecture. We discuss the rationale for our system design and the preliminary findings of our work. We conclude that education in the era of NAI will make learning more accessible, equitable, and aligned with real-world skills, opening a new depth of understanding in educational tools.
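A minimal Python sketch of the KG-plus-LLM pattern described above (illustrative only, not the proposed system; the triple store, `retrieve` helper, and prompt format are assumptions) might ground a tutoring prompt in structured facts about a learner like this:

```python
# Toy knowledge graph: (subject, relation, object) triples about a learner.
KG = [
    ("learner_42", "prefers", "visual_examples"),
    ("learner_42", "completed", "fractions_unit_1"),
    ("fractions_unit_2", "requires", "fractions_unit_1"),
]

def retrieve(entity):
    """Pull every triple mentioning the entity, to ground the prompt."""
    return [t for t in KG if entity in (t[0], t[2])]

def build_prompt(learner, question):
    """Compose an LLM prompt constrained by KG facts, so the generated
    tutoring move respects what the graph records about this learner."""
    facts = "\n".join(f"- {s} {r} {o}" for s, r, o in retrieve(learner))
    return (f"Known facts:\n{facts}\n\n"
            f"As a tutor, answer for this learner:\n{question}")

print(build_prompt("learner_42", "What should I study next?"))
```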