Abstract
Effective interactions between humans and robots are vital to achieving shared tasks in collaborative processes. Robots can utilize diverse communication channels to interact with humans, such as hearing, speech, sight, touch, and learning. Among these means of interaction, our focus is on three emerging frontiers that significantly impact the future directions of human–robot interaction (HRI): (i) human–robot collaboration inspired by human–human collaboration, (ii) brain–computer interfaces, and (iii) emotionally intelligent perception. First, we explore advanced techniques for human–robot collaboration, covering a range of methods from compliance- and performance-based approaches to synergistic and learning-based strategies, including learning from demonstration, active learning, and learning from complex tasks. Then, we examine innovative uses of brain–computer interfaces for enhancing HRI, with a focus on applications in rehabilitation, communication, and brain-state and emotion recognition. Finally, we investigate emotional intelligence in robotics, focusing on translating human emotions to robots via facial expressions, body gestures, and eye tracking for fluid, natural interactions. We detail and discuss recent developments in these emerging frontiers and their impact on HRI, highlighting contemporary trends and emerging advancements in the field. Ultimately, this paper underscores the necessity of a multimodal approach in developing systems capable of adaptive behavior and effective interaction between humans and robots, thus offering a thorough understanding of the diverse modalities essential for maximizing the potential of HRI.
Multimodal Semantics for Affordances and Actions
In this paper, we argue that, as HCI becomes more multimodal with the integration of gesture, gaze, posture, and other nonverbal behavior, it is important to understand the role played by affordances and their associated actions in human-object interactions (HOI), so as to facilitate reasoning in HCI and HRI environments. We outline the requirements and challenges involved in developing a multimodal semantics for human-computer and human-robot interactions. Unlike unimodal interactive agents (e.g., text-based chatbots or voice-based personal digital assistants), multimodal HCI and HRI inherently require a notion of embodiment, i.e., an understanding of the agent's placement within the environment and that of its interlocutor. We present a dynamic semantics for the language VoxML, which models human-computer, human-robot, and human-human interactions by creating multimodal simulations of both the communicative content and the agents' common ground, and we show the utility of VoxML information that is reified within the environment for the computational understanding of objects in HOI.
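For concreteness, the sketch below renders a simplified VoxML-style voxeme for a cup as a plain Python dictionary. The attribute names (LEX, TYPE, HABITAT, AFFORD_STR, EMBODIMENT) follow the published VoxML specification, but the concrete values and the helper function are illustrative assumptions, not an excerpt from the paper.

```python
# A minimal, illustrative VoxML-style voxeme for "cup", rendered as a
# Python dict. Attribute names follow the VoxML specification; the
# concrete values are assumptions chosen for readability.
cup_voxeme = {
    "LEX": {"PRED": "cup", "TYPE": "physobj"},
    "TYPE": {
        "HEAD": "cylindroid",           # approximate geometry
        "COMPONENTS": ["interior", "base"],
        "CONCAVITY": "concave",         # a cup can contain things
        "ROTAT_SYM": ["Y"],             # rotationally symmetric about Y
        "REFL_SYM": ["XY", "YZ"],
    },
    "HABITAT": {
        # Intrinsic habitat: canonical orientation (opening faces up).
        "INTR": {"UP": "align(Y, E_Y)", "TOP": "top(+Y)"},
    },
    "AFFORD_STR": {
        # Gibsonian affordance: graspable when in its canonical habitat.
        "A1": "H[intr] -> [grasp(agent, this)]",
        # Telic affordance: holding liquid enables drinking.
        "A2": "H[intr] -> [fill(agent, this, liquid)] -> drink",
    },
    "EMBODIMENT": {"SCALE": "<agent", "MOVABLE": True},
}

def afforded_actions(voxeme: dict) -> list[str]:
    """Return the affordance entries an HOI reasoner could query."""
    return list(voxeme["AFFORD_STR"].values())

print(afforded_actions(cup_voxeme))
```

An embodied agent grounded in such entries can infer, for instance, that a concave, upright object affords filling before it affords drinking.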
- Award ID(s): 2033932
- PAR ID: 10379212
- Date Published:
- Journal Name: Lecture notes in computer science
- ISSN: 1611-3349
- Page Range / eLocation ID: 137–160
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
-
Work in Human–Robot Interaction (HRI) has investigated interactions between one human and one robot as well as human–robot group interactions. Yet the field lacks a clear definition and understanding of the influence a robot can exert on interactions between other group members (e.g., human-to-human). In this article, we define Interaction-Shaping Robotics (ISR), a subfield of HRI that investigates robots that influence the behaviors and attitudes exchanged between two (or more) other agents. We highlight key factors of interaction-shaping robots that include the role of the robot, the robot-shaping outcome, the form of robot influence, the type of robot communication, and the timeline of the robot’s influence. We also describe three distinct structures of human–robot groups to highlight the potential of ISR in different group compositions and discuss targets for a robot’s interaction-shaping behavior. Finally, we propose areas of opportunity and challenges for future research in ISR.
-
The integration of Augmented Reality (AR) into Human–Robot Interaction (HRI) represents a significant advancement in collaborative technologies. This paper provides a comprehensive review of AR applications within HRI with a focus on manufacturing, emphasizing their role in enhancing collaboration, trust, and safety. By aggregating findings from numerous studies, this research highlights key challenges, including the need for improved Situational Awareness, enhanced safety, and more effective communication between humans and robots. A framework developed from the literature is presented, detailing the critical elements of AR necessary for advancing HRI. The framework outlines effective methods for continuously evaluating AR systems for HRI, and is supported by two case studies and an ongoing research effort presented in this paper. This structured approach focuses on enhancing collaboration and safety, with a strong emphasis on integrating best practices from Human–Computer Interaction (HCI) centered around user experience and design.
-
This paper introduces a new ROSbag-based multimodal affective dataset for emotional and cognitive states, generated using the Robot Operating System (ROS). We utilized images and sounds from the International Affective Pictures System (IAPS) and the International Affective Digitized Sounds (IADS) to stimulate targeted emotions (happiness, sadness, anger, fear, surprise, disgust, and neutral), and a dual N-back game to stimulate different levels of cognitive workload. Thirty human subjects participated in the user study; their physiological data were collected using the latest commercial wearable sensors, behavioral data were collected using hardware devices such as cameras, and subjective assessments were carried out through questionnaires. All data were stored in single ROSbag files rather than in conventional Comma-Separated Values (CSV) files. This not only ensures synchronization of signals and videos in the dataset, but also allows researchers to easily analyze and verify their algorithms by connecting directly to the dataset through ROS. The generated affective dataset consists of 1,602 ROSbag files, with a total size of about 787 GB. The dataset is made publicly available. We expect our dataset to be a great resource for researchers in the fields of affective computing, Human-Computer Interaction (HCI), and Human-Robot Interaction (HRI).
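Because the data are distributed as ROSbag files, a file can be read directly with the standard ROS 1 `rosbag` Python API rather than parsed from CSV. The sketch below shows the general pattern; the file name and topic names are placeholders, since the dataset's actual topic layout is not specified here.

```python
# Minimal sketch of reading one file from a ROSbag-based dataset with the
# standard ROS 1 Python API (the `rosbag` package that ships with ROS).
# The file name and topic names below are illustrative placeholders.
import rosbag

bag = rosbag.Bag("subject01_trial01.bag")
try:
    # Enumerate the topics actually recorded in this file.
    info = bag.get_type_and_topic_info()
    for topic, meta in info.topics.items():
        print(topic, meta.msg_type, meta.message_count)

    # Iterate over messages on a (hypothetical) physiological-signal topic.
    for topic, msg, t in bag.read_messages(topics=["/physio/heart_rate"]):
        print(t.to_sec(), msg)
finally:
    bag.close()
```

Reading the bag through ROS in this way preserves the recorded timestamps, which is what keeps the physiological signals and videos synchronized without any manual alignment step.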
-
Do agents know each other's strategies? In multi-process software construction, each process has access to the processes already constructed, but in typical human-robot interactions, a human may not announce their strategy to the robot (indeed, the human may not even know their own strategy). This question has often been overlooked when modeling and reasoning about multi-agent systems. In this work, we study how it impacts strategic reasoning. To do so, we consider Strategy Logic (SL), a well-established and highly expressive logic for strategic reasoning. Its usual semantics, which we call "white-box semantics", models systems in which agents "broadcast" their strategies. By adding imperfect information to the evaluation games for the usual semantics, we obtain a new semantics called "black-box semantics", in which agents keep their strategies private. We consider the model-checking problem and show that the black-box semantics has much lower complexity than the white-box semantics for an important fragment of Strategy Logic.
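To make the distinction concrete, consider a standard SL-style formula of the kind such a semantics interprets (our illustrative example, not one taken from the paper):

```latex
% Illustrative SL formula (our example, not the paper's): agent a has a
% strategy x such that, against every strategy y of agent b, the goal
% eventually holds.
\[
  \langle\!\langle x \rangle\!\rangle \, [\![ y ]\!] \,
  (a, x)\,(b, y)\; \mathbf{F}\, \mathit{goal}_a
\]
```

Under the usual white-box semantics, the universally quantified strategy y may depend on the full strategy x, as if a had broadcast it; under the black-box semantics, y is chosen without access to x, and it is this restriction that lowers the model-checking complexity for the fragment studied.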