DIM: Dyadic Interaction Modeling for Social Behavior Generation

Tran, Minh; Chang, Di; Siniukov, Maksim; Soleymani, Mohammad

doi:10.1007/978-3-031-72913-3_27


This content will become publicly available on December 2, 2025


Human-human communication is like a delicate dance where listeners and speakers concurrently interact to maintain conversational dynamics. Hence, an effective model for generating listener nonverbal behaviors requires understanding the dyadic context and interaction. In this paper, we present an effective framework for creating 3D facial motions in dyadic interactions. Existing work considers the listener a reactive agent whose behaviors are reflexive responses to the speaker’s voice and facial motions. The heart of our framework is Dyadic Interaction Modeling (DIM), a pre-training approach that jointly models speakers’ and listeners’ motions through masking and contrastive learning to learn representations that capture the dyadic context. To enable the generation of non-deterministic behaviors, we encode both listener and speaker motions into discrete latent representations through a VQ-VAE. The pre-trained model is further fine-tuned for motion generation. Extensive experiments demonstrate the superiority of our framework in generating listener motions, establishing a new state of the art according to quantitative measures that capture the diversity and realism of generated motions. Qualitative results demonstrate the superior capabilities of the proposed approach in generating diverse and realistic expressions, eye blinks, and head gestures.
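The discretization step mentioned in the abstract can be illustrated with a minimal sketch of vector quantization, the nearest-codebook lookup at the core of a VQ-VAE. This is not the authors' implementation: the function name `quantize`, the toy codebook, and the array shapes are all assumptions chosen for illustration, and the encoder, decoder, and training losses are omitted.

```python
import numpy as np

def quantize(latents, codebook):
    """Map each latent vector to the index of its nearest codebook entry."""
    # Squared Euclidean distance between every latent (T, D) and every code (K, D).
    dists = ((latents[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    indices = dists.argmin(axis=1)      # (T,) discrete tokens, one per frame
    return indices, codebook[indices]   # tokens and their quantized vectors

# Hypothetical toy setup: K=4 codes of dimension D=4, and T=3 motion frames.
codebook = np.eye(4)
latents = np.array([
    [0.1, 0.0, 0.0, 0.9],
    [0.0, 0.1, 0.0, 0.8],
    [0.2, 0.9, 0.0, 0.0],
])
tokens, quantized = quantize(latents, codebook)
print(tokens)  # [3 3 1]
```

Because each frame is reduced to a token index, a downstream model can predict a distribution over tokens and sample from it, which is one common way discrete latents support non-deterministic generation.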

Award ID(s): 2211550

PAR ID: 10620786

Author(s) / Creator(s): Tran, Minh; Chang, Di; Siniukov, Maksim; Soleymani, Mohammad

Editor(s): Leonardis, Aleš; Ricci, Elisa; Roth, Stefan; Russakovsky, Olga; Sattler, Torsten; Varol, Gul

Publisher / Repository: Springer Nature Switzerland

Date Published: 2024-12-02

Page Range / eLocation ID: 484 to 503

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Book Chapter:
https://doi.org/10.1007/978-3-031-72913-3_27
