Understanding why certain individuals work well (or poorly) together as a team is a key research focus in the psychological and behavioral sciences and a fundamental problem for team-based organizations. Nevertheless, we have a limited ability to predict the social and work-related dynamics that will emerge from a given combination of team members. In this work, we model vocal turn-taking behavior within conversations as a parametric stochastic process on a network composed of the team members. More precisely, we model the dynamics of exchanging the 'speaker token' among team members as a random walk on a graph, driven by both individual-level features and the conversation history. We fit our model to conversational turn-taking data extracted from audio recordings of multinational student teams during undergraduate engineering design internships. Using these real-world data, we validate the explanatory power of our model and unveil statistically significant differences in speaking behavior between team members of different nationalities.
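A minimal sketch of such a history-driven walk, assuming an illustrative parametrisation (per-member propensity weights plus an exponential-decay recency boost, combined through a softmax); the function name and the specific functional form are assumptions for illustration, not the authors' fitted model:

```python
import numpy as np

def next_speaker_probs(propensity, history, decay=0.5):
    """Illustrative transition kernel for the 'speaker token' random walk.

    propensity : per-member individual-level weights (np.ndarray)
    history    : indices of members who held the token, oldest first
    """
    # Recency term: members who spoke recently get an exponentially decayed boost.
    recency = np.zeros_like(propensity, dtype=float)
    for age, member in enumerate(reversed(history)):
        recency[member] += decay ** age
    scores = np.exp(propensity + recency)
    scores[history[-1]] = 0.0  # the token must pass to a different member
    return scores / scores.sum()
```

With equal individual propensities, a member who spoke earlier in the conversation remains more likely to receive the token back than one who has not spoken at all.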
Multimodal Turn Analysis and Prediction for Multi-party Conversations
This paper presents a computational study to analyze and predict turns (i.e., turn-taking and turn-keeping) in multiparty conversations. Specifically, we use a high-fidelity hybrid data acquisition system to capture a large-scale set of multimodal natural conversational behaviors of interlocutors in three-party conversations, including gaze, head movements, body movements, and speech. Based on the inter-pausal units (IPUs) extracted from this in-house dataset, we propose a transformer-based computational model that predicts turns from the interlocutor states (speaking/back-channeling/silence) and the gaze targets. Our model robustly achieves more than 80% accuracy, and its generalizability was extensively validated through cross-group experiments. We also introduce a novel computational metric, the "relative engagement level" (REL) of IPUs, and validate its statistically significant difference between turn-keeping IPUs and turn-taking IPUs, and between different conversational groups. Our experimental results further show that the patterns of the interlocutor states are a more effective cue than gaze behaviors for predicting turns in multiparty conversations.
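Inter-pausal units are conventionally defined as stretches of one speaker's speech separated by silences longer than a fixed threshold. A minimal sketch of that segmentation step, assuming a 200 ms threshold (a common convention; the paper's exact threshold is not stated here):

```python
def extract_ipus(voiced_intervals, pause_threshold=0.2):
    """Merge a speaker's voiced intervals into inter-pausal units (IPUs).

    voiced_intervals : sorted list of (start, end) times in seconds
    pause_threshold  : silences shorter than this do not break an IPU
    """
    ipus = [list(voiced_intervals[0])]
    for start, end in voiced_intervals[1:]:
        if start - ipus[-1][1] < pause_threshold:
            ipus[-1][1] = end          # short pause: extend the current IPU
        else:
            ipus.append([start, end])  # long pause: start a new IPU
    return [tuple(u) for u in ipus]
```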
- Award ID(s):
- 2005430
- PAR ID:
- 10532343
- Publisher / Repository:
- ACM
- Date Published:
- ISBN:
- 9798400700552
- Page Range / eLocation ID:
- 436 to 444
- Subject(s) / Keyword(s):
- Multi-party conversations; conversational gesture understanding; multimodal interaction; machine learning; human-human interaction; empirical studies
- Format(s):
- Medium: X
- Location:
- Paris France
- Sponsoring Org:
- National Science Foundation
More Like this
-
Participants in a conversation must carefully monitor the turn-management (speaking and listening) willingness of the other conversational partners and adjust their turn-changing behaviors accordingly to keep the conversation smooth. Many studies have focused on developing turn-changing (i.e., next-speaker or end-of-turn) models that predict whether turn-keeping or turn-changing will occur. Participants' verbal and non-verbal behaviors have been used as input features for these predictive models. To the best of our knowledge, existing studies only model the relationship between participant behavior and turn-changing; there is no model that accounts for participants' willingness to acquire a turn (turn-management willingness). In this paper, we address the challenge of building such models to predict the willingness of both speakers and listeners. First, we find that a dissonance exists between willingness and actual turn-changing. Second, we propose predictive models based on trimodal inputs: acoustic, linguistic, and visual cues distilled from conversations. We also study whether modeling willingness helps improve turn-changing prediction. To do so, we introduce a dyadic conversation corpus with annotated scores of speaker/listener turn-management willingness. Our results show that using all three modalities of both the speaker and the listener is critically important for predicting turn-management willingness. Furthermore, explicitly adding willingness as a prediction task improves the performance of turn-changing prediction, and turn-management willingness prediction becomes more accurate when willingness and turn-changing are predicted jointly using multi-task learning techniques.
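The joint-prediction idea can be sketched as a weighted multi-task objective: a classification term for turn-changing plus a regression term for the willingness score. The weighting scheme, loss choices, and function name below are illustrative assumptions, not the paper's exact setup:

```python
import math

def multitask_loss(turn_logit, turn_label, will_pred, will_true, alpha=0.5):
    """Joint loss: turn-changing classification + willingness regression.

    turn_logit : raw model score for turn-changing (higher = more likely)
    turn_label : ground truth, 1 = turn-changing, 0 = turn-keeping
    will_pred / will_true : predicted and annotated willingness scores
    alpha      : weight balancing the two task losses
    """
    p = 1.0 / (1.0 + math.exp(-turn_logit))  # sigmoid
    bce = -(turn_label * math.log(p) + (1 - turn_label) * math.log(1 - p))
    mse = (will_pred - will_true) ** 2
    return alpha * bce + (1 - alpha) * mse
```

Sharing parameters while minimizing this combined loss is what lets the willingness task regularize, and be regularized by, turn-changing prediction.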
-
We present a database for automatic understanding of Social Engagement in MultiParty Interaction (SEMPI). Social engagement is an important social signal characterizing the level of participation of an interlocutor in a conversation; it involves maintaining attention and establishing connection and rapport. Machine understanding of social engagement can enable an autonomous agent to better gauge the state of human participation and involvement and select optimal actions in human-machine social interaction. Recently, video-mediated interaction platforms, e.g., Zoom, have become very popular. The ease of use and increased accessibility of video calls have made them a preferred medium for multiparty conversations, including support groups and group therapy sessions. To create this dataset, we first collected a set of publicly available video calls posted on YouTube. We then segmented the videos by speech turn and cropped them to generate single-participant videos. We developed a questionnaire for assessing the level of social engagement by listeners in a conversation, probing the nonverbal behaviors relevant to social engagement, including back-channeling, gaze, and expressions. Using Prolific, a crowd-sourcing platform, we had three people annotate each of 3,505 videos of 76 listeners, reaching a moderate-to-high inter-rater agreement of 0.693. This resulted in a database with aggregated engagement scores from the annotators. We developed a baseline multimodal pipeline using state-of-the-art pre-trained models to track the level of engagement, achieving a concordance correlation coefficient (CCC) of 0.454. The results demonstrate the utility of the database for future applications in video-mediated human-machine interaction and human-human social skill assessment. Our dataset and code are available at https://github.com/ihp-lab/SEMPI.
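The CCC reported above is Lin's concordance correlation coefficient, a standard agreement measure between predicted and annotated scores. A minimal implementation (using population variances, the usual convention for this metric):

```python
import numpy as np

def ccc(pred, true):
    """Lin's concordance correlation coefficient between two score series."""
    x, y = np.asarray(pred, float), np.asarray(true, float)
    mx, my = x.mean(), y.mean()
    cov = ((x - mx) * (y - my)).mean()
    # Penalizes both low correlation and shifts in mean or scale.
    return 2.0 * cov / (x.var() + y.var() + (mx - my) ** 2)
```

Unlike plain Pearson correlation, CCC drops below 1 when predictions are correlated with but systematically offset from the annotations, which is why it is preferred for continuous engagement tracking.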
-
Effective storytelling relies on engagement and interaction. This work develops an automated software platform for telling stories to children and investigates the impact of two design choices on children's engagement and willingness to interact with the system: story distribution and the use of complex gesture. A storyteller condition compares stories told in a third-person narrator voice with those distributed between a narrator and first-person story characters. Basic gestures are used in all of our storytellings, but, in a second factor, some are augmented with gestures that indicate conversational turn changes, reference other characters, and prompt children to ask questions. An analysis of eye gaze indicates that children attend more to the story when a distributed storytelling model is used. Gesture prompts appear to encourage children to ask questions, something the children did, but at a relatively low rate. Interestingly, the children most frequently asked "why" questions. Gaze switching happened more quickly when the story characters began to speak than for narrator turns. These results have implications for future agent-based storytelling system research.