How Noisy is Too Noisy? The Impact of Data Noise on Multimodal Recognition of Confusion and Conflict During Collaborative Learning

Ma, Yingbo; Celepkolu, Mehmet; Boyer, Kristy Elizabeth; Lynch, Collin F.; Wiebe, Eric; Israel, Maya

doi:10.1145/3577190.3614127

Citation Details

How Noisy is Too Noisy? The Impact of Data Noise on Multimodal Recognition of Confusion and Conflict During Collaborative Learning

Intelligent systems to support collaborative learning rely on real-time behavioral data, including language, audio, and video. However, noisy data, such as word errors in speech recognition, audio static or background noise, and facial mistracking in video, often limit the utility of multimodal data. It is an open question of how we can build reliable multimodal models in the face of substantial data noise. In this paper, we investigate the impact of data noise on the recognition of confusion and conflict moments during collaborative programming sessions by 25 dyads of elementary school learners. We measure language errors with word error rate (WER), audio noise with speech-to-noise ratio (SNR), and video errors with frame-by-frame facial tracking accuracy. The results showed that the model’s accuracy for detecting confusion and conflict in the language modality decreased drastically from 0.84 to 0.73 when the WER exceeded 20%. Similarly, in the audio modality, the model’s accuracy decreased sharply from 0.79 to 0.61 when the SNR dropped below 5 dB. Conversely, the model’s accuracy remained relatively constant in the video modality at a comparable level (> 0.70) so long as at least one learner’s face was successfully tracked. Moreover, we trained several multimodal models and found that integrating multimodal data could effectively offset the negative effect of noise in unimodal data, ultimately leading to improved accuracy in recognizing confusion and conflict. These findings have practical implications for the future deployment of intelligent systems that support collaborative learning in actual classroom settings. more »

Award ID(s):: 2229612

PAR ID:: 10496733

Author(s) / Creator(s):: Ma, Yingbo; Celepkolu, Mehmet; Boyer, Kristy Elizabeth; Lynch, Collin F.; Wiebe, Eric; Israel, Maya

Publisher / Repository:: ACM

Date Published:: 2023-10-09

Journal Name:: Proceedings of the 25th International Conference on Multimodal Interaction

ISBN:: 9798400700552

Page Range / eLocation ID:: 326 to 335

Format(s):: Medium: X

Location:: Paris France

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3577190.3614127

More Like this