skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: A Multimodal-Sensor-Enabled Room for Unobtrusive Group Meeting Analysis
Group meetings can suffer from serious problems that undermine performance, including bias, "groupthink", fear of speaking, and unfocused discussion. To better understand these issues, propose interventions, and thus improve team performance, we need to study human dynamics in group meetings. However, this process currently heavily depends on manual coding and video cameras. Manual coding is tedious, inaccurate, and subjective, while active video cameras can affect the natural behavior of meeting participants. Here, we present a smart meeting room that combines microphones and unobtrusive ceiling-mounted Time-of-Flight (ToF) sensors to understand group dynamics in team meetings. We automatically process the multimodal sensor outputs with signal, image, and natural language processing algorithms to estimate participant head pose, visual focus of attention (VFOA), non-verbal speech patterns, and discussion content. We derive metrics from these automatic estimates and correlate them with user-reported rankings of emergent group leaders and major contributors to produce accurate predictors. We validate our algorithms and report results on a new dataset of lunar survival tasks of 36 individuals across 10 groups collected in the multimodal-sensor-enabled smart room.  more » « less
Award ID(s):
1631674
PAR ID:
10107376
Author(s) / Creator(s):
; ; ; ; ; ; ; ; ;
Date Published:
Journal Name:
Proceedings of the 20th ACM International Conference on Multimodal Interaction
Page Range / eLocation ID:
347 to 355
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Studying group dynamics requires fine-grained spatial and temporal understanding of human behavior. Social psychologists studying human interaction patterns in face-to-face group meetings often find themselves struggling with huge volumes of data that require many hours of tedious manual coding. There are only a few publicly available multi-modal datasets of face-to-face group meetings that enable the development of automated methods to study verbal and non-verbal human behavior. In this paper, we present a new, publicly available multi-modal dataset for group dynamics study that differs from previous datasets in its use of ceiling-mounted, unobtrusive depth sensors. These can be used for fine-grained analysis of head and body pose and gestures, without any concerns about participants' privacy or inhibited behavior. The dataset is complemented by synchronized and time-stamped meeting transcripts that allow analysis of spoken content. The dataset comprises 22 group meetings in which participants perform a standard collaborative group task designed to measure leadership and productivity. Participants' post-task questionnaires, including demographic information, are also provided as part of the dataset. We show the utility of the dataset in analyzing perceived leadership, contribution, and performance, by presenting results of multi-modal analysis using our sensor-fusion algorithms designed to automatically understand audio-visual interactions. 
    more » « less
  2. Global teams frequently consist of language-based subgroups who put together complementary information to achieve common goals. Previous research outlines a two-step work communication flow in these teams. There are team meetings using a required common language (i.e., English); in preparation for those meetings, people have subgroup conversations in their native languages. Work communication at team meetings is often less effective than in subgroup conversations. In the current study, we investigate the idea of leveraging machine translation (MT) to facilitate global team meetings. We hypothesize that exchanging subgroup conversation logs before a team meeting offers contextual information that benefits teamwork at the meeting. MT can translate these logs, which enables comprehension at a low cost. To test our hypothesis, we conducted a between-subjects experiment where twenty quartets of participants performed a personnel selection task. Each quartet included two English native speakers (NS) and two non-native speakers (NNS) whose native language was Mandarin. All participants began the task with subgroup conversations in their native languages, then proceeded to team meetings in English. We manipulated the exchange of subgroup conversation logs prior to team meetings: with MT-mediated exchanges versus without. Analysis of participants' subjective experience, task performance, and depth of discussions as reflected through their conversational moves jointly indicates that team meeting quality improved when there were MT-mediated exchanges of subgroup conversation logs as opposed to no exchanges. We conclude with reflections on when and how MT could be applied to enhance global teamwork across a language barrier. 
    more » « less
  3. The detection and counting of pedestrians plays a central role for the design of smart cities. Although the use of cameras for this task has been shown to have high accuracy, they come at a high cost and are susceptible to challenges such as poor lighting, fog, and obstructed views. Our study investigates audio-based pedestrian detection, combining potentially low cost sensors with advanced machine learning based audio analysis algorithms. With an audio sensor installed along the walkway, machine learning algorithms can tell from the audio whether there is a pedestrian or not, or how far the pedestrian is from the sensor. Results show the general feasibility of audio-based pedestrian detection but fall short of reaching the accuracy levels of video-based detection. 
    more » « less
  4. Meetings are a frequent part of life for a software developer. Software design is often performed, discussed, and reviewed in these meetings. This means that meetings may contain important design information that could be captured for later use. Meeting design tools may be a way to capture design information as a byproduct of discussion that arises in these meetings. In this paper, we identify a list of key meeting support tool features that could support the capture and retrieval of design information and compare these to features currently offered in commercial meeting support tools. 
    more » « less
  5. IntroductionThis team science case study explores one cross-disciplinary science institute's change process for redesigning a weekly research coordination meeting. The narrative arc follows four stages of the adaptive process in complex adaptive systems: disequilibrium, amplification, emergence, and new order. MethodsThis case study takes an interpretative, participatory approach, where the objective is to understand the phenomena within the social context and deepen understanding of how the process unfolds over time and in context. Multiple data sources were collected and analyzed. ResultsA new adaptive order for the weekly research coordination meeting was established. The mechanism for the success of the change initiative was best explained by complexity leadership theory. DiscussionImplications for team science practice include generating momentum for change, re-examining power dynamics, defining critical teaming professional roles, building multiple pathways towards team capacity development, and holding adaptive spaces. Promising areas for further exploration are also presented. 
    more » « less