NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

From the Lab to the Wild: Examining Generalizability of Video-based Mind Wandering Detection

https://doi.org/10.1007/s40593-024-00412-2

Bühler, Babette; Bozkir, Efe; Goldberg, Patricia; Sümer, Ömer; D’Mello, Sidney; Gerjets, Peter; Trautwein, Ulrich; Kasneci, Enkelejda (June 2024, International Journal of Artificial Intelligence in Education)

Abstract Student’s shift of attention away from a current learning task to task-unrelated thought, also called mind wandering, occurs about 30% of the time spent on education-related activities. Its frequent occurrence has a negative effect on learning outcomes across learning tasks. Automated detection of mind wandering might offer an opportunity to assess the attentional state continuously and non-intrusively over time and hence enable large-scale research on learning materials and responding to inattention with targeted interventions. To achieve this, an accessible detection approach that performs well for various systems and settings is required. In this work, we explore a new, generalizable approach to video-based mind wandering detection that can be transferred to naturalistic settings across learning tasks. Therefore, we leverage two datasets, consisting of facial videos during reading in the lab (N = 135) and lecture viewing in-the-wild (N = 15). When predicting mind wandering, deep neural networks (DNN) and long short-term memory networks (LSTMs) achieve F$$_{1}$$ $_{1}$ scores of 0.44 (AUC-PR = 0.40) and 0.459 (AUC-PR = 0.39), above chance level, with latent features based on transfer-learning on the lab data. When exploring generalizability by training on the lab dataset and predicting on the in-the-wild dataset, BiLSTMs on latent features perform comparably to the state-of-the-art with an F$$_{1}$$ $_{1}$ score of 0.352 (AUC-PR = 0.26). Moreover, we investigate the fairness of predictive models across gender and show based on post-hoc explainability methods that employed latent features mainly encode information on eye and mouth areas. We discuss the benefits of generalizability and possible applications.
more » « less
Multimodal Engagement Analysis from Facial Videos in the Classroom

https://doi.org/10.1109/TAFFC.2021.3127692

Sumer, Omer; Goldberg, Patricia; D'Mello, Sidney; Gerjets, Peter; Trautwein, Ulrich; Kasneci, Enkelejda (November 2021, IEEE Transactions on Affective Computing)

Student engagement is a key component of learning and teaching, resulting in a plethora of automated methods to measure it. Whereas most of the literature explores student engagement analysis using computer-based learning often in the lab, we focus on using classroom instruction in authentic learning environments. We collected audiovisual recordings of secondary school classes over a one and a half month period, acquired continuous engagement labeling per student (N=15) in repeated sessions, and explored computer vision methods to classify engagement from facial videos. We learned deep embeddings for attentional and affective features by training Attention-Net for head pose estimation and Affect-Net for facial expression recognition using previously-collected large-scale datasets. We used these representations to train engagement classifiers on our data, in individual and multiple channel settings, considering temporal dependencies. The best performing engagement classifiers achieved student-independent AUCs of .620 and .720 for grades 8 and 12, respectively, with attention-based features outperforming affective features. Score-level fusion either improved the engagement classifiers or was on par with the best performing modality. We also investigated the effect of personalization and found that only 60 seconds of person-specific data, selected by margin uncertainty of the base classifier, yielded an average AUC improvement of .084.
more » « less
Full Text Available
Eye-Hand Behavior in Human-Robot Shared Manipulation

https://doi.org/10.1145/3171221.3171287

Aronson, Reuben M.; Santini, Thiago; Kübler, Thomas C.; Kasneci, Enkelejda; Srinivasa, Siddhartha; Admoni, Henny (January 2018, Proceedings of the 2018 ACM/IEEE International Conference on Human-Robot Interaction)

Full Text Available
Eye tracking: empirical foundations for a minimal reporting guideline

https://doi.org/10.3758/s13428-021-01762-8

Holmqvist, Kenneth; Örbom, Saga Lee; Hooge, Ignace T.; Niehorster, Diederick C.; Alexander, Robert G.; Andersson, Richard; Benjamins, Jeroen S.; Blignaut, Pieter; Brouwer, Anne-Marie; Chuang, Lewis L.; et al (April 2022, Behavior research methods)

In this paper, we present a review of how the various aspects of any study using an eye tracker (such as the instrument, methodology, environment, participant, etc.) affect the quality of the recorded eye-tracking data and the obtained eye-movement and gaze measures. We take this review to represent the empirical foundation for reporting guidelines of any study involving an eye tracker. We compare this empirical foundation to five existing reporting guidelines and to a database of 207 published eye-tracking studies. We find that reporting guidelines vary substantially and do not match with actual reporting practices. We end by deriving a minimal, flexible reporting guideline based on empirical research (Section “An empirically based minimal reporting guideline”).
more » « less
Full Text Available

Search for: All records