NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Effects of Text Release Features on Communication and Problem-Solving in Teams with a Deaf/Hard of Hearing and a Hearing Member Using Automatic Speech Recognition: Effects of Text Release Features

https://doi.org/10.1145/3663547.3759694

Stinson, Michael; L_Carpenter, Rebecca; Easton, Donna (October 2025, ACM)

Free, publicly-accessible full text available October 22, 2026
CuCap: Comparative Analysis of Customized Captioning between North American and South Korean d/Deaf and Hard-of-Hearing Users

https://doi.org/10.1145/3663547.3746400

de_Lacerda_Pataca, Caluã; Ahn, SooYeon; Yoo, Suhyeon; Kim, JooYeong; Truong, Khai N; Hong, Jin-Hyuk; Peiris, Roshan L; Huenerfauth, Matt (October 2025, ACM)

Free, publicly-accessible full text available October 22, 2026
Notification Designs for Influencing Hearing Speakers' Behaviors During Captioned Conversations Among Mixed DHH-Hearing Groups

https://doi.org/10.1145/3744257.3744273

Seita, Matthew; Andrew, Sarah; Huenerfauth, Matt (April 2025, ACM)

Free, publicly-accessible full text available April 28, 2026
Tactile Emotions: Multimodal Affective Captioning with Haptics Improves Narrative Engagement for d/Deaf and Hard-of-Hearing Viewers

https://doi.org/10.1145/3706598.3713304

de_Lacerda_Pataca, Caluã; Hassan, Saad; May, Lloyd; Olson, Michelle M; D'aurio, Toni; Peiris, Roshan L; Huenerfauth, Matt (April 2025, ACM)

This paper explores a multimodal approach for translating emotional cues present in speech, designed with Deaf and Hard-of-Hearing (dhh) individuals in mind. Prior work has focused on visual cues applied to captions, successfully conveying whether a speaker’s words have a negative or positive tone (valence), but with mixed results regarding the intensity (arousal) of these emotions. We propose a novel method using haptic feedback to communicate a speaker’s arousal levels through vibrations on a wrist-worn device. In a formative study with 16 dhh participants, we tested six haptic patterns and found that participants preferred single per-word vibrations at 75 Hz to encode arousal. In a follow-up study with 27 dhh participants, this pattern was paired with visual cues, and narrative engagement with audio-visual content was measured. Results indicate that combining haptics with visuals significantly increased engagement compared to a conventional captioning baseline and a visuals-only affective captioning style.
more » « less
Free, publicly-accessible full text available April 25, 2026
Designing and Evaluating an Advanced Dance Video Comprehension Tool with In-situ Move Identification Capabilities

https://doi.org/10.1145/3613904.3642710

Hassan, Saad; De_Lacerda_Pataca, Caluã; Nourian, Laleh; Tigwell, Garreth W; Davis, Briana; Silver_Wagman, Will Zhenya (May 2024, ACM)

Analyzing dance moves and routines is a foundational step in learning dance. Videos are often utilized at this step, and advancements in machine learning, particularly in human-movement recognition, could further assist dance learners. We developed and evaluated a Wizard-of-Oz prototype of a video comprehension tool that offers automatic in-situ dance move identification functionality. Our system design was informed by an interview study involving 12 dancers to understand the challenges they face when trying to comprehend complex dance videos and taking notes. Subsequently, we conducted a within-subject study with 8 Cuban salsa dancers to identify the benefits of our system compared to an existing traditional feature-based search system. We found that the quality of notes taken by participants improved when using our tool, and they reported a lower workload. Based on participants’ interactions with our system, we offer recommendations on how an AI-powered span-search feature can enhance dance video comprehension tools.
more » « less
Full Text Available
Caption Royale: Exploring the Design Space of Affective Captions from the Perspective of Deaf and Hard-of-Hearing Individuals

https://doi.org/10.1145/3613904.3642258

de_Lacerda_Pataca, Caluã; Hassan, Saad; Tinker, Nathan; Peiris, Roshan Lalintha; Huenerfauth, Matt (May 2024, ACM)

Affective captions employ visual typographic modulations to convey a speaker’s emotions, improving speech accessibility for Deaf and Hard-of-Hearing (dhh) individuals. However, the most effective visual modulations for expressing emotions remain uncertain. Bridging this gap, we ran three studies with 39 dhh participants, exploring the design space of affective captions, which include parameters like text color, boldness, size, and so on. Study 1 assessed preferences for nine of these styles, each conveying either valence or arousal separately. Study 2 combined Study 1’s top-performing styles and measured preferences for captions depicting both valence and arousal simultaneously. Participants outlined readability, minimal distraction, intuitiveness, and emotional clarity as key factors behind their choices. In Study 3, these factors and an emotion-recognition task were used to compare how Study 2’s winning styles performed versus a non-styled baseline. Based on our findings, we present the two best-performing styles as design recommendations for applications employing affective captions.
more » « less
Full Text Available
Modeling Word Importance in Conversational Transcripts: Toward improved live captioning for Deaf and hard of hearing viewers

https://doi.org/10.1145/3587281.3587290

Amin, Akhter Al; Hassan, Saad; Huenerfauth, Matt; Alm, Cecilia Ovesdotter (April 2023, Proceedings of the 20th International Web for All Conference)

Full Text Available
Understanding How Deaf and Hard of Hearing Viewers Visually Explore Captioned Live TV News

https://doi.org/10.1145/3587281.3587287

Amin, Akhter Al; Hassan, Saad; Lee, Sooyeon; Huenerfauth, Matt (April 2023, Proceedings of the 20th International Web for All Conference)

Full Text Available
Exploring the Design Space of Automatically Generated Emotive Captions for Deaf or Hard of Hearing Users

https://doi.org/10.1145/3544549.3585880

Hassan, Saad; Ding, Yao; Kerure, Agneya Abhimanyu; Miller, Christi; Burnett, John; Biondo, Emily; Gilbert, Brenden (April 2023, Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems (CHI EA '23))

Caption text conveys salient auditory information to deaf or hard-of-hearing (DHH) viewers. However, the emotional information within the speech is not captured. We developed three emotive captioning schemas that map the output of audio-based emotion detection models to expressive caption text that can convey underlying emotions. The three schemas used typographic changes to the text, color changes, or both. Next, we designed a Unity framework to implement these schemas and used it to generate stimuli videos. In an experimental evaluation with 28 DHH viewers, we compared DHH viewers’ ability to understand emotions and their subjective judgments across the three captioning schemas. We found no significant difference in participants’ ability to understand the emotion based on the captions or their subjective preference ratings. Open-ended feedback revealed factors contributing to individual differences in preferences among the participants and challenges with automatically generated emotive captions that motivate future work.
more » « less
Full Text Available
Visualization of Speech Prosody and Emotion in Captions: Accessibility for Deaf and Hard-of-Hearing Users

https://doi.org/10.1145/3544548.3581511

de Lacerda Pataca, Caluã; Watkins, Matthew; Peiris, Roshan; Lee, Sooyeon; Huenerfauth, Matt (April 2023, In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems)

Full Text Available

« Prev Next »

Search for: All records