NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Visual Hearing Aids: Artificial Visual Speech Stimuli for Audiovisual Speech Perception in Noise

https://doi.org/10.1145/3611659.3615682

Choudhary, Zubin; Bruder, Gerd; Welch, Greg (October 2023, ACM)

Human speech perception is generally optimal in quiet environments, however it becomes more difficult and error prone in the presence of noise, such as other humans speaking nearby or ambient noise. In such situations, human speech perception is improved by speech reading , i.e., watching the movements of a speaker's mouth and face, either consciously as done by people with hearing loss or subconsciously by other humans. While previous work focused largely on speech perception of two-dimensional videos of faces, there is a gap in the research field focusing on facial features as seen in head-mounted displays, including the impacts of display resolution, and the effectiveness of visually enhancing a virtual human face on speech perception in the presence of noise. In this paper, we present a comparative user study ( $N=21$ ) in which we investigated an audio-only condition compared to two levels of head-mounted display resolution ( $$1832\times 1920$$ or $$916\times 960$$ pixels per eye) and two levels of the native or visually enhanced appearance of a virtual human, the latter consisting of an up-scaled facial representation and simulated lipstick (lip coloring) added to increase contrast. To understand effects on speech perception in noise, we measured participants' speech reception thresholds (SRTs) for each audio-visual stimulus condition. These thresholds indicate the decibel levels of the speech signal that are necessary for a listener to receive the speech correctly 50% of the time. First, we show that the display resolution significantly affected participants' ability to perceive the speech signal in noise, which has practical implications for the field, especially in social virtual environments. Second, we show that our visual enhancement method was able to compensate for limited display resolution and was generally preferred by participants. Specifically, our participants indicated that they benefited from the head scaling more than the added facial contrast from the simulated lipstick. We discuss relationships, implications, and guidelines for applications that aim to leverage such enhancements.
more » « less
Full Text Available
Exploring the Social Influence of Virtual Humans Unintentionally Conveying Conflicting Emotions

https://doi.org/10.1109/VR55154.2023.00072

Choudhary, Zubin; Norouzi, Nahal; Erickson, Austin; Schubert, Ryan; Bruder, Gerd; Welch, Gregory F. (March 2023, 2023 IEEE Conference Virtual Reality and 3D User Interfaces (VR))

The expression of human emotion is integral to social interaction, and in virtual reality it is increasingly common to develop virtual avatars that attempt to convey emotions by mimicking these visual and aural cues, i.e. the facial and vocal expressions. However, errors in (or the absence of) facial tracking can result in the rendering of incorrect facial expressions on these virtual avatars. For example, a virtual avatar may speak with a happy or unhappy vocal inflection while their facial expression remains otherwise neutral. In circumstances where there is conflict between the avatar's facial and vocal expressions, it is possible that users will incorrectly interpret the avatar's emotion, which may have unintended consequences in terms of social influence or in terms of the outcome of the interaction. In this paper, we present a human-subjects study (N = 22) aimed at understanding the impact of conflicting facial and vocal emotional expressions. Specifically we explored three levels of emotional valence (unhappy, neutral, and happy) expressed in both visual (facial) and aural (vocal) forms. We also investigate three levels of head scales (down-scaled, accurate, and up-scaled) to evaluate whether head scale affects user interpretation of the conveyed emotion. We find significant effects of different multimodal expressions on happiness and trust perception, while no significant effect was observed for head scales. Evidence from our results suggest that facial expressions have a stronger impact than vocal expressions. Additionally, as the difference between the two expressions increase, the less predictable the multimodal expression becomes. For example, for the happy-looking and happy-sounding multimodal expression, we expect and see high happiness rating and high trust, however if one of the two expressions change, this mismatch makes the expression less predictable. We discuss the relationships, implications, and guidelines for social applications that aim to leverage multimodal social cues.
more » « less
Full Text Available
Virtual Big Heads in Extended Reality: Estimation of Ideal Head Scales and Perceptual Thresholds for Comfort and Facial Cues

https://doi.org/10.1145/3571074

Choudhary, Zubin; Erickson, Austin; Norouzi, Nahal; Kim, Kangsoo; Bruder, Gerd; Welch, Gregory (January 2023, ACM Transactions on Applied Perception)

Extended reality (XR) technologies, such as virtual reality (VR) and augmented reality (AR), provide users, their avatars, and embodied agents a shared platform to collaborate in a spatial context. Although traditional face-to-face communication is limited by users’ proximity, meaning that another human’s non-verbal embodied cues become more difficult to perceive the farther one is away from that person, researchers and practitioners have started to look into ways to accentuate or amplify such embodied cues and signals to counteract the effects of distance with XR technologies. In this article, we describe and evaluate the Big Head technique, in which a human’s head in VR/AR is scaled up relative to their distance from the observer as a mechanism for enhancing the visibility of non-verbal facial cues, such as facial expressions or eye gaze. To better understand and explore this technique, we present two complimentary human-subject experiments in this article. In our first experiment, we conducted a VR study with a head-mounted display to understand the impact of increased or decreased head scales on participants’ ability to perceive facial expressions as well as their sense of comfort and feeling of “uncannniness” over distances of up to 10 m. We explored two different scaling methods and compared perceptual thresholds and user preferences. Our second experiment was performed in an outdoor AR environment with an optical see-through head-mounted display. Participants were asked to estimate facial expressions and eye gaze, and identify a virtual human over large distances of 30, 60, and 90 m. In both experiments, our results show significant differences in minimum, maximum, and ideal head scales for different distances and tasks related to perceiving faces, facial expressions, and eye gaze, and we also found that participants were more comfortable with slightly bigger heads at larger distances. We discuss our findings with respect to the technologies used, and we discuss implications and guidelines for practical applications that aim to leverage XR-enhanced facial cues.
more » « less
Full Text Available
Amplifying Realities: Gradual and Seamless Scaling of Visual and Auditory Stimuli in Extended Reality

https://doi.org/10.1109/ISMAR-Adjunct54149.2021.00109

Choudhary, Zubin (October 2021, 2021 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct))

Full Text Available
Real-Time Magnification in Augmented Reality

https://doi.org/10.1145/3485279.3488286

Choudhary, Zubin; Ugarte, Jesus; Bruder, Gerd; Welch, Greg (November 2021, Symposium on Spatial User Interaction)

Full Text Available
Scaled User Embodied Representations in Virtual and Augmented Reality

Choudhary, Zubin; Bruder, Gerd; Welch, Gregory F. (January 2021, Workshop on User-Embodied Interaction in Virtual Reality (UIVR) 2021)

Full Text Available
Trade-offs in Augmented Reality User Interfaces for Controlling a Smart Environment

https://doi.org/10.1145/3485279.3485288

Flick, Connor Daniel; Harris, Courtney J; Yonkers, Nikolas T; Norouzi, Nahal; Erickson, Austin; Choudhary, Zubin; Gottsacker, Matt; Bruder, Gerd; Welch, Greg (November 2021, Proc. of the ACM Symposium on Spatial User Interaction (SUI ’21))

Smart devices and Internet of Things (IoT) technologies are replacing or being incorporated into traditional devices at a growing pace. The use of digital interfaces to interact with these devices has become a common occurrence in homes, work spaces, and various industries around the world. The most common interfaces for these connected devices focus on mobile apps or voice control via intelligent virtual assistants. However, with augmented reality (AR) becoming more popular and accessible among consumers, there are new opportunities for spatial user interfaces to seamlessly bridge the gap between digital and physical affordances. In this paper, we present a human-subject study evaluating and comparing four user interfaces for smart connected environments: gaze input, hand gestures, voice input, and a mobile app. We assessed participants’ user experience, usability, task load, completion time, and preferences. Our results show multiple trade-offs between these interfaces across these measures. In particular, we found that gaze input shows great potential for future use cases, while both gaze input and hand gestures suffer from limited familiarity among users, compared to voice input and mobile apps.
more » « less
Full Text Available
Revisiting Distance Perception with Scaled Embodied Cues in Social Virtual Reality

https://doi.org/10.1109/VR50410.2021.00106

Choudhary, Zubin; Gottsacker, Matthew; Kim, Kangsoo; Schubert, Ryan; Stefanucci, Jeanine; Bruder, Gerd; Welch, Gregory F. (March 2021, IEEE Virtual Reality (VR), 2021)
null (Ed.)
Full Text Available
Virtual Big Heads: Analysis of Human Perception and Comfort of Head Scales in Social Virtual Reality

https://doi.org/10.1109/VR46266.2020.00063

Choudhary, Zubin; Kim, Kangsoo; Schubert, Ryan; Bruder, Gerd; Welch, Gregory F. (March 2020, 2020 IEEE Conference on Virtual Reality and 3D User Interfaces (VR))

Full Text Available
Analysis of Human Perception and Comfort of Head Scales in Social Virtual Reality

https://doi.org/10.1109/VR46266.2020.00-41

Choudhary, Zubin; Kim, Kangsoo; Schubert, Ryan; Bruder, Gerd; Welch, Gregory F (January 2020, IEEE Conference on Virtual Reality and 3D User Interfaces (IEEE VR))

Full Text Available

« Prev Next »

Search for: All records