NSF PAR Search | NSF Public Access Repository

Prosodic alignment toward emotionally expressive speech: Comparing human and Alexa model talkers

https://doi.org/10.1016/j.specom.2021.10.003

Cohn, Michelle; Predeck, Kristin; Sarian, Melina; Zellou, Georgia (December 2021, Speech Communication)
Full Text Available
Speech Rate Adjustments in Conversations With an Amazon Alexa Socialbot

https://doi.org/10.3389/fcomm.2021.671429

Cohn, Michelle; Liang, Kai-Hui; Sarian, Melina; Zellou, Georgia; Yu, Zhou (May 2021, Frontiers in Communication)
This paper investigates users’ speech rate adjustments during conversations with an Amazon Alexa socialbot in response to situational (in-lab vs. at-home) and communicative (ASR comprehension errors) factors. We collected user interaction studies and measured speech rate at each turn in the conversation and in baseline productions (collected prior to the interaction). Overall, we find that users slow their speech rate when talking to the bot, relative to their pre-interaction productions, consistent with hyperarticulation. Speakers use an even slower speech rate in the in-lab setting (relative to at-home). We also see evidence for turn-level entrainment: the user follows the directionality of Alexa’s changes in rate in the immediately preceding turn. Yet, we do not see differences in hyperarticulation or entrainment in response to ASR errors, or on the basis of user ratings of the interaction. Overall, this work has implications for human-computer interaction and theories of linguistic adaptation and entrainment.
Full Text Available
Individual Variation in Language Attitudes Toward Voice-AI: The Role of Listeners’ Autistic-Like Traits

https://doi.org/10.21437/Interspeech.2020-1339

Cohn, Michelle; Sarian, Melina; Predeck, Kristin; Zellou, Georgia (October 2020, Proceedings of Interspeech)
More and more, humans are engaging with voice-activated artificially intelligent (voice-AI) systems that have names (e.g., Alexa), apparent genders, and even emotional expression; they are in many ways a growing ‘social’ presence. But to what extent do people display sociolinguistic attitudes, developed from human-human interaction, toward these disembodied text-to-speech (TTS) voices? And how might they vary based on the cognitive traits of the individual user? The current study addresses these questions, testing native English speakers’ judgments for 6 traits (intelligent, likeable, attractive, professional, human-like, and age) for a naturally-produced female human voice and the US-English default Amazon Alexa voice. Following exposure to the voices, participants completed these ratings for each speaker, as well as the Autism Quotient (AQ) survey, to assess individual differences in cognitive processing style. Results show differences in individuals’ ratings of the likeability and human-likeness of the human and AI talkers based on AQ score. Results suggest that humans transfer social assessment of human voices to voice-AI, but that the way they do so is mediated by their own cognitive characteristics.
Full Text Available