

Title: Fair or Fare? Understanding Automated Transcription Error Bias in Social Media and Videoconferencing Platforms
As remote work and learning increase in popularity, individuals, especially those with hearing impairments or who speak English as a second language, may depend on automated transcriptions to participate in business, school, entertainment, or basic communication. In this work, we investigate the automated transcription accuracy of seven popular social media and videoconferencing platforms with respect to personal characteristics of their users, including gender, age, race, first language, speech rate, fundamental frequency (F0), and speech readability. We performed this investigation on a new corpus of 194 hours of English monologues by 846 TED talk speakers. Our results show the presence of significant bias, with transcripts less accurate for speakers who are male or non-native English speakers. We also observe differences in accuracy among platforms for different types of speakers. These results indicate that, while platforms have improved their automatic captioning, much work remains to make captions accessible to a wider variety of speakers and listeners.
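Transcription accuracy studies of this kind are typically scored with word error rate (WER), the edit distance between the reference transcript and the automatic one, normalized by the reference length. The paper's exact scoring pipeline is not described here, so the following is only an illustrative sketch of the standard metric:

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance divided by reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    # Dynamic-programming edit distance over words
    # (substitutions, insertions, deletions all cost 1).
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,          # deletion
                          d[i][j - 1] + 1,          # insertion
                          d[i - 1][j - 1] + cost)   # substitution
    return d[len(ref)][len(hyp)] / len(ref)
```

A bias analysis would compute this per speaker and per platform, then compare WER distributions across demographic groups.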
Award ID(s):
1955227
PAR ID:
10531479
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
Proceedings of the International AAAI Conference on Web and Social Media
Date Published:
Journal Name:
Proceedings of the International AAAI Conference on Web and Social Media
Volume:
18
ISSN:
2162-3449
Page Range / eLocation ID:
367 to 380
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like This
  1. While a range of measures based on speech production, language, and perception are possible (Manun et al., 2020) for the prediction and estimation of speech intelligibility, what constitutes second language (L2) intelligibility remains under-defined. Prosodic and temporal features (i.e., stress, speech rate, rhythm, and pause placement) have been shown to impact listener perception (Kang et al., 2020). Still, their relationship with highly intelligible speech remains unclear. This study aimed to characterize L2 speech intelligibility. Acoustic analyses, including PRAAT and Python scripts, were conducted on 405 speech samples (30 s) from 102 L2 English speakers with a wide variety of backgrounds, proficiency levels, and intelligibility levels. The results indicate that highly intelligible speakers of English employ between 2 and 4 syllables per second and that higher or lower speeds are less intelligible. Silent pauses between 0.3 and 0.8 s were associated with the highest levels of intelligibility. Rhythm, measured by Δ syllable length of all content syllables, was marginally associated with intelligibility. Finally, lexical stress accuracy did not interfere substantially with intelligibility until less than 70% of the polysyllabic words were incorrect. These findings inform the fields of first and second language research as well as language education and pathology.
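The rate and pause thresholds reported above can be checked programmatically once syllable counts and pause durations have been extracted (e.g., with Praat). A minimal sketch, assuming those measurements are already available (the function name and 2–4 syll/s and 0.3–0.8 s bands come from the abstract's findings, not from the authors' released code):

```python
def articulation_metrics(n_syllables, speaking_time_s, pauses_s):
    """Summarize speech rate and pause behavior against the bands
    associated with high intelligibility in the study above."""
    rate = n_syllables / speaking_time_s  # syllables per second
    in_band = [p for p in pauses_s if 0.3 <= p <= 0.8]
    return {
        "syllables_per_sec": rate,
        "rate_in_optimal_band": 2.0 <= rate <= 4.0,
        "share_pauses_in_optimal_band": (
            len(in_band) / len(pauses_s) if pauses_s else None
        ),
    }
```

For example, a 30 s sample containing 90 syllables articulates at 3 syllables per second, inside the reported optimal band.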
  2. Purpose: We examined which measures of complexity are most informative when studying language produced in interaction. Specifically, using these measures, we explored whether native and nonnative speakers modified the higher level properties of their production beyond the acoustic–phonetic level based on the language background of their conversation partner. Method: Using a subset of production data from the Wildcat Corpus that used Diapix, an interactive picture matching task, to elicit production, we compared English language production at the dyad and individual level across three different pair types: eight native pairs (English–English), eight mixed pairs (four English–Chinese and four English–Korean), and eight nonnative pairs (four Chinese–Chinese and four Korean–Korean). Results: At both the dyad and individual levels, native speakers produced longer and more clausally dense speech. They also produced fewer silent pauses and fewer linguistic mazes relative to nonnative speakers. Speakers did not modify their production based on the language background of their interlocutor. Conclusions: The current study examines higher level properties of language production in true interaction. Our results suggest that speakers' productions were determined by their own language background and were independent of that of their interlocutor. Furthermore, these measures demonstrated promise for capturing syntactic characteristics of language produced in true dialogue. Supplemental Material: https://doi.org/10.23641/asha.24712956
  3. Speech recognition by both humans and machines frequently fails in non-optimal yet common situations. For example, word recognition error rates for second-language (L2) speech can be high, especially under conditions involving background noise. At the same time, both human and machine speech recognition sometimes shows remarkable robustness against signal- and noise-related degradation. Which acoustic features of speech explain this substantial variation in intelligibility? Current approaches align speech to text to extract a small set of pre-defined spectro-temporal properties from specific sounds in particular words. However, variation in these properties leaves much cross-talker variation in intelligibility unexplained. We examine an alternative approach utilizing a perceptual similarity space acquired using self-supervised learning. This approach encodes distinctions between speech samples without requiring pre-defined acoustic features or speech-to-text alignment. We show that L2 English speech samples are less tightly clustered in the space than L1 samples, reflecting variability in English proficiency among L2 talkers. Critically, distances in this similarity space are perceptually meaningful: L1 English listeners have lower recognition accuracy for L2 speakers whose speech is more distant in the space from L1 speech. These results indicate that perceptual similarity may form the basis for an entirely new speech and language analysis approach.
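The two quantities this abstract relies on, how tightly a group of samples clusters in the embedding space and how far an individual talker sits from the L1 region, reduce to simple geometry once embeddings exist. A minimal sketch, assuming embedding vectors are already computed by some self-supervised model (the function names and distance choice are illustrative, not the authors' implementation):

```python
import math

def centroid(vectors):
    """Component-wise mean of a list of equal-length embedding vectors."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]

def euclidean(a, b):
    """Euclidean distance between two embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def dispersion(vectors):
    """Cluster tightness: mean distance of each sample to the group centroid.
    Larger values mean the group is less tightly clustered."""
    c = centroid(vectors)
    return sum(euclidean(v, c) for v in vectors) / len(vectors)

def distance_to_l1(l1_vectors, l2_vector):
    """How far one L2 talker's embedding lies from the center of L1 speech."""
    return euclidean(centroid(l1_vectors), l2_vector)
```

Under the abstract's finding, `dispersion` would be larger for the L2 group than the L1 group, and `distance_to_l1` would correlate negatively with listeners' recognition accuracy.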
  4. Aims and objectives: This paper analyzes the extent to which new speakers are participating in an ongoing phonological change in Diné Bizaad (Navajo). The implications of these patterns are discussed as they relate to theories of new speakers and language change. Methodology design: I apply a variationist methodology to analyze the pronunciation of lateral affricates from speakers representing different generations and language learning contexts. I focus on comparing new speakers, who report acquiring the language primarily through school or in a language program, with their age-equivalent peers. Data and analysis: The data come from interviews recorded with 51 bilingual Diné Bizaad-English participants, ages 18–75. This includes four new speakers. The analysis focuses on variation in the lateral affricates in connected speech samples and an oral translation task. Findings/conclusion: Results reveal that new speakers diverge from other younger participants in their lack of participation in an ongoing change in the affricates. Instead, new speakers more closely resemble middle-aged and older speakers. Originality: This study applies the new speaker framework to an Indigenous North American language, an under-represented sociolinguistic context within the literature. These findings provide a counterexample to the more frequent finding of new speakers linguistically diverging from older, traditional speakers. Significance/implications: These results are interpreted as arising due to literacy practices, language usage networks, and community values. The orthographic representation of the affricates is thought to inhibit sound change. At the same time, due to their more formal language learning background, new speakers have developed a self-monitored speech style oriented toward the prestigious, older speakers. A lack of peer group language usage is thought to prevent the development of linguistically or ideologically distinct new speaker varieties. The confluence of these factors means that instead of constituting agents of language change, new speakers are more similar to older participants.
  5. Does knowledge of language transfer spontaneously across language modalities? For example, do English speakers who have no command of a sign language spontaneously project grammatical constraints from English to linguistic signs? Here, we address this question by examining the constraints on doubling. We first demonstrate that doubling (e.g., panana, generally, ABB) is amenable to two conflicting parses (identity vs. reduplication), depending on the level of analysis (phonology vs. morphology). We next show that speakers with no command of a sign language spontaneously project these two parses to novel ABB signs in American Sign Language. Moreover, the chosen parse (for signs) is constrained by the morphology of spoken language. Hebrew speakers can project the morphological parse when doubling indicates diminution, but English speakers only do so when doubling indicates plurality, in line with the distinct morphological properties of their spoken languages. These observations suggest that doubling in speech and signs is constrained by a common set of linguistic principles that are algebraic, amodal and abstract.