Using Confidence Scores to Improve Eyes-free Detection of Speech Recognition Errors

Nowrin, Sadia; Vertanen, Keith

doi:10.1145/3733155.3734896

Citation Details

This content will become publicly available on June 25, 2026

Using Confidence Scores to Improve Eyes-free Detection of Speech Recognition Errors

Conversational systems rely heavily on speech recognition to interpret and respond to user commands and queries. Despite progress on speech recognition accuracy, errors may still sometimes occur and can significantly affect the end-user utility of such systems. While visual feedback can help detect errors, it may not always be practical, especially for people who are blind or low-vision. In this study, we investigate ways to improve error detection by manipulating the audio output of the transcribed text based on the recognizer's confidence level in its result. Our findings show that selectively slowing down the audio when the recognizer exhibited uncertainty led to a 12% relative increase in participants' ability to detect errors compared to uniformly slowing the audio. It also reduced the time it took participants to listen to the recognition result and decide if there was an error by 11%. more »

Award ID(s):: 1909248

PAR ID:: 10628760

Author(s) / Creator(s):: Nowrin, Sadia; Vertanen, Keith

Publisher / Repository:: ACM

Date Published:: 2025-06-25

ISBN:: 9798400714023

Page Range / eLocation ID:: 194 to 201

Format(s):: Medium: X

Location:: Corfu Island Greece

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on June 25, 2026
Conference Paper:
https://doi.org/10.1145/3733155.3734896

More Like this