From the perspective of perceptual speech quality: The robustness of frequency bands to noise

Fan, Junyi; Williamson, Donald S.

doi:10.1121/10.0025272

Citation Details

From the perspective of perceptual speech quality: The robustness of frequency bands to noise

Speech quality is one of the main foci of speech-related research, where it is frequently studied with speech intelligibility, another essential measurement. Band-level perceptual speech intelligibility, however, has been studied frequently, whereas speech quality has not been thoroughly analyzed. In this paper, a Multiple Stimuli With Hidden Reference and Anchor (MUSHRA) inspired approach was proposed to study the individual robustness of frequency bands to noise with perceptual speech quality as the measure. Speech signals were filtered into thirty-two frequency bands with compromising real-world noise employed at different signal-to-noise ratios. Robustness to noise indices of individual frequency bands was calculated based on the human-rated perceptual quality scores assigned to the reconstructed noisy speech signals. Trends in the results suggest the mid-frequency region appeared less robust to noise in terms of perceptual speech quality. These findings suggest future research aiming at improving speech quality should pay more attention to the mid-frequency region of the speech signals accordingly. more »

Award ID(s):: 2235228 1942718

PAR ID:: 10502200

Author(s) / Creator(s):: Fan, Junyi; Williamson, Donald S.

Publisher / Repository:: Acoustical Society of America

Date Published:: 2024-03-01

Journal Name:: The Journal of the Acoustical Society of America

Volume:: 155

Issue:: 3

ISSN:: 0001-4966

Page Range / eLocation ID:: 1916 to 1927

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1121/10.0025272

More Like this