DNN-based monaural speech enhancement using alternate analysis windows for phase and magnitude modification

Liu, Xi; Hansen, John HL

doi:10.21437/Interspeech.2024-2244

Citation Details

DNN-based monaural speech enhancement using alternate analysis windows for phase and magnitude modification

In recent decades, considerable research has been devoted to speech enhancement leveraging the short-term Fourier transform (STFT) analysis. As speech processing technology evolves, the significance of phase information in enhancing speech intelligibility becomes more noticeable. Typically, the Hanning window has been widely employed as analysis window in STFT. In this study, we propose the Chebyshev window for phase analysis, and the Hanning window for magnitude analysis. Next, we introduce a novel cepstral domain enhancement approach designed to robustly reinforce the harmonic structure of speech. The performance of our model is evaluated using the DNS challenge test set as well as the naturalistic APOLLO Fearless Steps evaluation set. Experimental results demonstrate that the Chebyshev-based phase solution outperforms the Hanning option for in phase-aware speech enhancement. Furthermore, the incorporation of quefrency emphasis proves effective in enhancing overall speech quality. more »

Award ID(s):: 2016725

PAR ID:: 10542791

Author(s) / Creator(s):: Liu, Xi; Hansen, John HL

Publisher / Repository:: ISCA

Date Published:: 2024-09-01

Edition / Version:: 1

Volume:: 1

Issue:: 2244

Page Range / eLocation ID:: 1705 to 1709

Subject(s) / Keyword(s):: Speech Enhancement DNN Phase and Magniture Chebyshev

Format(s):: Medium: X Size: 8MB

Size(s):: 8MB

Location:: Kos Island, Greece

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.21437/Interspeech.2024-2244

More Like this