Cooperative Speech Separation With a Microphone Array and Asynchronous Wearable Devices

Corey, Ryan; Mittal, Manan; Sarkar, Kanad; Singer, Andrew C.

doi:10.21437/Interspeech.2022-11025

Citation Details

We consider the problem of separating speech from several talkers in background noise using a fixed microphone array and a set of wearable devices. Wearable devices can provide reliable information about speech from their wearers, but they typically cannot be used directly for multichannel source separation due to network delay, sample rate offsets, and relative motion. Instead, the wearable microphone signals are used to compute the speech presence probability for each talker at each time-frequency index. Those parameters, which are robust against small sample rate offsets and relative motion, are used to track the second-order statistics of the speech sources and background noise. The fixed array then separates the speech signals using an adaptive linear time-varying multichannel Wiener filter. The proposed method is demonstrated using real-room recordings from three human talkers with binaural earbud microphones and an eight-microphone tabletop array.
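The pipeline described in the abstract (speech presence probabilities, then tracked second-order statistics, then a multichannel Wiener filter) can be illustrated with a minimal sketch for a single time-frequency bin. This is not the authors' implementation: the function names `update_covariances` and `mwf_separate`, the forgetting factor `alpha`, the exponential-averaging update, and the use of the product of `1 - p` as the noise-only probability are all assumptions made for illustration. The speech presence probabilities `p` are taken as given, since the abstract does not say how they are computed from the wearable signals.

```python
import numpy as np


def update_covariances(R_speech, R_noise, x, p, alpha=0.95):
    """Recursively track per-talker and noise covariances at one bin.

    R_speech: (N, M, M) covariance per talker (N talkers, M array mics)
    R_noise:  (M, M) background-noise covariance
    x:        (M,) array STFT vector at this time-frequency index
    p:        (N,) speech presence probability per talker (assumed given)
    alpha:    hypothetical forgetting factor, not taken from the paper
    """
    R_speech = np.asarray(R_speech, dtype=complex)
    R_noise = np.asarray(R_noise, dtype=complex)
    p = np.asarray(p, dtype=float)
    outer = np.outer(x, np.conj(x))

    # Assumption: each talker's statistics adapt in proportion to p.
    w = (1.0 - alpha) * p[:, None, None]
    R_speech = (1.0 - w) * R_speech + w * outer

    # Assumption: noise adapts only when no talker is likely active,
    # treating the talkers' activity as independent.
    w0 = (1.0 - alpha) * np.prod(1.0 - p)
    R_noise = (1.0 - w0) * R_noise + w0 * outer
    return R_speech, R_noise


def mwf_separate(R_speech, R_noise, x, ref=0):
    """Multichannel Wiener estimate of each talker at microphone `ref`.

    Computes e_ref^T R_n (sum_k R_k + R_noise)^{-1} x for every talker n.
    """
    R_mix = R_speech.sum(axis=0) + R_noise
    z = np.linalg.solve(R_mix, x)      # R_mix^{-1} x without an explicit inverse
    return R_speech[:, ref, :] @ z     # shape (N,)
```

In a full system these two steps would run for every frequency bin of every STFT frame, which is what makes the resulting filter time-varying. Because the update depends only on the probabilities and the array's own signal, the wearable signals never enter the filter directly, which is consistent with the robustness to sample rate offsets that the abstract describes.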

Award ID(s): 1919257

PAR ID: 10475918

Author(s) / Creator(s): Corey, Ryan; Mittal, Manan; Sarkar, Kanad; Singer, Andrew C.

Publisher / Repository: ISCA

Date Published: 2022-09-18

Journal Name: Proc. Interspeech 2022

Page Range / eLocation ID: 5398 to 5402

Sponsoring Org: National Science Foundation

Conference Paper:
https://doi.org/10.21437/Interspeech.2022-11025
