Fearless Steps APOLLO: Challenges in keyword spotting and topic detection for naturalistic audio streams

Joglekar, A.; Lopez-Espejo, I.; Hansen, J.H.L.

Citation Details

Fearless Steps (FS) APOLLO is a + 50,000 hr audio resource established by CRSS-UTDallas capturing all communications between NASA-MCC personnel, backroom staff, and Astronauts across manned Apollo Missions. Such a massive audio resource without metadata/unlabeled corpus provides limited benefit for communities outside Speech-and-Language Technology (SLT). Supplementing this audio with rich metadata developed using robust automated mechanisms to transcribe and highlight naturalistic communications can facilitate open research opportunities for SLT, speech sciences, education, and historical archival communities. In this study, we focus on customizing keyword spotting (KWS) and topic detection systems as an initial step towards conversational understanding. Extensive research in automatic speech recognition (ASR), speech activity, and speaker diarization using manually transcribed 125 h FS Challenge corpus has demonstrated the need for robust domain-specific model development. A major challenge in training KWS systems and topic detection models is the availability of word-level annotations. Forced alignment schemes evaluated using state-of-the-art ASR show significant degradation in segmentation performance. This study explores challenges in extracting accurate keyword segments using existing sentence-level transcriptions and proposes domain-specific KWS-based solutions to detect conversational topics in audio streams. more »

Award ID(s):: 2016725

PAR ID:: 10484494

Author(s) / Creator(s):: Joglekar, A.; Lopez-Espejo, I.; Hansen, J.H.L.

Corporate Creator(s):: https://doi.org/10.1121/10.0018566

Publisher / Repository:: Acoustical Society of America (Spring Meeting)

Date Published:: 2023-05-08

Journal Name:: Program of the meeting Acoustical Society of America

ISSN:: 0163-0962

Page Range / eLocation ID:: paper: 2pSCb19

Format(s):: Medium: X

Location:: Chicago, IL

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Proceeding:
The DOI is not currently available.

More Like this