Challenges in Metadata Creation for Massive Naturalistic Team-Based Audio Data

Belitz, Chelzy; Hansen, John H.L.

doi:10.21437/Interspeech.2022-11243

Citation Details

Challenges in Metadata Creation for Massive Naturalistic Team-Based Audio Data

A broad range of research fields benefit from the information extracted from naturalistic audio data. Speech research typically relies on the availability of human-generated metadata tags to comprise a set of “ground truth” labels for the development of speech processing algorithms. While the manual generation of metadata tags may be feasible on a small scale, unique problems arise when creating speech resources for massive, naturalistic audio data. This paper presents a general discussion on these challenges and highlights suggestions when creating metadata for speech resources that are intended to be useful both in speech research and in other fields. Further, it provides an overview of how the task of creating a speech resource for various communities has been and is continuing to be approached for the massive corpus of audio from the historic NASA Apollo missions, which includes tens of thousands of hours of naturalistic, team-based audio data featuring numerous speakers across multiple points in history. more »

Award ID(s):: 2016725

PAR ID:: 10402504

Author(s) / Creator(s):: Belitz, Chelzy; Hansen, John H.L.

Date Published:: 2022-09-18

Journal Name:: ISCA INTERSPEECH-2022

Page Range / eLocation ID:: 5210 to 5214

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.21437/Interspeech.2022-11243

More Like this