LONG-DISTANCE DETECTION OF BIOACOUSTIC EVENTS WITH PER-CHANNEL ENERGY NORMALIZATION

Lostanlen, V; Palmer, K; Knight, E; Clark, C; Klinck, H; Farnsworth, A; Wong, T; Cramer, J; Bello, JP

Citation Details

This paper proposes to perform unsupervised detection of bioacous- tic events by pooling the magnitudes of spectrogram frames after per-channel energy normalization (PCEN). Although PCEN was originally developed for speech recognition, it also has beneficial effects in enhancing animal vocalizations, despite the presence of atmospheric absorption and intermittent noise. We prove that PCEN generalizes logarithm-based spectral flux, yet with a tunable time scale for background noise estimation. In comparison with point- wise logarithm, PCEN reduces false alarm rate by 50x in the near field and 5x in the far field, both on avian and marine bioacoustic datasets. Such improvements come at moderate computational cost and require no human intervention, thus heralding a promising future for PCEN in bioacoustics. more »

Award ID(s):: 1633206

PAR ID:: 10118933

Author(s) / Creator(s):: Lostanlen, V; Palmer, K; Knight, E; Clark, C; Klinck, H; Farnsworth, A; Wong, T; Cramer, J; Bello, JP

Date Published:: 2019-10-25

Journal Name:: Detection and Classification of Acoustic Scenes and Events 2019

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this