Title: Matching human vocal imitations to birdsong: An exploratory analysis
We explore computational strategies for matching human vocal imitations of birdsong to actual birdsong recordings. We recorded human vocal imitations of birdsong and subsequently analysed these data using three categories of audio features for matching imitations to original birdsong: spectral, temporal, and spectrotemporal. These exploratory analyses suggest that spectral features can help distinguish imitation strategies (e.g. whistling vs. singing) but are insufficient for distinguishing species. Similarly, although temporal features are correlated between human imitations and natural birdsong, they too are insufficient on their own. Spectrotemporal features showed the greatest promise, in particular when used to extract a representation of the pitch contour of birdsong and human imitations. This finding suggests a link between the task of matching human imitations to birdsong and retrieval tasks in the music domain such as query-by-humming and cover song retrieval; we borrow from such existing methodologies to outline directions for future research.
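The pitch-contour approach described in the abstract maps naturally onto query-by-humming-style retrieval. As a minimal sketch, assuming librosa is available, the code below extracts a voiced-only pitch contour from an imitation and a birdsong recording and compares them with dynamic time warping; the function names, frequency ranges, and median-normalisation step are illustrative assumptions, not the paper's implementation.

```python
# Minimal sketch (not the paper's implementation): pitch-contour extraction
# and DTW-based matching, in the spirit of query-by-humming retrieval.
import numpy as np
import librosa

def pitch_contour(path, sr=22050, fmin=200.0, fmax=8000.0):
    """Voiced-only pitch contour in semitones, centred on its median."""
    y, sr = librosa.load(path, sr=sr)
    f0, voiced_flag, _ = librosa.pyin(y, fmin=fmin, fmax=fmax, sr=sr)
    f0 = f0[voiced_flag & ~np.isnan(f0)]
    semitones = 12.0 * np.log2(f0 / 440.0)   # log-frequency scale
    return semitones - np.median(semitones)  # rough transposition invariance

def contour_distance(contour_a, contour_b):
    """DTW cost between two contours, normalised by warping-path length."""
    D, wp = librosa.sequence.dtw(X=contour_a[np.newaxis, :],
                                 Y=contour_b[np.newaxis, :])
    return D[-1, -1] / len(wp)

# Hypothetical usage: rank candidate birdsong recordings for one imitation.
# imitation = pitch_contour("imitation.wav")
# scores = {name: contour_distance(imitation, pitch_contour(path))
#           for name, path in birdsong_files.items()}
```

In a retrieval setting, the candidate with the lowest normalised DTW cost would be returned as the best match; cover-song-style approaches would add further invariances (e.g. to key and tempo) on top of such a representation.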
Award ID(s):
1633206
PAR ID:
10118935
Author(s) / Creator(s):
Date Published:
Journal Name:
2nd International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1.
    The development of rhythmicity is foundational to communicative and social behaviours in humans and many other species, and mechanisms of synchrony could be conserved across species. The goal of the current paper is to explore evolutionary hypotheses linking vocal learning and beat synchronization through genomic approaches, testing the prediction that genetic underpinnings of birdsong also contribute to the aetiology of human interactions with musical beat structure. We combined state-of-the-art genomic datasets that account for underlying polygenicity of these traits: birdsong genome-wide transcriptomics linked to singing in zebra finches, and a human genome-wide association study of beat synchronization. Results of competitive gene set analysis revealed that the genetic architecture of human beat synchronization is significantly enriched for birdsong genes expressed in songbird Area X (a key nucleus for vocal learning, and homologous to human basal ganglia). These findings complement ethological and neural evidence of the relationship between vocal learning and beat synchronization, supporting a framework of some degree of common genomic substrates underlying rhythm-related behaviours in two clades, humans and songbirds (the largest evolutionary radiation of vocal learners). Future cross-species approaches investigating the genetic underpinnings of beat synchronization in a broad evolutionary context are discussed. This article is part of the theme issue ‘Synchrony and rhythm interaction: from the brain to behavioural ecology’.
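    The competitive gene-set analysis mentioned above is typically run with dedicated tools such as MAGMA; the snippet below is only a simplified, hypothetical illustration of the underlying idea (are gene-level association scores higher inside a candidate gene set than outside it?), with placeholder data, and it omits the corrections for gene size, gene density, and linkage disequilibrium that real analyses apply.

```python
# Simplified, hypothetical illustration of a competitive gene-set test:
# compare gene-level association scores (e.g. Z-scores from a GWAS of beat
# synchronization) for genes inside a candidate set (e.g. genes linked to
# singing in songbird Area X) against genes outside it, via permutation.
import numpy as np

def competitive_gene_set_test(z_scores, in_set, n_perm=10_000, seed=0):
    """Permutation p-value for mean(in-set Z) minus mean(out-of-set Z)."""
    rng = np.random.default_rng(seed)
    z = np.asarray(z_scores, dtype=float)
    mask = np.asarray(in_set, dtype=bool)
    observed = z[mask].mean() - z[~mask].mean()
    k = int(mask.sum())
    null = np.empty(n_perm)
    for i in range(n_perm):
        perm = rng.permutation(z)
        null[i] = perm[:k].mean() - perm[k:].mean()
    p_value = (np.sum(null >= observed) + 1) / (n_perm + 1)
    return observed, p_value

# Toy, made-up example (placeholders, not real GWAS or transcriptomic data):
# z = np.random.default_rng(1).normal(size=5000)   # gene-level Z-scores
# birdsong_genes = np.zeros(5000, dtype=bool)
# birdsong_genes[:200] = True                      # hypothetical gene set
# effect, p = competitive_gene_set_test(z, birdsong_genes)
```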
  2. Most studies of acoustic communication focus on short units of vocalization such as songs, yet these units are often hierarchically organized into higher-order sequences and, outside human language, little is known about the drivers of sequence structure. Here, we investigate the organization, transmission and function of vocal sequences sung by male Albert's lyrebirds (Menura alberti), a species renowned for vocal imitations of other species. We quantified the organization of mimetic units into sequences, and examined the extent to which these sequences are repeated within and between individuals and shared among populations. We found that individual males organized their mimetic units into stereotyped sequences. Sequence structures were shared within and to a lesser extent among populations, implying that sequences were socially transmitted. Across the entire species range, mimetic units were sung with immediate variety and a high acoustic contrast between consecutive units, suggesting that sequence structure is a means to enhance receiver perceptions of repertoire complexity. Our results provide evidence that higher-order sequences of vocalizations can be socially transmitted, and that the order of vocal units can be functionally significant. We conclude that, to fully understand vocal behaviours, we must study both the individual vocal units and their higher-order temporal organization.
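    As a rough illustration of how such sequence organization can be quantified, the sketch below computes a first-order transition matrix and an immediate-repetition rate from sequences of labelled units (low repetition corresponds to the "immediate variety" described above). The unit labels and helper names are hypothetical, not the study's analysis pipeline.

```python
# Hypothetical sketch: quantify the organization of labelled vocal units as
# first-order transition probabilities and the rate of immediate repetition.
import numpy as np

def transition_matrix(sequence, labels):
    """Row-normalised first-order transition probabilities between unit labels."""
    idx = {u: i for i, u in enumerate(labels)}
    counts = np.zeros((len(labels), len(labels)))
    for a, b in zip(sequence[:-1], sequence[1:]):
        counts[idx[a], idx[b]] += 1
    row_sums = counts.sum(axis=1, keepdims=True)
    return np.divide(counts, row_sums, out=np.zeros_like(counts),
                     where=row_sums > 0)

def immediate_repetition_rate(sequence):
    """Fraction of consecutive unit pairs in which the same unit repeats."""
    pairs = list(zip(sequence[:-1], sequence[1:]))
    return sum(a == b for a, b in pairs) / len(pairs)

# Hypothetical mimetic-unit sequence (labels are placeholders):
# song = ["whipbird", "rosella", "shrikethrush", "rosella", "whipbird"]
# labels = sorted(set(song))
# P = transition_matrix(song, labels)
# rep = immediate_repetition_rate(song)   # 0.0 -> high immediate variety
```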
  3. Birdsong has long been a subject of extensive research in ethology and neuroscience. Neural and behavioral mechanisms underlying song acquisition and production in male songbirds are particularly well studied, mainly because birdsong shares important features with human speech, such as its critical dependence on vocal learning. However, birdsong, like human speech, primarily functions as a communication signal. The mechanisms of song perception and recognition should therefore also be investigated to attain a deeper understanding of the nature of complex vocal signals. Although song receivers have received less attention than signalers, recent studies on female songbirds have begun to reveal the neural basis of song preference. Moreover, studies of song preference in juvenile birds suggest possible functions of preference in social contexts, including the sensory phase of song learning. Understanding the behavioral and neural mechanisms underlying the formation, maintenance, expression, and alteration of such song preferences in birds will potentially give insight into the mechanisms of speech communication in humans. To pursue this line of research, however, it is necessary to understand current methodological challenges in defining and measuring song preference. In addition, consideration of ultimate questions is also important for laboratory researchers in designing experiments and interpreting results. Here we summarize the current understanding of song preference in female and juvenile songbirds in the context of Tinbergen’s four questions, incorporating results ranging from ethological field research to the latest neuroscience findings. We also discuss problems and remaining questions in this field and suggest possible solutions and future directions.
  4. Candolin, Ulrika (Ed.)
    Learned traits, such as foraging strategies and communication signals, can change over time via cultural evolution. Using historical recordings, we investigate the cultural evolution of birdsong over nearly a 50-year period. Specifically, we examine the parts of white-crowned sparrow (Zonotrichia leucophrys nuttalli) songs used for mate attraction and territorial defense. We compared historical (early 1970s) recordings with contemporary (mid-2010s) recordings from populations within and near San Francisco, CA and assessed the vocal performance of these songs. Because birds exposed to anthropogenic noise tend to sing at higher minimum frequencies with narrower frequency bandwidths, potentially reducing one measure of song performance, we hypothesized that other song features, such as syllable complexity, might be exaggerated, as an alternative means to display performance capabilities. We found that vocal performance increased between historical and contemporary songs, with a larger effect size for urban songs, and that syllable complexity, measured as the number of frequency modulations per syllable, was historically low for urban males but increased significantly in urban songs. We interpret these results as evidence for males increasing song complexity and trill performance over time in urban habitats, despite performance constraints from urban noise, and suggest a new line of inquiry into how environments alter vocal performance over time.
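    As an illustration only (not the study's measurement pipeline), the sketch below computes the kinds of per-syllable metrics discussed above from a pitch trace: minimum frequency, frequency bandwidth, and a count of frequency modulations as direction changes in the contour. The threshold and function names are assumptions.

```python
# Illustrative sketch: per-syllable acoustic metrics from a pitch trace
# (Hz per frame), e.g. one obtained with librosa.pyin. Threshold values
# are placeholders, not the study's settings.
import numpy as np

def syllable_metrics(f0_hz, min_change_hz=50.0):
    """Minimum frequency, bandwidth, and frequency-modulation count."""
    f0 = np.asarray(f0_hz, dtype=float)
    f0 = f0[~np.isnan(f0)]                      # keep voiced frames only
    min_freq = f0.min()
    bandwidth = f0.max() - f0.min()
    # Count direction changes in the frame-to-frame frequency slope,
    # ignoring changes smaller than min_change_hz to reduce noise.
    diffs = np.diff(f0)
    diffs = diffs[np.abs(diffs) >= min_change_hz]
    modulations = int(np.sum(np.diff(np.sign(diffs)) != 0))
    return {"min_freq_hz": float(min_freq),
            "bandwidth_hz": float(bandwidth),
            "frequency_modulations": modulations}

# metrics = syllable_metrics(f0_trace)   # f0_trace: one syllable's pitch trace
```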
  5. We present Sketch2Sound, a generative audio model capable of creating high-quality sounds from a set of interpretable time-varying control signals: loudness, brightness, and pitch, as well as text prompts. Sketch2Sound can synthesize arbitrary sounds from sonic imitations (i.e., a vocal imitation or a reference sound-shape). Sketch2Sound can be implemented on top of any text-to-audio latent diffusion transformer (DiT), and requires only 40k steps of fine-tuning and a single linear layer per control, making it more lightweight than existing methods like ControlNet. To synthesize from sketch-like sonic imitations, we propose applying random median filters to the control signals during training, allowing Sketch2Sound to be prompted using controls with flexible levels of temporal specificity. We show that Sketch2Sound can synthesize sounds that follow the gist of input controls from a vocal imitation while retaining adherence to an input text prompt and audio quality comparable to a text-only baseline. Sketch2Sound allows sound artists to create sounds with the semantic flexibility of text prompts and the expressivity and precision of a sonic gesture or vocal imitation.
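    A minimal sketch of the control-signal idea described above, assuming librosa and scipy are available: extract loudness (RMS), brightness (here approximated by the spectral centroid), and pitch from a sonic imitation, then smooth a control with a median filter of randomly chosen width, mimicking the training-time trick the abstract describes. Frequency ranges and function names are illustrative assumptions, not the authors' implementation.

```python
# Sketch (not the authors' code): extract loudness, brightness, and pitch
# control signals from a recording, and apply a random-width median filter
# of the kind described for Sketch2Sound's training.
import numpy as np
import librosa
from scipy.signal import medfilt

def control_signals(path, sr=44100, hop_length=512):
    """Frame-wise loudness (RMS), brightness (spectral centroid), and pitch."""
    y, sr = librosa.load(path, sr=sr)
    loudness = librosa.feature.rms(y=y, hop_length=hop_length)[0]
    brightness = librosa.feature.spectral_centroid(
        y=y, sr=sr, hop_length=hop_length)[0]
    f0, _, _ = librosa.pyin(y, fmin=60.0, fmax=2000.0, sr=sr,
                            hop_length=hop_length)
    return loudness, brightness, np.nan_to_num(f0)

def random_median_filter(signal, rng, max_width=31):
    """Median-filter a control signal with a randomly chosen odd width."""
    width = int(rng.integers(1, max_width // 2 + 1)) * 2 + 1
    return medfilt(signal, kernel_size=width)

# rng = np.random.default_rng(0)
# loud, bright, pitch = control_signals("vocal_imitation.wav")
# loud_coarse = random_median_filter(loud, rng)   # coarser temporal detail
```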