ADAPTIVE TIME–FREQUENCY SCATTERING FOR PERIODIC MODULATION RECOGNITION IN MUSIC SIGNALS

Wang, C; Benetos, E; Lostanlen, V; Chew, E

Vibratos, tremolos, trills, and flutter-tongue are techniques frequently found in vocal and instrumental music. A com- mon feature of these techniques is the periodic modulation in the time–frequency domain. We propose a representa- tion based on time–frequency scattering to model the inter- class variability for fine discrimination of these periodic modulations. Time–frequency scattering is an instance of the scattering transform, an approach for building invari- ant, stable, and informative signal representations. The proposed representation is calculated around the wavelet subband of maximal acoustic energy, rather than over all the wavelet bands. To demonstrate the feasibility of this approach, we build a system that computes the represen- tation as input to a machine learning classifier. Whereas previously published datasets for playing technique analy- sis focus primarily on techniques recorded in isolation, for ecological validity, we create a new dataset to evaluate the system. The dataset, named CBF-periDB, contains full- length expert performances on the Chinese bamboo flute that have been thoroughly annotated by the players them- selves. We report F-measures of 99% for flutter-tongue, 82% for trill, 69% for vibrato, and 51% for tremolo detec- tion, and provide explanatory visualisations of scattering coefficients for each of these techniques.

More Like this