Learning the Helix Topology of Musical Pitch

Lostanlen, Vincent; Sridhar, Sripathi; McFee, Brian; Farnsworth, Andrew; Bello, Juan Pablo

doi:10.1109/ICASSP40776.2020.9053644

Citation Details

To explain the consonance of octaves, music psychologists represent pitch as a helix whose azimuth and axial coordinate correspond to pitch class and pitch height, respectively. This article addresses the problem of discovering this helical structure from unlabeled audio data. We measure Pearson correlations in the constant-Q transform (CQT) domain to build a K-nearest neighbor graph between frequency subbands. Then, we run the Isomap manifold learning algorithm to represent this graph in a three-dimensional space in which straight lines approximate graph geodesics. Experiments on isolated musical notes demonstrate that the resulting manifold resembles a helix that makes a full turn at every octave. A circular shape is also found in English speech, but not in urban noise. We discuss the impact of various design choices on the visualization: instrumentarium, loudness mapping function, and number of neighbors K.
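
The three steps named in the abstract (Pearson correlations between subbands, a K-nearest neighbor graph, and an Isomap embedding) can be sketched in a few lines of NumPy. This is a minimal illustration and not the authors' implementation: the function name `isomap_from_subbands`, the edge weight `1 - correlation`, and the use of Floyd-Warshall plus classical multidimensional scaling are assumptions made here for the sake of a self-contained example. The input `X` is assumed to hold one row per frequency subband and one column per observation, for instance CQT magnitudes after some loudness mapping.

```python
import numpy as np


def isomap_from_subbands(X, n_neighbors=3, n_components=3):
    """Embed subbands (rows of X) with a correlation-based Isomap sketch.

    X has shape (n_subbands, n_observations). All names and the choice of
    edge weight are illustrative assumptions, not taken from the paper.
    """
    n = X.shape[0]

    # Step 1: Pearson correlation between every pair of subbands (rows).
    # Note: a constant row would produce NaN correlations.
    corr = np.corrcoef(X)

    # Assumed dissimilarity: highly correlated subbands are "close".
    dissim = 1.0 - corr
    np.fill_diagonal(dissim, 0.0)

    # Step 2: symmetric K-nearest neighbor graph; inf marks "no edge".
    graph = np.full((n, n), np.inf)
    np.fill_diagonal(graph, 0.0)
    for i in range(n):
        order = [j for j in np.argsort(dissim[i]) if j != i]
        for j in order[:n_neighbors]:
            graph[i, j] = graph[j, i] = dissim[i, j]

    # Step 3a: graph geodesics via Floyd-Warshall (all-pairs shortest paths).
    for k in range(n):
        graph = np.minimum(graph, graph[:, k:k + 1] + graph[k:k + 1, :])
    if not np.all(np.isfinite(graph)):
        raise ValueError("kNN graph is disconnected; try a larger n_neighbors")

    # Step 3b: classical MDS on the geodesic distances, so that straight
    # lines in the embedding approximate graph geodesics.
    centering = np.eye(n) - 1.0 / n
    gram = -0.5 * centering @ (graph ** 2) @ centering
    eigvals, eigvecs = np.linalg.eigh(gram)
    top = np.argsort(eigvals)[::-1][:n_components]
    return eigvecs[:, top] * np.sqrt(np.maximum(eigvals[top], 0.0))
```

Under these assumptions, calling `isomap_from_subbands(X, n_neighbors=K)` returns one three-dimensional point per subband, which can then be plotted to inspect whether the points trace a helix. The number of neighbors K is one of the design choices the abstract lists as affecting the visualization, and a graph that is too sparse may be disconnected, in which case this sketch raises an error rather than embedding the pieces separately.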

Award ID(s): 1633206

PAR ID: 10301445

Author(s) / Creator(s): Lostanlen, Vincent; Sridhar, Sripathi; McFee, Brian; Farnsworth, Andrew; Bello, Juan Pablo

Date Published: 2020-05-01

Journal Name: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Page Range / eLocation ID: 11 to 15

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.1109/ICASSP40776.2020.9053644
