Musical Emotion Recognition with Spectral Feature Extraction Based on a Sinusoidal Model with Model-Based and Deep-Learning Approaches

Xie, Baijun; Kim, Jonathan C.; Park, Chung Hyuk.

Citation Details

This paper presents a method for extracting novel spectral features based on a sinusoidal model. The method is focused on characterizing the spectral shapes of audio signals using spectra peaks in frequency sub-bands. The extracted features are evaluated for predicting the levels of emotional dimensions, namely arousal and valence. Principal component regression, partial least squares regression, and deep convolutional neural network (CNN) models are used as prediction models for the levels of the emotional dimensions. The experimental results indicate that the proposed features include additional spectral information that common baseline features may not include. Since the quality of audio signals, especially timbre, plays a major role in affecting the perception of emotional valence in music, the inclusion of the presented features will contribute to decreasing the prediction error rate. more »

Award ID(s):: 1846658

PAR ID:: 10134992

Author(s) / Creator(s):: Xie, Baijun; Kim, Jonathan C.; Park, Chung Hyuk.

Date Published:: 2020-01-01

Journal Name:: Applied Sciences

Volume:: 10

Issue:: 3

ISSN:: 1454-5101

Page Range / eLocation ID:: 902 - 912

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
The DOI is not currently available.

More Like this