Story2MIDI: Emotionally Aligned Music Generation from Text

Shokri, Mohammad; Salem, Alexandra C; Levine, Gabriel; Devaney, Johanna; Levitan, Sarah Ita

Citation Details

This content will become publicly available on December 9, 2026

Story2MIDI: Emotionally Aligned Music Generation from Text

In this paper, we introduce Story2MIDI, a sequence-to-sequence Transformer-based model for generating emotion-aligned music from a given piece of text. To develop this model, we construct the Story2MIDI dataset by merging existing datasets for sentiment analysis from text and emotion classification in music. The resulting dataset contains pairs of text blurbs and music pieces that evoke the same emotions in the reader or listener. Despite the small scale of our dataset and limited computational resources, our results indicate that our model effectively learns emotion-relevant features in music and incorporates them into its generation process, producing samples with diverse emotional responses. We evaluate the generated outputs using objective musical metrics and a human listening study, confirming the model’s ability to capture intended emotional cues. more »

Award ID(s):: 2228910

PAR ID:: 10657699

Author(s) / Creator(s):: Shokri, Mohammad; Salem, Alexandra C; Levine, Gabriel; Devaney, Johanna; Levitan, Sarah Ita

Publisher / Repository:: Proceedings of IEEE Big Data 2025 (Presented at 3rd Workshop on AI Music Generation, AIMG 2025)

Date Published:: 2025-12-09

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on December 9, 2026
Conference Paper:
The DOI is not currently available.

More Like this