Try DeezerGet notified

enfr br

An error occured Please, provide your email address This address is invalid Your email is already subscribed. Do you wish to unsuscribe ? newsletter nolist newsletter refreshed You've been unsuscribed Your email has been registered

Subscribe

Your email adress

Subscribe

By signing up, you agree to receive emails every time the Deezer Newsroom is updated. You can unsubscribe at any time by clicking the link at the bottom of each email.

Music Mood Detection based on Audio and Lyrics with Deep Neural Net

Deezer Sep 26, 2018 1 min read

In this paper, we consider the task of multimodal music mood prediction based on the audio signal and the lyrics of a track.

Audio and Lyrics

We reproduce the implementation of traditional feature engineering based approaches and propose a new model based on deep learning.

We compare the performance of both approaches on a database containing 18,000 tracks with associated valence and arousal values and show that our approach outperforms classical models on the arousal detection task, and that both approaches perform equally on the valence prediction task.

Multimodal

We also compare the a posteriori fusion with fusion of modalities optimized simultaneously with each unimodal model, and observe a significant improvement of valence prediction.
We release part of our database for comparison purposes.

Performances

This paper has been published in the proceedings of the 19th International Society for Music Information Retrieval Conference (ISMIR 2018).
It generated quite a large press coverage, for instance in Engadget.com (EN) or Clubic.com (FR).

Written by Deezer

Up Next

Deezer Deezer launches music solution for businesses with high profile brands on-board

Jun 26, 2025 3 min read

Deezer PeakNetFP: Peak-based Neural Audio Fingerprinting Robust to Extreme Time Stretching

Jun 25, 2025 2 min read

Deezer Deezer launches world’s first AI tagging system for music streaming

Jun 20, 2025 4 min read