An error occured Please, provide your email address This address is invalid Your email is already subscribed. Do you wish to unsuscribe ? newsletter nolist newsletter refreshed You've been unsuscribed Your email has been registered

Subscribe

Your email adress

By signing up, you agree to receive emails every time the Deezer Newsroom is updated. You can unsubscribe at any time by clicking the link at the bottom of each email.

Multilingual Lyrics-to-Audio Alignment

Andrea Vaglio Oct 11, 2020 2 min read

Lyrics-to-audio alignment methods have recently reported impressive results, opening the door to practical applications such as karaoke and within song navigation. However, most studies focus on a single language – usually English – for which annotated data are abundant. The question of their ability to generalize to other languages, especially in low (or even zero) training resource scenarios has been so far left unexplored.

In this paper, we address the lyrics-to-audio alignment task in a generalized multi-lingual setup. More precisely, this investigation presents the first (to the best of our knowledge) attempt to create a language-independent lyrics-to-audio alignment system. Building on a Recurrent Neural Network (RNN) model trained with a Connectionist Temporal Classification (CTC) algorithm, we study the relevance of different intermediate representations, either character or phoneme, along with several strategies to design a training set.

overview of the lyrics-to-audio alignment system

The evaluation is conducted on multiple languages with a varying amount of data available, from plenty to zero. Results show that learning from diverse data and using a universal phoneme set as an intermediate representation yield the best generalization performances.

Lyrics-to-audio evaluation on DALI language subset datasets for phoneme and character based architectures

This paper has been published in the proceedings of the 21st International Society for Music Information Retrieval Conference (ISMIR 2020).

Written by Andrea Vaglio

Up Next

Maribel Khouwes Deezer launches Remix Lab feature in France and lets fans remix songs from their favorite artists

Jun 24, 2026 5 min read

lbriand Music Playlist Captioning at Scale with Large Language Models

Jun 21, 2026 1 min read

Jaime.Deezer Deezer AI Music Detector: Find Out How Much AI-Generated Music Is in Your Playlist

Jun 11, 2026 1 min read