This project builds a lyrics-to-audio alignment system that synchronizes the audio of a polyphonic song with its lyrics and produces time-aligned lyrics with word-level onsets and offsets, exported as a .lrc file. A deep-learning-based system approaches the problem in three steps: separating the vocals, recognizing the singing voice, and performing forced alignment. For singing voice recognition, transfer learning is used to apply knowledge learned in the speech domain to the singing domain.
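As a rough sketch of how these three stages compose (every name below is a hypothetical placeholder, not the actual lsync internals):

```python
# Hypothetical sketch of the three-stage pipeline described above;
# none of these names come from the lsync codebase.

def separate_vocals(mixture_path):
    """Stage 1: isolate the vocal stem from the polyphonic mix
    (e.g. with a source-separation model)."""
    ...

def recognize_singing(vocals_path):
    """Stage 2: run the speech-pretrained, singing-fine-tuned acoustic
    model to get frame-level phoneme/character probabilities."""
    ...

def force_align(frame_probs, lyrics):
    """Stage 3: align the lyrics text against the frame-level
    probabilities to recover word onsets and offsets."""
    ...

def sync(mixture_path, lyrics):
    vocals = separate_vocals(mixture_path)
    frame_probs = recognize_singing(vocals)
    return force_align(frame_probs, lyrics)
```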
Install the environment, then run the aligner:

```
conda env update -f environment.yml
conda activate lsync
```

```python
from lsync import LyricsSync

lsync = LyricsSync()
# audio_path: path to the song file; lyrics_path: path to the plain-text lyrics
words, lrc = lsync.sync(audio_path, lyrics_path)
```

Please refer to demo.ipynb.
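Assuming `lrc` comes back as a plain string (an assumption about the return type; demo.ipynb is the authoritative reference), it can be written straight to disk:

```python
# Assumption: lsync.sync returns the .lrc content as a string.
with open("song.lrc", "w", encoding="utf-8") as f:
    f.write(lrc)
```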
If you want to visualize the generated .lrc file for evaluation, you can use Lrc Player.
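For reference, a .lrc file is plain text that pairs `[mm:ss.xx]` timestamps with lyric lines (extended variants add inline `<mm:ss.xx>` tags for word-level timing), for example:

```
[00:12.00]First line of the lyrics
[00:17.20]Second line of the lyrics
```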
If you want to fine-tune a Wav2Vec2 model for better accuracy in the singing domain, please refer to the experiments section below.
- Make a `dataset` folder in the root folder
- Download the DALI dataset and put it inside `dataset/DALI/v1`
- Similarly, you can download the jamendolyrics dataset for evaluation and put it in `dataset/jamendolyrics` (the expected layout is sketched after this list)
- Download all DALI songs using `python get_dataset.py`
- Run `dataset.ipynb` to prepare the DALI data for the fine-tuning tasks
  - Procedures include vocal extraction, line-level segmentation, and building the tokenizer
- Run `train.ipynb` to fine-tune the `facebook/wav2vec2-base` model for singing voice recognition (a hedged sketch of this recipe also follows the list)
- Run `run.ipynb` to see how to use the `lsync` library for lyrics-to-audio alignment based on the fine-tuned model
  - Remember to update the model path to your model's path inside `lsync/phoneme_recognizer.py`
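After the download steps above, the folder layout should look roughly like this (the exact file contents depend on how each dataset is packaged):

```
dataset/
├── DALI/
│   └── v1/            # DALI annotations and downloaded audio
└── jamendolyrics/     # evaluation set
```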
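The actual fine-tuning recipe lives in `train.ipynb`; purely as a hedged sketch, CTC fine-tuning of `facebook/wav2vec2-base` with HuggingFace `transformers` typically looks like the following. The vocabulary here is a toy stand-in for the tokenizer built in `dataset.ipynb`, and the file name `vocab.json` is an assumption:

```python
import json
from transformers import (
    Wav2Vec2CTCTokenizer,
    Wav2Vec2FeatureExtractor,
    Wav2Vec2Processor,
    Wav2Vec2ForCTC,
)

# Toy character vocabulary so the sketch is self-contained;
# dataset.ipynb builds the real one from the DALI transcripts.
vocab = {c: i for i, c in enumerate("abcdefghijklmnopqrstuvwxyz'|")}
vocab["[UNK]"] = len(vocab)
vocab["[PAD]"] = len(vocab)
with open("vocab.json", "w") as f:
    json.dump(vocab, f)

tokenizer = Wav2Vec2CTCTokenizer(
    "vocab.json", unk_token="[UNK]", pad_token="[PAD]", word_delimiter_token="|"
)
feature_extractor = Wav2Vec2FeatureExtractor(
    feature_size=1, sampling_rate=16000, padding_value=0.0, do_normalize=True
)
processor = Wav2Vec2Processor(feature_extractor=feature_extractor, tokenizer=tokenizer)

# Load the speech-pretrained encoder and attach a randomly initialized
# CTC head sized to the singing vocabulary (the transfer-learning step).
model = Wav2Vec2ForCTC.from_pretrained(
    "facebook/wav2vec2-base",
    ctc_loss_reduction="mean",
    pad_token_id=processor.tokenizer.pad_token_id,
    vocab_size=len(processor.tokenizer),
)
model.freeze_feature_encoder()  # keep the low-level CNN features frozen

# From here, training proceeds with a standard transformers Trainer over
# the line-level (vocal stem, lyric text) pairs prepared by dataset.ipynb.
```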