hubert

This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trains linear multi-head self-attention layers on top of them to extract vocal beat activations. Then, it uses an HMM decoder to infer singing beats and t…

music music-information-retrieval beat-tracking self-supervised singing-voice hubert linear-transformer wavlm

Updated Sep 4, 2022
Python

yaya-sy / speechscorer

Unsupervised scoring of spoken utterances

speech speech-recognition whisper self-supervised-learning speech-translation hubert

Updated Nov 21, 2023
Python

sadPororo / L-TDNN

Layer-aware TDNN: Speaker Recognition Using Multi-Layer Features from Pre-Trained Models, to appear in ICAIIC 2026

pretrained-models speaker-recognition speaker-verification hubert wav2vec2 wavlm

Updated Dec 17, 2025
Python

backspacetg / distilAlhubert

Code for our paper "DistilALHuBERT: A Distilled Parameter Sharing Audio Representation Model"

asr distillation hubert

Updated Mar 30, 2025
Python

Amir-Ivry / MAPSS-measures

The code for the MAPSS measures for source separation evaluation.

ai mos diffusion-maps source-separation psychoacoustics speech-separation perceptual-evaluation audio-quality mert hubert wav2vec2 wavlm music-sources-separation speech-measures

Updated Sep 17, 2025
Python

sadPororo / LAP

Rethinking Leveraging Pre-Trained Multi-Layer Representations for Speaker Verification, ISCA Interspeech 2025

pretrained-models speaker-verification voxceleb voxceleb2 hubert wav2vec2 wavlm

Updated May 31, 2025
Python

TerboucheHacene / speech-keyword-spotting

Speech keyword detection using a Wav2Vec model

transformers pytorch audio-classification keyword-spotting audio-processing fine-tuning onnx pytorch-lightning hubert wav2vec2

Updated Nov 23, 2022
Python

pujariaditya / HiggsAudiov2TokenizerUnofficial

Unofficial PyTorch implementation of Higgs Audio V2 Tokenizer with HuBERT semantic features. Complete training pipeline for semantic-acoustic audio tokenization with 960x downsampling and 8-layer RVQ.

pytorch audio-synthesis speech-processing audio-processing vector-quantization dac semantic-features hubert audio-generation neural-audio-codec rvq audio-tokenizer neural-codec higgs-audio speech-tokenization

Updated Oct 8, 2025
Python
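
The description above mentions 960x downsampling and an 8-layer RVQ (residual vector quantizer). As a rough illustration of what those terms mean, here is a minimal pure-Python sketch. It is not code from the repository: the function names, the random codebooks, the codebook size of 16, the 4-dimensional frames, and the 24 kHz sample rate are all assumptions made for the example.

```python
import random

def nearest(codebook, vec):
    """Index of the codebook entry closest to vec (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vec)),
    )

def rvq_encode(vec, codebooks):
    """Quantize vec in stages; each stage encodes the residual left by earlier stages."""
    residual = list(vec)
    codes = []
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        codes.append(idx)
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return codes

def rvq_decode(codes, codebooks):
    """Reconstruct a vector by summing the selected entry from every stage."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(codes, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out

def frames_per_second(sample_rate, downsample=960):
    """Token frame rate after downsampling the waveform by the given factor."""
    return sample_rate / downsample

if __name__ == "__main__":
    random.seed(0)
    # Hypothetical sizes: 8 stages, 16 entries per stage, 4-dimensional frames.
    DIM, NUM_LAYERS, CODEBOOK_SIZE = 4, 8, 16
    codebooks = [
        [[random.gauss(0, 1.0 / (layer + 1)) for _ in range(DIM)]
         for _ in range(CODEBOOK_SIZE)]
        for layer in range(NUM_LAYERS)
    ]
    frame = [0.3, -1.2, 0.7, 0.1]
    codes = rvq_encode(frame, codebooks)
    print("codes:", codes)                      # one index per RVQ layer
    print("reconstruction:", rvq_decode(codes, codebooks))
    # Assumed 24 kHz input: 24000 / 960 = 25 token frames per second.
    print("frames/s:", frames_per_second(24_000))
```

Each audio frame ends up represented by one code index per layer, so under these assumptions an 8-layer RVQ at 25 frames per second would emit 200 indices per second of audio. A real tokenizer learns its codebooks during training rather than sampling them at random.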

anilkeshwani / speech-text-alignment

Functionality for speech data processing including time alignment, encoding with speech encoders (tokenizers) and data preprocessing of common datasets

speech speech-recognition data-pipeline asr hubert uroman

Updated Sep 19, 2025
Python

aitor-alvarez / acoustic-transformer-models

Acoustic Transformer Models for Audio Classification

classification acoustic transformer-models pytorch-lightning hubert wav2vec2 wavlm

Updated Feb 15, 2025
Python

GiovaneIwamoto / voice-cloning-bark-hubert

🐶 Voice Cloning Bark HuBERT - Enables voice cloning from personalized audio samples by processing the model's outputs into semantic tokens compatible with a text-to-audio system.

tts bark voice-cloning hubert

Updated Oct 22, 2024
Python

omkar-nitsure / Accent-Adaptation-Codebooks

This repository contains different approaches I tried for improving ASR systems for accented English speech. All of them use the HuBERT model as the baseline.

transformer attention asr-model codebook-approach hubert

Updated Dec 6, 2024
Python

omidnaeej / Speech-Emotion-Recognition-Mel-Spectrograms-and-HuBERT-Embeddings

Implementation of Speech Emotion Recognition (SER) on the CREMA-D dataset using both log-Mel spectrograms and HuBERT embeddings. Includes preprocessing, feature extraction, CNN/MLP models, training/evaluation scripts, and visualization tools for analyzing accuracy, loss, and confusion matrices.

speech-emotion-recognition mel-spectrogram hubert

Updated Sep 26, 2025
Python

PeterGilles / luxembourgish-vowel-classifier

Luxembourgish Vowel Classifier

phonetics luxembourgish hubert wav2vec2

Updated May 26, 2025
Python

akash13s / audio-to-image

Pipeline for generating images conditioned on input audio

pytorch u-net diffusion-models hubert wav2vec2

Updated Jul 25, 2024
Python

hubert

Here are 19 public repositories matching this topic...

voicepaw / so-vits-svc-fork

s3prl / s3prl

lstrgar / self-supervised-phone-segmentation

ECNU-Cross-Innovation-Lab / ShiftSER

mjhydri / Singing-Vocal-Beat-Tracking

yaya-sy / speechscorer

sadPororo / L-TDNN

backspacetg / distilAlhubert

Amir-Ivry / MAPSS-measures

sadPororo / LAP

TerboucheHacene / speech-keyword-spotting

pujariaditya / HiggsAudiov2TokenizerUnofficial

anilkeshwani / speech-text-alignment

aitor-alvarez / acoustic-transformer-models

GiovaneIwamoto / voice-cloning-bark-hubert

omkar-nitsure / Accent-Adaptation-Codebooks

omidnaeej / Speech-Emotion-Recognition-Mel-Spectrograms-and-HuBERT-Embeddings

PeterGilles / luxembourgish-vowel-classifier

akash13s / audio-to-image
