This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages the WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trains linear multi-head self-attention layers on top of them to extract vocal beat activations. Then, it uses an HMM decoder to infer singing beats and t…
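The repository's actual decoder is not reproduced here; as an illustrative sketch of the final step, frame-wise beat activations can be decoded with Viterbi over a simple two-state (beat / no-beat) HMM. The function name, the two-state topology, and the `p_stay` parameter are assumptions for the example, not the system's real decoder:

```python
import numpy as np

def viterbi_beats(act, p_stay=0.9):
    """Decode a beat/no-beat state sequence from frame-wise beat
    activations `act` (values in [0, 1]) with a 2-state HMM."""
    T = len(act)
    eps = 1e-9
    # emission log-probs: state 0 = no beat, state 1 = beat
    log_em = np.log(np.stack([1.0 - act, act]) + eps)          # (2, T)
    log_tr = np.log(np.array([[p_stay, 1 - p_stay],
                              [1 - p_stay, p_stay]]) + eps)    # (prev, cur)
    delta = np.zeros((2, T))
    back = np.zeros((2, T), dtype=int)
    delta[:, 0] = np.log(0.5) + log_em[:, 0]                   # uniform prior
    for t in range(1, T):
        scores = delta[:, t - 1, None] + log_tr                # (prev, cur)
        back[:, t] = np.argmax(scores, axis=0)
        delta[:, t] = np.max(scores, axis=0) + log_em[:, t]
    # backtrace the most likely state path
    path = np.zeros(T, dtype=int)
    path[-1] = np.argmax(delta[:, -1])
    for t in range(T - 2, -1, -1):
        path[t] = back[path[t + 1], t + 1]
    return path
```

Frames decoded as state 1 would then be grouped into beat events; real systems typically use a richer state space that also tracks tempo and beat phase.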
This repository contains different approaches I tried for improving ASR systems for accented English speech. All of them use the HuBERT model as a baseline.
Implementation of Speech Emotion Recognition (SER) on the CREMA-D dataset using both log-Mel spectrograms and HuBERT embeddings. Includes preprocessing, feature extraction, CNN/MLP models, training/evaluation scripts, and visualization tools for analyzing accuracy, loss, and confusion matrices.
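The repository's own feature-extraction scripts are not shown here; as a minimal sketch of one of the two front ends it mentions, a log-Mel spectrogram can be computed with plain numpy. The function name and the default parameters (16 kHz audio, 400-sample window, 160-sample hop, 40 Mel bands) are illustrative assumptions:

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def log_mel_spectrogram(wav, sr=16000, n_fft=400, hop=160, n_mels=40):
    # frame the signal and apply a Hann window
    n_frames = 1 + (len(wav) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = wav[idx] * np.hanning(n_fft)
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2          # (frames, bins)
    # triangular Mel filterbank spanning 0 .. sr/2
    mels = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mels) / sr).astype(int)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        l, c, r = bins[i], bins[i + 1], bins[i + 2]
        if c > l:
            fb[i, l:c] = (np.arange(l, c) - l) / (c - l)
        if r > c:
            fb[i, c:r] = (r - np.arange(c, r)) / (r - c)
    return np.log(power @ fb.T + 1e-10)                       # (frames, n_mels)
```

In practice a library such as librosa or torchaudio would be used instead; the HuBERT-embedding front end additionally requires loading the pre-trained model.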
Unofficial PyTorch implementation of Higgs Audio V2 Tokenizer with HuBERT semantic features. Complete training pipeline for semantic-acoustic audio tokenization with 960x downsampling and 8-layer RVQ.
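The tokenizer's actual RVQ module is not reproduced here; as an illustrative sketch of the residual vector quantization idea it mentions, each layer quantizes the residual left by the previous one. The class name, random (untrained) codebooks, and the dimensions are assumptions for the example:

```python
import numpy as np

class ResidualVQ:
    """Minimal residual vector quantizer: layer i quantizes the
    residual left after layers 0..i-1 (untrained, for illustration)."""
    def __init__(self, n_layers=8, codebook_size=256, dim=64, seed=0):
        rng = np.random.default_rng(seed)
        self.codebooks = rng.standard_normal((n_layers, codebook_size, dim))

    def encode(self, x):
        codes, residual = [], x.copy()
        for cb in self.codebooks:
            # nearest codeword for every vector in the current residual
            d = ((residual[:, None, :] - cb[None, :, :]) ** 2).sum(-1)
            idx = d.argmin(axis=1)
            codes.append(idx)
            residual = residual - cb[idx]
        return np.stack(codes), residual

    def decode(self, codes):
        # reconstruction is the sum of the selected codewords per layer
        return sum(cb[idx] for cb, idx in zip(self.codebooks, codes))
```

By construction, the input equals the reconstruction plus the final residual; a trained tokenizer learns the codebooks so that the residual after the last layer is small.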
Functionality for speech data processing, including time alignment, encoding with speech encoders (tokenizers), and preprocessing of common datasets.