Highlights
- Pro
Stars
Official implementation of "Something from Nothing: Data Augmentation for Robust Severity Level Estimation of Dysarthric Speech"
A LaTex paper template for security and machine learning conferences
Source code for the paper "Long-Tail Learning with Foundation Model: Heavy Fine-Tuning Hurts" (ICML 2024)
Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)
ICASSP 2025 Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement
UT-Sarulab MOS prediction system using SSL models
The implementation of personalized speech enhancement system based on synthetic data augmentation.
Documentation and background of sign language processing
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Extract xvector and ivector under kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
A python wrapper for Speech Signal Processing Toolkit (SPTK).
A Python wrapper for the high-quality vocoder "World"
A high-quality speech analysis, manipulation and synthesis system
A (PyTorch) imbalanced dataset sampler for oversampling low frequent classes and undersampling high frequent ones.
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Speech recognition module for Python, supporting several engines and APIs, online and offline.