Ray Li yurayli

23 followers · 85 following

Taipei, Taiwan

Stars

Speech & Voice ML

66 repositories

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python · 92,097 stars · 11,542 forks · Updated Dec 15, 2025

lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA language modeling approach to audio generation out of Google Research, in PyTorch

Python · 2,611 stars · 280 forks · Updated Jan 12, 2025

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python · 19,509 stars · 1,628 forks · Updated Nov 19, 2025

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python · 10,939 stars · 1,616 forks · Updated Dec 15, 2025

facebookresearch / svoice

A PyTorch implementation of the paper "Voice Separation with an Unknown Number of Multiple Speakers", which presents a new method for separating a mixed audio sequence, in which multi…

Python · 1,316 stars · 188 forks · Updated Nov 16, 2023

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook · 8,854 stars · 984 forks · Updated Dec 13, 2025

jiaaro / pydub

Manipulate audio with a simple and easy high-level interface

Python · 9,678 stars · 1,123 forks · Updated Jul 26, 2025

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python · 19,214 stars · 2,044 forks · Updated Oct 21, 2025

haoheliu / voicefixer

General Speech Restoration

Python · 1,252 stars · 151 forks · Updated Feb 17, 2025

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python · 7,648 stars · 697 forks · Updated Dec 10, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python · 16,310 stars · 3,237 forks · Updated Dec 17, 2025

jasonppy / PromptingWhisper

Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation

Python · 151 stars · 13 forks · Updated Jan 16, 2024

sanchit-gandhi / whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook · 4,648 stars · 409 forks · Updated Apr 3, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python · 13,959 stars · 1,446 forks · Updated Dec 17, 2025

NVIDIA / CleanUNet

Official PyTorch Implementation of CleanUNet (ICASSP 2022)

Python · 340 stars · 58 forks · Updated Oct 11, 2023

nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System

Python · 8,324 stars · 1,910 forks · Updated Sep 6, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python · 9,644 stars · 2,361 forks · Updated Dec 16, 2025

facebookresearch / demucs

Code for the paper "Hybrid Spectrogram and Waveform Source Separation"

Python · 9,576 stars · 1,367 forks · Updated Apr 24, 2024

xiph / rnnoise

Recurrent neural network for audio noise reduction

C · 5,222 stars · 1,017 forks · Updated Feb 22, 2025

timsainb / noisereduce

Noise reduction in Python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Jupyter Notebook · 1,799 stars · 262 forks · Updated Aug 19, 2025

slhck / ffmpeg-normalize

Audio Normalization for Python/ffmpeg

HTML · 1,457 stars · 125 forks · Updated Nov 9, 2025

GregorR / rnnoise-models

Trained neural networks and requisite information and data for rnnoise-nu

C · 341 stars · 52 forks · Updated Sep 2, 2018

GregorR / rnnoise-nu

Recurrent neural network for audio noise reduction, slightly improved for general use

C · 125 stars · 24 forks · Updated Apr 25, 2019

sindresorhus / awesome-whisper

🔊 Awesome list for Whisper, an open-source AI-powered speech recognition system developed by OpenAI

1,952 stars · 98 forks · Updated Nov 5, 2025

jitsi / jiwer

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

Python · 831 stars · 108 forks · Updated Feb 15, 2025

iver56 / audiomentations

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python · 2,195 stars · 205 forks · Updated Sep 26, 2025

Vaibhavs10 / insanely-fast-whisper

Jupyter Notebook · 8,757 stars · 626 forks · Updated Oct 25, 2025

wq2012 / SimpleDER

A lightweight library to compute Diarization Error Rate (DER).

Python · 62 stars · 9 forks · Updated Aug 28, 2023

nryant / dscore

Diarization scoring tools.

Python · 260 stars · 46 forks · Updated Mar 28, 2023

axinc-ai / ailia-models

A collection of pre-trained, state-of-the-art AI models for ailia SDK

Python · 2,295 stars · 351 forks · Updated Dec 17, 2025