A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 15,372 1,614 Updated Mar 17, 2026

Embedding / Chinese-Word-Vectors

100+ Chinese Word Vectors 上百种预训练中文词向量

Python 12,189 2,325 Updated Oct 30, 2023

kkroening / ffmpeg-python

Python bindings for FFmpeg - with complex filtering support

Python 10,973 939 Updated Aug 4, 2024

QuentinFuxa / WhisperLiveKit

Simultaneous speech-to-text models

Python 9,980 1,023 Updated Mar 18, 2026

Uberi / speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python 8,963 2,435 Updated Mar 24, 2026

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 8,554 744 Updated Mar 8, 2026

lucidrains / imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Python 8,405 797 Updated Oct 7, 2024

librosa / librosa

Python library for audio and music analysis

Python 8,282 1,040 Updated Mar 24, 2026

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 7,812 713 Updated Dec 30, 2025

facebookresearch / SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 7,321 1,293 Updated Mar 16, 2026

lancopku / pkuseg-python

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

Python 6,704 987 Updated Nov 5, 2022

openai / consistency_models

Official repo for consistency models.

Python 6,475 433 Updated Mar 22, 2024

Blaizzy / mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Python 6,371 507 Updated Mar 24, 2026

tyiannak / pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python 6,234 1,224 Updated Aug 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

melodyless agangzz

Block or report agangzz

Stars

ytdl-org / youtube-dl

pytorch / pytorch

openai / whisper

zai-org / ChatGLM-6B

chenfei-wu / TaskMatrix

lllyasviel / ControlNet

matterport / Mask_RCNN

junyanz / pytorch-CycleGAN-and-pix2pix

microsoft / VibeVoice

microsoft / unilm

SYSTRAN / faster-whisper

m-bain / whisperX

magenta / magenta

kaixindelele / ChatPaper

huggingface / sentence-transformers

jaakkopasanen / AutoEq

modelscope / FunASR