Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,593 1,956 Updated Apr 15, 2026

PKU-YuanGroup / Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 12,162 1,066 Updated Mar 8, 2026

guoyww / AnimateDiff

Official implementation of AnimateDiff.

Python 12,106 1,067 Updated Jul 31, 2024

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 10,105 944 Updated Apr 28, 2026

ace-step / ACE-Step-1.5

The most powerful local music generation model that outperforms almost all commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.

Python 9,847 1,173 Updated Apr 30, 2026

espnet / espnet

End-to-End Speech Processing Toolkit

Python 9,821 2,400 Updated Apr 30, 2026

nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Python 8,372 1,899 Updated Apr 10, 2026

openai / jukebox

Code for the paper "Jukebox: A Generative Model for Music"

Python 8,041 1,457 Updated Jun 19, 2024

boson-ai / higgs-audio

Text-audio foundation model from Boson AI

Python 8,033 619 Updated Jan 18, 2026

freemocap / freemocap

Free Motion Capture for Everyone 💀✨

Python 7,516 632 Updated Apr 30, 2026

OpenNMT / OpenNMT-py

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 7,004 2,252 Updated Oct 14, 2025

microsoft / MMdnn

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, …

Python 5,812 957 Updated Aug 7, 2025

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 5,572 588 Updated Dec 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

asr-pub

Achievements

Achievements

Block or report asr-pub

Stars

hiyouga / LlamaFactory

RVC-Boss / GPT-SoVITS

jingyaogong / minimind

coqui-ai / TTS

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

yunjey / pytorch-tutorial

facebookresearch / fairseq

fishaudio / fish-speech

svc-develop-team / so-vits-svc

Delgan / loguru

FunAudioLLM / CosyVoice

index-tts / index-tts

ddbourgin / numpy-ml

flairNLP / flair

youfou / wxpy

mlfoundations / open_clip

zalandoresearch / fashion-mnist

PaddlePaddle / PaddleSpeech