Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,554 772 Updated May 27, 2025

acids-ircam / RAVE

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Python 1,631 208 Updated Jun 23, 2025

wangkai930418 / awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

2,087 95 Updated Dec 19, 2025

EricGuo5513 / momask-codes

Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"

Python 1,184 97 Updated Sep 13, 2024

haoheliu / AudioLDM2

Text-to-Audio/Music Generation

Python 2,541 202 Updated Sep 29, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,154 6,626 Updated Dec 19, 2025

teticio / audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Jupyter Notebook 783 77 Updated Sep 25, 2024

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 6,096 644 Updated Aug 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SONGsong 99-song

Block or report 99-song

Stars

shivammehta25 / Matcha-TTS

FunAudioLLM / CosyVoice

xingchensong / S3Tokenizer

MontrealCorpusTools / mfa-models

MontrealCorpusTools / Montreal-Forced-Aligner

tabahi / bournemouth-forced-aligner