Stars
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Examples of ComfyUI workflows
FFmpeg for browser, powered by WebAssembly
JavaScript audio library for the modern web.
An Implementation of Singing Voice Conversion Based on Diffsinger
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Chinese speech pretrained models
GUI for a Vocal Remover that uses Deep Neural Networks.
speech self-supervised representations
Speech synthesis model/inference GUI repo for galgame characters based on Tacotron2, Hifigan, VITS and Diff-svc
Noise suppression using deep filtering
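The most comprehensive database of Chinese poetry 🧶 Nearly 14,000 poets from the Tang and Song dynasties, with close to 55,000 Tang poems and 260,000 Song poems, plus 21,050 ci by 1,564 ci poets of the Northern and Southern Song.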
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
Singing Voice Conversion via diffusion model
Code for NeurIPS 2022 Paper, "Poisson Flow Generative Models" (PFGM)
AI-based multi-label girl image classification system, implemented with TensorFlow.
Stable Diffusion web UI
openvpi / DiffSinger
Forked from MoonInTheRiver/DiffSinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
A collection of resources and papers on Diffusion Models
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis