-
-
claude-code Public
Forked from ultraworkers/claw-codeClaude Code Snapshot for Research. All original source code is the property of Anthropic.
TypeScript UpdatedMar 31, 2026 -
Spark-TTS Public
Forked from SparkAudio/Spark-TTSSpark-TTS Inference Code
Python Apache License 2.0 UpdatedApr 9, 2025 -
-
X-Codec-2.0 Public
Forked from zhenye234/X-Codec-2.0Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
Python MIT License UpdatedMar 12, 2025 -
-
stable-codec Public
Forked from Stability-AI/stable-codecA family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.
Python MIT License UpdatedJan 10, 2025 -
SNAC-Vocos Public
A trainer for SNAC (Multi-Scale Neural Audio Codec) has replaced the decoder with Vocos.
-
-
-
ctc-forced-aligner Public
Forked from MahmoudAshraf97/ctc-forced-alignerText to speech alignment using CTC forced alignment
Python UpdatedJun 24, 2024 -
ttts Public
Forked from adelacvg/tttsTrain the next generation of TTS systems.
Python Mozilla Public License 2.0 UpdatedJan 17, 2024 -
UniAudio Public
Forked from yangdongchao/UniAudioThe Open Source Code of UniAudio
Python UpdatedOct 22, 2023 -
new-pac Public
Forked from Alvin9999/new-pac翻墙-科学上网、免费科学上网、免费翻墙、油管youtube、fanqiang、VPN、一键翻墙浏览器,vps一键搭建翻墙服务器脚本/教程,免费shadowsocks/ss/ssr/v2ray/goflyway账号/节点,免费自由上网、翻墙梯子,电脑、手机、iOS、安卓、windows、Mac、Linux、路由器翻墙、科学上网
UpdatedOct 10, 2023 -
facialanimation Public
Forked from on1262/facialanimationSource code for: Expressive Speech-driven Facial Animation with controllable emotions
Python Apache License 2.0 UpdatedJul 13, 2023 -
-
so-vits-svc-5.0 Public
Forked from PlayVoice/whisper-vits-svcCore Engine of Singing Voice Conversion & Singing Voice Clone
Python MIT License UpdatedMay 19, 2023 -
unilm Public
Forked from microsoft/unilmLarge-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Python MIT License UpdatedJan 11, 2023 -
GenerSpeech Public
Forked from Rongjiehuang/GenerSpeechPyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
Python MIT License UpdatedDec 12, 2022 -
x-ui Public
Forked from vaxilu/x-ui支持多协议多用户的 xray 面板
JavaScript GNU General Public License v3.0 UpdatedNov 6, 2022 -
SiFiGAN Public
Forked from chomeyama/SiFiGANOfficial implementation of the source-filter HiFiGAN vocoder
Python MIT License UpdatedNov 1, 2022 -
stable-diffusion Public
Forked from CompVis/stable-diffusionA latent text-to-image diffusion model
Jupyter Notebook Other UpdatedOct 20, 2022 -
ProDiff Public
Forked from Rongjiehuang/ProDiffPyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline
Python MIT License UpdatedSep 24, 2022 -
g2pW Public
Forked from GitYCC/g2pWMandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音
Python Apache License 2.0 UpdatedAug 22, 2022 -
ddsp-singing-vocoders Public
Forked from YatingMusic/ddsp-singing-vocodersOfficial implementation of SawSing (ISMIR'22)
Python GNU Affero General Public License v3.0 UpdatedAug 14, 2022 -
phonemizer Public
Forked from bootphon/phonemizerSimple text to phones converter for multiple languages
-
-
nnsvs Public
Forked from nnsvs/nnsvsNeural network-based singing voice synthesis library for research
Python MIT License UpdatedJun 21, 2022 -
VITS_Singing Public
Forked from PlayVoice/VI-SVSUse VITS and Opencpop to develop singing voice synthesis; Maybe it will VISinger.
Python Apache License 2.0 UpdatedMar 17, 2022 -
DiffSinger Public
Forked from MoonInTheRiver/DiffSingerDiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code