-
-
stable-audio-3 Public
Forked from Stability-AI/stable-audio-3Python MIT License UpdatedMay 20, 2026 -
Confucius4-TTS Public
Forked from netease-youdao/Confucius4-TTSApache License 2.0 UpdatedMay 20, 2026 -
-
Auto-claude-code-research-in-sleep Public
Forked from wanshuiyin/Auto-claude-code-research-in-sleepARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…
Python MIT License UpdatedMay 7, 2026 -
graphify Public
Forked from safishamsi/graphifyAI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, GitHub Copilot CLI, OpenClaw, Factory Droid, Trae, Google Antigravity). Turn any folder of code, docs, papers, images, o…
Python MIT License UpdatedApr 21, 2026 -
AffectSpeech Public
Forked from jeremychee4/AffectSpeechAffectSpeech: A Large-Scale Emotional Speech Dataset with Fine-Grained Textual Descriptions for Speech Emotion Captioning and Synthesis
UpdatedApr 7, 2026 -
paraspeechclap Public
Forked from ajd12342/paraspeechclapCodebase for 'ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining'
Python MIT License UpdatedApr 6, 2026 -
gemma Public
Forked from google-deepmind/gemmaGemma open-weight LLM library, from Google DeepMind
Python Apache License 2.0 UpdatedApr 3, 2026 -
Raon-OpenTTS Public
Forked from krafton-ai/Raon-OpenTTSOpen-source text-to-speech model from KRAFTON trained exclusively on public speech data, with curated datasets and reproducible training support.
Python Apache License 2.0 UpdatedApr 2, 2026 -
claw-code Public
Forked from ultraworkers/claw-codeThe fastest repo in history to surpass 50K stars ⭐, reaching the milestone in just 2 hours after publication. Better Harness Tools, not merely storing the archive of leaked Claude Code but also mak…
Rust UpdatedApr 1, 2026 -
claude-code-source Public
Forked from hangsman/claude-code-sourceclaude code source map v2.1.88
TypeScript UpdatedMar 31, 2026 -
voxtral-tts.c Public
Forked from mudler/voxtral-tts.cPure C implementation of Voxtral-4B-TTS-2603
C MIT License UpdatedMar 27, 2026 -
TTS-arxiv-daily Public
Forked from liutaocode/TTS-arxiv-dailyAutomatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)
Python Apache License 2.0 UpdatedMar 27, 2026 -
SoulX-Duplug Public
Forked from Soul-AILab/SoulX-DuplugPlug-and-play streaming semantic VAD for real-time full-duplex spoken dialogue systems.
Python Apache License 2.0 UpdatedMar 16, 2026 -
Resonate Public
Forked from xiquan-li/ResonatePre-training, SFT, DPO and GRPO for Text-to-Audio Generation
Python MIT License UpdatedMar 12, 2026 -
-
Ming Public
Forked from inclusionAI/MingMing - facilitating advanced multimodal understanding and generation capabilities built upon the Ling LLM.
Jupyter Notebook MIT License UpdatedFeb 12, 2026 -
awesome-controllable-speech-synthesis Public
Forked from imxtx/awesome-controllable-speech-synthesisThis is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey".
MIT License UpdatedJan 27, 2026 -
delayed-streams-modeling Public
Forked from kyutai-labs/delayed-streams-modelingKyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
Python Apache License 2.0 UpdatedJan 26, 2026 -
liquid-audio Public
Forked from Liquid4All/liquid-audioLiquid Audio - Speech-to-Speech audio models by Liquid AI
Python Other UpdatedJan 24, 2026 -
Qwen3-TTS Public
Forked from QwenLM/Qwen3-TTSQwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
Python Apache License 2.0 UpdatedJan 22, 2026 -
-
Step-Audio-R1 Public
Forked from stepfun-ai/Step-Audio-R1Python Apache License 2.0 UpdatedJan 15, 2026 -
pocket-tts Public
Forked from kyutai-labs/pocket-ttsA TTS that fits in your CPU (and pocket)
Python MIT License UpdatedJan 14, 2026 -
LEMAS-TTS Public
Forked from LEMAS-Project/LEMAS-TTSLEMAS‑TTS is a multilingual zero‑shot text‑to‑speech system, supporting 10 languages: Chinese English Spanish Russian French German Italian Portuguese Indonesian Vietnamese
Python UpdatedJan 9, 2026 -
lattifai-python Public
Forked from lattifai/lattifai-pythonPrecision Alignment, Infinite Possibilities
Python MIT License UpdatedJan 7, 2026 -
UltraEval-Audio Public
Forked from OpenBMB/UltraEval-AudioYour faithful, impartial partner for audio evaluation — know yourself, know your rivals. 真实评测,知己知彼。
Python Apache License 2.0 UpdatedJan 4, 2026 -
-
SpeechJudge Public
Forked from AmphionTeam/SpeechJudgeSpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)
Python UpdatedDec 23, 2025