pan310

panpanpan pan310

1 follower · 35 following

Achievements

dots.tts Public
Forked from rednote-hilab/dots.tts

Python Apache License 2.0 Updated Jun 5, 2026
stable-audio-3 Public
Forked from Stability-AI/stable-audio-3

Python MIT License Updated May 20, 2026
Confucius4-TTS Public
Forked from netease-youdao/Confucius4-TTS

Apache License 2.0 Updated May 20, 2026
WavCube Public
Forked from yanghaha0908/WavCube

Python MIT License Updated May 8, 2026
Auto-claude-code-research-in-sleep Public
Forked from wanshuiyin/Auto-claude-code-research-in-sleep

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python MIT License Updated May 7, 2026
graphify Public
Forked from safishamsi/graphify

AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, GitHub Copilot CLI, OpenClaw, Factory Droid, Trae, Google Antigravity). Turn any folder of code, docs, papers, images, o…

Python MIT License Updated Apr 21, 2026
AffectSpeech Public
Forked from jeremychee4/AffectSpeech

AffectSpeech: A Large-Scale Emotional Speech Dataset with Fine-Grained Textual Descriptions for Speech Emotion Captioning and Synthesis

Updated Apr 7, 2026
paraspeechclap Public
Forked from ajd12342/paraspeechclap

Codebase for 'ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining'

Python MIT License Updated Apr 6, 2026
gemma Public
Forked from google-deepmind/gemma

Gemma open-weight LLM library, from Google DeepMind

Python Apache License 2.0 Updated Apr 3, 2026
Raon-OpenTTS Public
Forked from krafton-ai/Raon-OpenTTS

Open-source text-to-speech model from KRAFTON trained exclusively on public speech data, with curated datasets and reproducible training support.

Python Apache License 2.0 Updated Apr 2, 2026
claw-code Public
Forked from ultraworkers/claw-code

The fastest repo in history to surpass 50K stars ⭐, reaching the milestone in just 2 hours after publication. Better Harness Tools, not merely storing the archive of leaked Claude Code but also mak…

Rust Updated Apr 1, 2026
claude-code-source Public
Forked from hangsman/claude-code-source

claude code source map v2.1.88

TypeScript Updated Mar 31, 2026
voxtral-tts.c Public
Forked from mudler/voxtral-tts.c

Pure C implementation of Voxtral-4B-TTS-2603

C MIT License Updated Mar 27, 2026
TTS-arxiv-daily Public
Forked from liutaocode/TTS-arxiv-daily

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python Apache License 2.0 Updated Mar 27, 2026
SoulX-Duplug Public
Forked from Soul-AILab/SoulX-Duplug

Plug-and-play streaming semantic VAD for real-time full-duplex spoken dialogue systems.

Python Apache License 2.0 Updated Mar 16, 2026
Resonate Public
Forked from xiquan-li/Resonate

Pre-training, SFT, DPO and GRPO for Text-to-Audio Generation

Python MIT License Updated Mar 12, 2026
SemanticVocoder Public
Forked from zeyuxie29/SemanticVocoder

Python Updated Mar 8, 2026
Ming Public
Forked from inclusionAI/Ming

Ming - facilitating advanced multimodal understanding and generation capabilities built upon the Ling LLM.

Jupyter Notebook MIT License Updated Feb 12, 2026
awesome-controllable-speech-synthesis Public
Forked from imxtx/awesome-controllable-speech-synthesis

This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey".

MIT License Updated Jan 27, 2026
delayed-streams-modeling Public
Forked from kyutai-labs/delayed-streams-modeling

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python Apache License 2.0 Updated Jan 26, 2026
liquid-audio Public
Forked from Liquid4All/liquid-audio

Liquid Audio - Speech-to-Speech audio models by Liquid AI

Python Other Updated Jan 24, 2026
Qwen3-TTS Public
Forked from QwenLM/Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python Apache License 2.0 Updated Jan 22, 2026
DeepASMR-DB-samples Public
Forked from vivian556123/DeepASMR-DB-samples

Updated Jan 15, 2026
Step-Audio-R1 Public
Forked from stepfun-ai/Step-Audio-R1

Python Apache License 2.0 Updated Jan 15, 2026
pocket-tts Public
Forked from kyutai-labs/pocket-tts

A TTS that fits in your CPU (and pocket)

Python MIT License Updated Jan 14, 2026
LEMAS-TTS Public
Forked from LEMAS-Project/LEMAS-TTS

LEMAS‑TTS is a multilingual zero‑shot text‑to‑speech system, supporting 10 languages: Chinese English Spanish Russian French German Italian Portuguese Indonesian Vietnamese

Python Updated Jan 9, 2026
lattifai-python Public
Forked from lattifai/lattifai-python

Precision Alignment, Infinite Possibilities

Python MIT License Updated Jan 7, 2026
UltraEval-Audio Public
Forked from OpenBMB/UltraEval-Audio

Your faithful, impartial partner for audio evaluation — know yourself, know your rivals. 真实评测，知己知彼。

Python Apache License 2.0 Updated Jan 4, 2026
armel Public
Forked from bfs18/armel

poorman's ar-dit tts

Python Updated Dec 31, 2025
SpeechJudge Public
Forked from AmphionTeam/SpeechJudge

SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)

Python Updated Dec 23, 2025

panpanpan pan310

Achievements

Achievements

dots.tts Public

Uh oh!

stable-audio-3 Public

Uh oh!

Confucius4-TTS Public

Uh oh!

WavCube Public

Uh oh!

Auto-claude-code-research-in-sleep Public

Uh oh!

graphify Public

Uh oh!

AffectSpeech Public

Uh oh!

paraspeechclap Public

Uh oh!

gemma Public

Uh oh!

Raon-OpenTTS Public

Uh oh!

claw-code Public

Uh oh!

claude-code-source Public

Uh oh!

voxtral-tts.c Public

Uh oh!

TTS-arxiv-daily Public

Uh oh!

SoulX-Duplug Public

Uh oh!

Resonate Public

Uh oh!

SemanticVocoder Public

Uh oh!

Ming Public

Uh oh!

awesome-controllable-speech-synthesis Public

Uh oh!

delayed-streams-modeling Public

Uh oh!

liquid-audio Public

Uh oh!

Qwen3-TTS Public

Uh oh!

DeepASMR-DB-samples Public

Uh oh!

Step-Audio-R1 Public

Uh oh!

pocket-tts Public

Uh oh!

LEMAS-TTS Public

Uh oh!

lattifai-python Public

Uh oh!

UltraEval-Audio Public

Uh oh!

armel Public

Uh oh!

SpeechJudge Public

Uh oh!