Stars
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
CjangCjengh / vits
Forked from jaywalnut310/vits. VITS implementation for Japanese, Chinese, Korean, Sanskrit, and Thai.
1 minute of voice data can also be used to train a good TTS model! (few-shot voice cloning)
Japanese converter: Kanji to Hiragana, Katakana, and Romaji.
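One piece of such a converter is easy to show without a dictionary: hiragana and katakana occupy parallel Unicode blocks offset by 0x60, so kana-to-kana conversion is a plain codepoint shift. This is a minimal sketch of that step only; the Kanji and Romaji stages require a morphological analyzer or lookup table and are not the repo's code.

```python
def hira_to_kata(text: str) -> str:
    """Shift hiragana codepoints (U+3041..U+3096) into the katakana block."""
    return ''.join(
        chr(ord(ch) + 0x60) if '\u3041' <= ch <= '\u3096' else ch
        for ch in text
    )

print(hira_to_kata('ひらがな'))  # ヒラガナ
```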
Official code for "DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptation"
Implementing DeepSeek R1's GRPO algorithm from scratch
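The core idea a from-scratch GRPO implementation reproduces is the group-relative advantage: sample several responses per prompt, score them with a reward model, and normalize each reward against its group's mean and standard deviation instead of a learned value baseline. A minimal sketch of that normalization (stdlib only, not the repo's code):

```python
from statistics import mean, pstdev

def group_relative_advantages(rewards, eps=1e-8):
    """GRPO-style advantage: each reward relative to its sampling group."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]

# Four sampled completions for one prompt, scored by a reward model.
advs = group_relative_advantages([1.0, 0.0, 0.5, 0.5])
# Advantages sum to ~0; the best completion gets a positive advantage.
```

These advantages then weight the policy-gradient loss per token, which is what lets GRPO drop the critic network used by PPO.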
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
Chinese polyphone disambiguation for Text-to-Speech application
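To illustrate the problem this repo addresses: many Chinese characters (polyphones) take different pinyin depending on context, e.g. 行 reads "háng" in 银行 (bank) but "xíng" in 行走 (to walk). The cue words and rule table below are hand-picked for the demo; the actual system learns disambiguation from data rather than using fixed rules.

```python
# Hypothetical rule table for the demo, keyed by polyphonic character.
POLYPHONE_RULES = {
    '行': [('银行', 'hang2'), ('行走', 'xing2')],
}

def disambiguate(char: str, context: str, default: str) -> str:
    """Pick a reading for `char` by matching cue words in the context."""
    for cue, reading in POLYPHONE_RULES.get(char, []):
        if cue in context:
            return reading
    return default

print(disambiguate('行', '我去银行取钱', 'xing2'))  # hang2 (bank sense)
```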
High-quality speech synthesis with LoRA fine-tuning on index-tts, enhancing prosody and naturalness for single and multi-speaker voices.
LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.
SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.
Production First and Production Ready End-to-End Text-to-Speech Toolkit
The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.
Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (ICASSP 2026)
Official implementation of the TTS model Lina-Speech
Qwen2.5-Omni is an end-to-end multimodal model by the Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, and video, and performing real-time speech generation.
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning