Skip to content
View ggNGggG's full-sized avatar

Block or report ggNGggG

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 31 3 Updated Mar 18, 2026

arXiv 每日论文,每周一到周五更新。

31 3 Updated Mar 27, 2026

Open Source Speech Language Model

Jupyter Notebook 915 93 Updated Mar 24, 2026

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…

Python 1,008 96 Updated Mar 26, 2026

Chinese Text Normalization and Dataset

Python 91 17 Updated May 14, 2022

Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!

Python 193 34 Updated Mar 13, 2026

Executable file for VITS inference

Python 2,418 246 Updated Aug 22, 2023

VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai

Python 941 192 Updated Dec 6, 2023

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 56,149 6,132 Updated Feb 9, 2026

Japanese Converter Kanji to Hiragana, Katakana, Roma-ji

Python 12 1 Updated Jul 19, 2023

Official code for"DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptation"

Python 234 21 Updated Nov 28, 2025

全局指针统一处理嵌套与非嵌套NER的Pytorch实现

Python 415 48 Updated Mar 23, 2023

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,803 92 Updated Apr 18, 2025

GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning

Python 959 118 Updated Dec 17, 2025

汉字拼音数据

Python 1,447 232 Updated Feb 23, 2026

A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset

Python 362 74 Updated Dec 24, 2021

Chinese polyphone disambiguation for Text-to-Speech application

Python 42 11 Updated Jun 11, 2024

High-quality speech synthesis with LoRA fine-tuning on index-tts, enhancing prosody and naturalness for single and multi-speaker voices.

Python 298 25 Updated Mar 12, 2026

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 660 52 Updated Jan 21, 2026

FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.

Python 245 26 Updated Feb 25, 2026

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 3,260 424 Updated Dec 11, 2025

结巴中文分词

Python 34,823 6,711 Updated Aug 21, 2024

CMUdict maintenance, and tools

Roff 246 41 Updated Jan 8, 2025

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Python 417 63 Updated Nov 20, 2025

The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.

Python 184 7 Updated Feb 28, 2026

Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (ICASSP 2026)

JavaScript 67 4 Updated Jan 27, 2026

Official implementation of the TTS model Lina-Speech

Jupyter Notebook 179 14 Updated Jan 9, 2025
Python 6,079 470 Updated Aug 29, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,963 323 Updated Jun 12, 2025

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 6,190 748 Updated Mar 13, 2026
Next