Skip to content
View GuodongQi's full-sized avatar
🎯
Focusing
🎯
Focusing
  • ZheJiang University

Block or report GuodongQi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

High-performance panel based on V2board secondary development supporting new protocols and new features

PHP 4,425 1,245 Updated Jun 17, 2026

Xray panel supporting multi-protocol multi-user expire day & traffic & IP limit (Vmess, Vless, Trojan, ShadowSocks, Wireguard, Hysteria, Tunnel, Mixed, HTTP, Tun)

Go 41,016 7,679 Updated Jun 18, 2026

Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"

Python 163 18 Updated Mar 3, 2026

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,654 362 Updated Jun 21, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 24,556 2,810 Updated May 25, 2026

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 12,034 1,561 Updated Mar 17, 2026

MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA …

Python 233 16 Updated Jun 16, 2026

Train the next generation of TTS systems.

Python 169 17 Updated Sep 13, 2024

Fast and memory-efficient exact attention

Python 24,189 2,844 Updated Jun 19, 2026

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech

Python 932 69 Updated Apr 9, 2026

Open-Source Frontier Voice AI

Python 49,483 5,516 Updated May 6, 2026

The best ChatGPT that $100 can buy.

Python 55,224 7,587 Updated May 5, 2026

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 58,865 6,434 Updated Jun 16, 2026

pyright fork with various type checking improvements, improved vscode support and pylance features built into the language server

TypeScript 3,424 120 Updated Jun 17, 2026

12306接口抢票

Python 41 15 Updated Sep 19, 2025

Edit, preview and share mermaid charts/diagrams. New implementation of the live editor.

TypeScript 6,614 1,123 Updated Jun 19, 2026

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 21,267 2,620 Updated Jun 16, 2026

Awesome speech/audio LLMs, representation learning, and codec models

1,230 75 Updated Jun 1, 2026

The hub for audio AI research: papers, open models, benchmarks & datasets across audio LLMs, speech recognition, TTS, music & audio generation.

Python 934 48 Updated Jun 15, 2026

Your one-stop solution for voice dataset creation

Python 130 24 Updated Dec 10, 2023

Towards Human-Sounding Speech

Python 6,197 529 Updated Dec 5, 2025

SOTA Open Source TTS

Python 30,868 2,637 Updated Jun 9, 2026

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 512 68 Updated Dec 22, 2025

Text Normalization & Inverse Text Normalization

Python 784 111 Updated Jun 15, 2026

基于SparkTTS、OrpheusTTS等模型,提供高质量中文语音合成与声音克隆服务。

Python 609 77 Updated May 18, 2025

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 1,146 88 Updated Dec 23, 2024

Official code for "EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting"

Python 115 13 Updated Oct 16, 2025

How to use our public wav2vec2 dimensional emotion model

Jupyter Notebook 549 51 Updated May 22, 2023

This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey".

256 14 Updated Jun 19, 2026
Next