Skip to content
View GuodongQi's full-sized avatar
🎯
Focusing
🎯
Focusing
  • ZheJiang University

Block or report GuodongQi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Train the next generation of TTS systems.

Python 171 17 Updated Sep 13, 2024

Fast and memory-efficient exact attention

Python 22,298 2,391 Updated Feb 18, 2026

A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech

Python 869 57 Updated Feb 13, 2026

Open-Source Frontier Voice AI

Python 23,336 2,560 Updated Feb 7, 2026

The best ChatGPT that $100 can buy.

Python 43,658 5,695 Updated Feb 19, 2026

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 55,093 6,019 Updated Feb 9, 2026

pyright fork with various type checking improvements, improved vscode support and pylance features built into the language server

TypeScript 3,127 109 Updated Feb 19, 2026

12306接口抢票

Python 40 13 Updated Sep 19, 2025

Edit, preview and share mermaid charts/diagrams. New implementation of the live editor.

TypeScript 6,195 1,023 Updated Feb 19, 2026

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 18,840 2,328 Updated Dec 2, 2025

Awesome speech/audio LLMs, representation learning, and codec models

1,209 75 Updated Aug 13, 2025

Audio Large Language Models

Python 873 44 Updated Jul 5, 2025

Your one-stop solution for voice dataset creation

Python 129 23 Updated Dec 10, 2023

Towards Human-Sounding Speech

Python 5,949 508 Updated Dec 5, 2025

SOTA Open Source TTS

Python 24,920 2,074 Updated Feb 2, 2026

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 505 66 Updated Dec 22, 2025

Text Normalization & Inverse Text Normalization

Python 726 97 Updated Feb 3, 2026

基于SparkTTS、OrpheusTTS等模型,提供高质量中文语音合成与声音克隆服务。

Python 587 76 Updated May 18, 2025

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 1,058 82 Updated Dec 23, 2024

Official code for "EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting"

Python 109 12 Updated Oct 16, 2025

How to use our public wav2vec2 dimensional emotion model

Jupyter Notebook 539 51 Updated May 22, 2023

This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey".

209 12 Updated Feb 10, 2026

Added vLLM support to IndexTTS for faster inference.

Python 1,058 137 Updated Oct 24, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 19,634 2,215 Updated Feb 11, 2026

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 2,159 269 Updated Feb 12, 2026

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,846 200 Updated Oct 4, 2025

A bot for automatic First Lady job in Last War mobile game

Java 28 12 Updated Apr 7, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,641 232 Updated Jun 17, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 4,966 331 Updated Feb 14, 2026

No fortress, purely open ground. OpenManus is Coming.

Python 54,530 9,542 Updated Feb 11, 2026
Next