Skip to content
View Hongjiang-Yu's full-sized avatar

Block or report Hongjiang-Yu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 12,105 1,564 Updated Mar 17, 2026

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Python 31,345 3,536 Updated Jun 10, 2026
Python 28 1 Updated Apr 6, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Python 133,852 21,648 Updated Jun 22, 2026

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,886 192 Updated Aug 27, 2025

python wrapper for rubberband

Python 218 28 Updated Sep 30, 2024

Breeze ASR 25 是一款先進的自動語音辨識(ASR)模型,基於 Whisper-large-v2 微調而成,特別針對台灣華語以及華語與英語混用的情境進行優化。Breeze ASR 25 is an advanced ASR model fine-tuned from Whisper-large-v2, optimized for Taiwanese Mandarin and Man…

Python 152 12 Updated Jul 1, 2025

This repository focuses on leveraging OpenAI's Whisper model for speech recognition in Chinese (Mandarin) and Taiwanese Hokkien languages. It includes tools and scripts for data preprocessing, mode…

Python 72 11 Updated Mar 1, 2025

SpeechGPT Series: Speech Large Language Models

Python 1,404 96 Updated Jul 22, 2024

Controllable and fast Text-to-Speech for over 7000 languages!

Python 2,203 318 Updated Jan 25, 2026

[IJCAI'23] Learning to Speak from Text for Low-Resource TTS

Python 65 4 Updated May 30, 2023

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 58,960 6,441 Updated Jun 20, 2026

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 39,162 4,675 Updated Aug 19, 2024

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

Python 12,952 3,037 Updated May 23, 2026

中文公开聊天语料库

Python 4,193 778 Updated Apr 23, 2024
Python 40 7 Updated Jan 26, 2026

An efficient speech separation method

Python 276 30 Updated Apr 11, 2024

Official repository of SepReformer for speech separation

Python 259 41 Updated May 14, 2026

GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning

Python 1,026 129 Updated Apr 10, 2026
Python 18 1 Updated Nov 19, 2025

A simple implementation for improving CosyVoice2 by GRPO method

Python 38 1 Updated May 5, 2026

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,624 1,958 Updated Jun 21, 2026

Text-audio foundation model from Boson AI

Python 8,234 634 Updated Jun 5, 2026

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 655 42 Updated Jun 23, 2026

Easy-to-Use Speech MOS predictors

Python 358 18 Updated Oct 24, 2023

Inference and training library for high-quality TTS models.

Python 5,578 590 Updated Dec 10, 2024

Official code for"DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptation"

Python 239 21 Updated Nov 28, 2025

Tracking the progress in end-to-end speech translation

260 25 Updated Oct 25, 2023

Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (ICASSP 2026)

JavaScript 71 4 Updated Apr 27, 2026

Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs

Jupyter Notebook 6 Updated Jan 15, 2024
Next