Skip to content
View QinHsiu's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report QinHsiu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Awesome-TTS

some amazing TTS projects
122 repositories
Python 260 37 Updated May 15, 2023

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 1,049 175 Updated Jul 5, 2023

Contrastive Language-Audio Pretraining

Python 2,126 213 Updated May 15, 2025

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,211 146 Updated Sep 5, 2024

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Python 1,230 178 Updated Feb 5, 2024

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 2,345 554 Updated Jul 27, 2024

A deep neural network architecture for low-latency audio processing

Python 323 35 Updated Aug 15, 2023

Official Implementation of StyleTTS-VC

Python 198 28 Updated Jan 14, 2025

so-vits-svc fork with realtime support, improved interface and more features.

Python 9,287 1,224 Updated Apr 27, 2026

A simple GUI application that slices audio with silence detection

Python 1,450 188 Updated Apr 5, 2026

SoftVC VITS Singing Voice Conversion

Python 28,052 5,067 Updated Nov 11, 2023
Python 1,460 186 Updated Feb 11, 2024

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,191 860 Updated Jul 6, 2024

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python 670 84 Updated Dec 27, 2023

List of speech synthesis papers.

1,072 123 Updated Jul 24, 2023

A Python wrapper for the high-quality vocoder "World"

Cython 786 126 Updated Jan 21, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 39,098 4,685 Updated Aug 19, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 98,695 12,127 Updated Apr 15, 2026

The deme page of InstructTTS

157 8 Updated Feb 10, 2024

The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"

Python 367 37 Updated Aug 3, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 42,230 4,809 Updated Apr 30, 2026

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 45,198 6,065 Updated Aug 16, 2024

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,871 264 Updated Jun 25, 2025

A library for audio and music analysis, feature extraction.

C 3,303 146 Updated Mar 6, 2026

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,948 357 Updated Jan 4, 2024

SpeechGPT Series: Speech Large Language Models

Python 1,407 96 Updated Jul 22, 2024

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,853 914 Updated Apr 23, 2024

singing voice change based on whisper, and lora for singing voice clone

Python 646 80 Updated Nov 3, 2023

PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.

Python 331 41 Updated Feb 9, 2024