Skip to content
View QinHsiu's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report QinHsiu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Awesome-TTS

some amazing TTS projects
122 repositories
Python 257 37 Updated May 15, 2023

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 1,026 171 Updated Jul 5, 2023

Contrastive Language-Audio Pretraining

Python 1,941 198 Updated May 15, 2025

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,157 143 Updated Sep 5, 2024

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Python 1,221 176 Updated Feb 5, 2024

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 2,284 544 Updated Jul 27, 2024

A deep neural network architecture for low-latency audio processing

Python 321 35 Updated Aug 15, 2023

Official Implementation of StyleTTS-VC

Python 193 27 Updated Jan 14, 2025

so-vits-svc fork with realtime support, improved interface and more features.

Python 9,208 1,226 Updated Dec 19, 2025

A simple GUI application that slices audio with silence detection

Python 1,416 185 Updated Jul 29, 2024

SoftVC VITS Singing Voice Conversion

Python 27,862 5,076 Updated Nov 11, 2023
Python 1,455 185 Updated Feb 11, 2024

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,213 864 Updated Jul 6, 2024

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python 661 80 Updated Dec 27, 2023

List of speech synthesis papers.

1,060 123 Updated Jul 24, 2023

A Python wrapper for the high-quality vocoder "World"

Cython 774 125 Updated Jan 21, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,839 4,674 Updated Aug 19, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 92,169 11,547 Updated Dec 15, 2025

The deme page of InstructTTS

158 8 Updated Feb 10, 2024

The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"

Python 364 36 Updated Aug 3, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,043 4,669 Updated Dec 19, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,938 5,858 Updated Aug 16, 2024

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,790 252 Updated Jun 25, 2025

A library for audio and music analysis, feature extraction.

C 3,229 149 Updated May 24, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,863 342 Updated Jan 4, 2024

SpeechGPT Series: Speech Large Language Models

Python 1,396 95 Updated Jul 22, 2024

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,841 923 Updated Apr 23, 2024

singing voice change based on whisper, and lora for singing voice clone

Python 643 78 Updated Nov 3, 2023

PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.

Python 329 41 Updated Feb 9, 2024