Stars

Awesome-TTS

some amazing TTS projects
122 repositories
Python 12 3 Updated Jun 19, 2023

Official implementation of the source-filter HiFiGAN vocoder

Python 267 34 Updated Jul 29, 2023

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, …

Python 328 43 Updated Sep 24, 2022

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

C# 1,001 541 Updated Jan 14, 2026

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

HTML 539 83 Updated Aug 1, 2025

feature extraction from speech signals

Jupyter Notebook 390 86 Updated Jun 15, 2025

Some tools for Text-To-Speech

Python 1 Updated Aug 21, 2023

Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Units.

Python 83 9 Updated Jan 7, 2023

Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models

Python 175 15 Updated Dec 18, 2023

code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

Python 65 5 Updated May 25, 2022

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,877 314 Updated Mar 14, 2023

Deezer source separation library including pretrained models.

Python 28,019 3,068 Updated Apr 2, 2025

Keep track of big models in audio domain, including speech, singing, music etc.

506 29 Updated Sep 26, 2024

Controllable and fast Text-to-Speech for over 7000 languages!

Python 2,163 318 Updated Jan 25, 2026

An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain

Python 656 135 Updated Apr 5, 2022

A book about Text-to-Speech (TTS) in Chinese.

TeX 615 81 Updated Apr 19, 2022

unofficial vits2-TTS implementation in pytorch

Python 546 97 Updated Mar 28, 2024

PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor

Python 280 33 Updated Jul 16, 2023

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,334 105 Updated Sep 24, 2023

Audio generation using diffusion models, in PyTorch.

Python 2,094 177 Updated Jun 12, 2023

A collection of useful audio datasets and transforms for PyTorch.

Python 144 23 Updated Feb 11, 2023

A timeline of the latest AI models for audio generation, starting in 2023!

1,913 71 Updated Jan 4, 2024

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ulti…

Python 146 19 Updated Jun 6, 2022

A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf

371 27 Updated Nov 5, 2021

Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E

Python 135 17 Updated Oct 23, 2024

Multilingual G2P in 100 languages

Jupyter Notebook 374 31 Updated May 26, 2023

Charsiu: A neural phonetic aligner.

Jupyter Notebook 329 42 Updated Sep 19, 2022

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Python 2,989 948 Updated Jul 6, 2023

Bi-level Contrastive Learning for Text-to-Speech

Python 1 1 Updated Aug 22, 2023

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/

Python 7,959 783 Updated Feb 11, 2024