QinHsiu

🎯

Focusing

QinHsiu QinHsiu

🎯

Focusing

Man proposes, Gad disposes.

33 followers · 158 following

13:14 (UTC -12:00)
https://qinhsiu.github.io

Achievements

Stars

Awesome-TTS

some amazing TTS projects

122 repositories

hyllll / VCRS

Python 12 3 Updated Jun 19, 2023

chomeyama / SiFiGAN

Official implementation of the source-filter HiFiGAN vocoder

Python 267 34 Updated Jul 29, 2023

keonlee9420 / Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, …

Python 328 43 Updated Sep 24, 2022

Azure-Samples / Cognitive-Speech-TTS

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

C# 1,001 541 Updated Jan 14, 2026

google / tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

HTML 539 83 Updated Aug 1, 2025

jcvasquezc / DisVoice

feature extraction from speech signals

Jupyter Notebook 390 86 Updated Jun 15, 2025

QinHsiu / TTS_tools

Some tolls for Text-To-Speech

Python 1 Updated Aug 21, 2023

b04901014 / UUVC

Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Units.

Python 83 9 Updated Jan 7, 2023

neonbjb / tts-scores

Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models

Python 175 15 Updated Dec 18, 2023

ReneeYe / ConST

code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

Python 65 5 Updated May 25, 2022

facebookresearch / denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,877 314 Updated Mar 14, 2023

deezer / spleeter

Deezer source separation library including pretrained models.

Python 28,019 3,068 Updated Apr 2, 2025

liusongxiang / Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

506 29 Updated Sep 26, 2024

DigitalPhonetics / IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

Python 2,163 318 Updated Jan 25, 2026

DemisEom / SpecAugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Python 656 135 Updated Apr 5, 2022

cnlinxi / book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

TeX 615 81 Updated Apr 19, 2022

p0p4k / vits2_pytorch

unofficial vits2-TTS implementation in pytorch

Python 546 97 Updated Mar 28, 2024

anonymous-pits / pits

PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor

Python 280 33 Updated Jul 16, 2023

lucidrains / naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,334 105 Updated Sep 24, 2023

archinetai / audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Python 2,094 177 Updated Jun 12, 2023

archinetai / audio-data-pytorch

A collection of useful audio datasets and transforms for PyTorch.

Python 144 23 Updated Feb 11, 2023

archinetai / audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

1,913 71 Updated Jan 4, 2024

keonlee9420 / Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ulti…

Python 146 19 Updated Jun 6, 2022

tts-tutorial / survey

A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf

371 27 Updated Nov 5, 2021

0913ktg / SC_VALL-E

Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E

Python 135 17 Updated Oct 23, 2024

lingjzhu / CharsiuG2P

Multilingual G2P in 100 languages

Jupyter Notebook 374 31 Updated May 26, 2023

lingjzhu / charsiu

Charsiu: A neural phonetic aligner.

Jupyter Notebook 329 42 Updated Sep 19, 2022

keithito / tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Python 2,989 948 Updated Jul 6, 2023

QinHsiu / BiCLTTS

Bi-level Cntrastive Learning for Text-to-Speech

Python 1 1 Updated Aug 22, 2023

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,959 783 Updated Feb 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QinHsiu QinHsiu

Achievements

Achievements

Block or report QinHsiu

Awesome-TTS

hyllll / VCRS

chomeyama / SiFiGAN

keonlee9420 / Comprehensive-Transformer-TTS

Azure-Samples / Cognitive-Speech-TTS

google / tacotron

jcvasquezc / DisVoice

QinHsiu / TTS_tools

b04901014 / UUVC

neonbjb / tts-scores

ReneeYe / ConST

facebookresearch / denoiser

deezer / spleeter

liusongxiang / Large-Audio-Models

DigitalPhonetics / IMS-Toucan

DemisEom / SpecAugment

cnlinxi / book-text-to-speech

p0p4k / vits2_pytorch

anonymous-pits / pits

lucidrains / naturalspeech2-pytorch

archinetai / audio-diffusion-pytorch

archinetai / audio-data-pytorch

archinetai / audio-ai-timeline

keonlee9420 / Comprehensive-E2E-TTS

tts-tutorial / survey

0913ktg / SC_VALL-E

lingjzhu / CharsiuG2P

lingjzhu / charsiu

keithito / tacotron

QinHsiu / BiCLTTS

Plachtaa / VALL-E-X