Skip to content
View choiHkk's full-sized avatar

Block or report choiHkk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
167 stars written in Python
Clear filter

Voice Conversion With Just Nearest Neighbors

Python 501 70 Updated Mar 18, 2024

Pytorch implementation of the CREPE pitch tracker

Python 486 73 Updated May 16, 2025

UniSpeech - Large Scale Self-Supervised Learning for Speech

Python 472 74 Updated Apr 5, 2024

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Python 463 67 Updated Nov 17, 2022

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Python 437 68 Updated May 19, 2025

This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf

Python 405 59 Updated Apr 21, 2022

This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"

Python 398 36 Updated Feb 23, 2024

PyTorch Implementation for Paper "Emotionally Enhanced Talking Face Generation" (ICCVW'23 and ACM-MMW'23)

Python 375 30 Updated Jan 12, 2025
Python 371 49 Updated Aug 16, 2024

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Python 342 40 Updated Jul 22, 2024

A Python module for continuous wavelet spectral analysis. It includes a collection of routines for wavelet transform and statistical analysis via FFT algorithm. In addition, the module also include…

Python 335 122 Updated Oct 28, 2025

Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku

Python 335 32 Updated Aug 2, 2025

Official PyTorch implementation of Contrastive Learning of Musical Representations

Python 335 51 Updated Jul 25, 2024

PyTorch implementation of the wavelet analysis from Torrence & Compo (1998)

Python 317 59 Updated Feb 3, 2022

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Python 288 37 Updated May 16, 2025

Official implementation of SawSing (ISMIR'22)

Python 269 40 Updated Aug 28, 2022

Pitch Estimating Neural Networks (PENN)

Python 268 24 Updated Apr 2, 2025

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Python 264 47 Updated Jul 15, 2025

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Python 262 55 Updated Jan 13, 2025

Official implementation of the source-filter HiFiGAN vocoder

Python 261 34 Updated Jul 29, 2023

🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation

Python 259 33 Updated Sep 13, 2023

Official implementation of Meta-StyleSpeech and StyleSpeech

Python 252 38 Updated Feb 9, 2022

g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese

Python 242 31 Updated Jul 10, 2019

Audio transformations library for PyTorch

Python 232 28 Updated Apr 19, 2022

Unofficial implementation of NVIDIA P-Flow TTS paper

Python 230 32 Updated Dec 24, 2024

An implementation of SoftDTW for PyTorch.

Python 228 24 Updated May 8, 2020

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Python 219 46 Updated Apr 8, 2021

Korean Sentence Embedding Repository

Python 210 16 Updated Dec 1, 2024

The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022

Python 209 33 Updated Jul 14, 2022

Streaming and Fine-tuning for Chatterbox TTS

Python 208 43 Updated Jun 15, 2025