choiHkk

Follow

choihk choiHkk

Follow

SpeechSynthesis

70 followers · 20 following

Seoul
choihk6610@gmail.com

Achievements

Achievements

Stars

170 stars written in Python

OpenBMB / VoxCPM

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 2,064 221 Updated Oct 9, 2025

ronghuaiyang / arcface-pytorch

Python 1,872 397 Updated Dec 9, 2021

QwenLM / Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,821 135 Updated Jul 5, 2024

bootphon / phonemizer

Simple text to phones converter for multiple languages

Python 1,478 193 Updated Sep 26, 2024

microsoft / NeuralSpeech

Python 1,453 185 Updated Feb 11, 2024

kakao / khaiii

Kakao Hangul Analyzer III

Python 1,449 297 Updated Sep 1, 2025

microsoft / SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,407 131 Updated Apr 24, 2024

lucidrains / naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,332 105 Updated Sep 24, 2023

marl / crepe

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

Python 1,304 171 Updated Aug 19, 2024

ShannonAI / service-streamer

Boosting your Web Services of Deep Learning Applications.

Python 1,244 189 Updated May 13, 2021

Anemll / Anemll

Artificial Neural Engine Machine Learning Library

Python 1,244 44 Updated Sep 2, 2025

Renovamen / Speech-Emotion-Recognition

Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别

Python 1,242 227 Updated Mar 25, 2023

lessw2020 / Ranger-Deep-Learning-Optimizer

Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase

Python 1,206 176 Updated Dec 22, 2023

lucidrains / performer-pytorch

An implementation of Performer, a linear attention-based transformer, in Pytorch

Python 1,156 148 Updated Feb 2, 2022

SHI-Labs / Neighborhood-Attention-Transformer

Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022

Python 1,155 89 Updated May 15, 2024

clovaai / voxceleb_trainer

In defence of metric learning for speaker recognition

Python 1,143 286 Updated Mar 26, 2024

NVIDIA / BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,140 141 Updated Sep 5, 2024

KinWaiCheuk / nnAudio

Audio processing by using pytorch 1D convolution network

Python 1,095 96 Updated May 16, 2025

sooftware / conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 1,085 186 Updated Dec 22, 2023

VincentStimper / normalizing-flows

PyTorch implementation of normalizing flow models

Python 903 129 Updated Aug 25, 2024

csteinmetz1 / auraloss

Collection of audio-focused loss functions in PyTorch

Python 819 72 Updated Jul 30, 2024

TaoRuijie / ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

Python 748 131 Updated Apr 11, 2024

csteinmetz1 / pyloudnorm

Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm

Python 734 57 Updated Jul 2, 2024

Maghoumi / pytorch-softdtw-cuda

Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch

Python 718 65 Updated Apr 3, 2024

speechio / chinese_text_normalization

Chinese text normalization for speech processing

Python 712 149 Updated Mar 18, 2023

jaywalnut310 / glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Python 698 155 Updated Jul 12, 2022

OlaWod / FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Python 691 120 Updated Jan 19, 2025

sp-uhh / sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 680 99 Updated Aug 22, 2025

sooftware / kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Python 633 194 Updated May 27, 2023

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 619 61 Updated Jun 9, 2024