Skip to content
View choiHkk's full-sized avatar

Block or report choiHkk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
19 results for sponsorable starred repositories
Clear filter

A next generation HTTP client for Python. 🦋

Python 14,706 971 Updated Oct 16, 2025

Simultaneous speech-to-text model

Python 8,269 771 Updated Oct 30, 2025

(Unofficial) Implementation of dilated attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens" (https://arxiv.org/abs/2307.02486)

Python 53 8 Updated Aug 7, 2023

WaveNet vocoder

Python 2,367 496 Updated Jul 29, 2023

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 15,189 1,704 Updated Jun 25, 2025

Sequence alignement methods with helpers for PyTorch.

Python 24 3 Updated Nov 30, 2022

Text-to-Audio/Music Generation

Python 2,515 202 Updated Sep 29, 2024

UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

Python 75 9 Updated Aug 30, 2021

A timeline of the latest AI models for audio generation, starting in 2023!

1,904 69 Updated Jan 4, 2024

Trainer for audio-diffusion-pytorch

Python 129 22 Updated Jan 13, 2023

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,760 248 Updated Jun 25, 2025

Python wrapper for OpenJTalk

Cython 236 81 Updated Apr 8, 2025

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Python 219 46 Updated Apr 8, 2021

Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku

Python 335 32 Updated Aug 2, 2025

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Python 264 47 Updated Jul 15, 2025

Audio generation using diffusion models, in PyTorch.

Python 2,080 178 Updated Jun 12, 2023

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Python 633 194 Updated May 27, 2023

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 1,082 186 Updated Dec 22, 2023

PyTorch implementation of normalizing flow models

Python 900 129 Updated Aug 25, 2024