Skip to content
View choiHkk's full-sized avatar

Block or report choiHkk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
17 results for sponsorable starred repositories written in Python
Clear filter

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 15,189 1,704 Updated Jun 25, 2025

A next generation HTTP client for Python. 🦋

Python 14,716 974 Updated Oct 16, 2025

Simultaneous speech-to-text model

Python 8,297 778 Updated Nov 6, 2025

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,760 248 Updated Jun 25, 2025

Text-to-Audio/Music Generation

Python 2,516 202 Updated Sep 29, 2024

WaveNet vocoder

Python 2,367 496 Updated Jul 29, 2023

Audio generation using diffusion models, in PyTorch.

Python 2,080 178 Updated Jun 12, 2023

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 1,084 186 Updated Dec 22, 2023

PyTorch implementation of normalizing flow models

Python 900 129 Updated Aug 25, 2024

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Python 633 194 Updated May 27, 2023

Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku

Python 335 32 Updated Aug 2, 2025

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Python 264 47 Updated Jul 15, 2025

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Python 219 46 Updated Apr 8, 2021

Trainer for audio-diffusion-pytorch

Python 129 22 Updated Jan 13, 2023

UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

Python 75 9 Updated Aug 30, 2021

(Unofficial) Implementation of dilated attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens" (https://arxiv.org/abs/2307.02486)

Python 53 8 Updated Aug 7, 2023

Sequence alignement methods with helpers for PyTorch.

Python 24 3 Updated Nov 30, 2022