-
piper1-gpl Public
Forked from OHF-Voice/piper1-gpl
Fast and local neural text-to-speech engine
C++ GNU General Public License v3.0 Updated Feb 5, 2026 -
Tacotron Public
Forked from bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Python MIT License Updated Dec 23, 2025 -
music2latent Public
Forked from SonyCSLParis/music2latent
Encode and decode audio samples to/from compressed latent representations!
Python Other Updated Sep 19, 2025 -
min-sCM Public
Minimal implementation of simple, stable, and scalable consistency models (2410.11081)
-
HeavyBall Public
Forked from HomebrewML/HeavyBall
Efficient optimizers
Python BSD 2-Clause "Simplified" License Updated Jun 5, 2025 -
llmdifftracker Public
Forked from fal-ai-community/llmdifftracker
Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)
Python MIT License Updated May 18, 2025 -
minRF Public
Forked from cloneofsimo/minRF
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
Jupyter Notebook Apache License 2.0 Updated Apr 9, 2025 -
fastrtc Public
Forked from gradio-app/fastrtc
The python library for real-time communication
Python MIT License Updated Mar 11, 2025 -
modded-nanogpt Public
Forked from KellerJordan/modded-nanogpt
NanoGPT (124M) in 3 minutes
Python MIT License Updated Mar 9, 2025 -
VisoMaster Public
Forked from visomaster/VisoMaster
Powerful & Easy-to-Use Video Face Swapping and Editing Software
Python GNU General Public License v3.0 Updated Feb 18, 2025 -
BS-RoFormer Public
Forked from lucidrains/BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
Python MIT License Updated Dec 20, 2024 -
lingua Public
Forked from facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Python BSD 3-Clause "New" or "Revised" License Updated Dec 12, 2024 -
nano-simsiam Public
Forked from lucas-maes/nano-simsiam
Minimalistic, hackable PyTorch implementation of SimSiam in ~400 lines. Achieves good performance on ImageNet with ResNet50. Features distributed training, real-time KNN eval, and AMP. Perfect for …
Python MIT License Updated Nov 25, 2024 -
wavehax Public
Forked from chomeyama/wavehax
Official repository of Wavehax vocoder
Python Updated Nov 23, 2024 -
vec2wav2.0 Public
Forked from cantabile-kwok/vec2wav2.0
Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995
Python GNU General Public License v3.0 Updated Nov 11, 2024 -
Amphion Public
Forked from open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Jupyter Notebook MIT License Updated Nov 1, 2024 -
minbpe Public
Forked from karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Python MIT License Updated Aug 28, 2024 -
MP-SENet Public
Forked from yxlu-0102/MP-SENet
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
Python MIT License Updated Aug 16, 2024 -
HiFi-GAN-TorToiSe Public archive
Replace the diffusion + vocoder with a single GAN
-
lhotse Public
Forked from lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
Python Apache License 2.0 Updated Jul 22, 2024 -
x-transformers Public
Forked from lucidrains/x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers
Python MIT License Updated Jul 20, 2024 -
e2-tts-pytorch Public
Forked from lucidrains/e2-tts-pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
Python MIT License Updated Jul 16, 2024 -
descript-audio-codec Public
Forked from descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Python MIT License Updated Jul 11, 2024 -
flashT5 Public
Forked from catie-aq/flashT5
A fast implementation of T5/UL2 in PyTorch using Flash Attention
Python Apache License 2.0 Updated Apr 16, 2024 -
descript-audio-vae Public
Forked from innnky/descript-audio-vae
VAE modified from Descript Audio Codec, which replaces the RVQ with VAE
Python MIT License Updated Apr 2, 2024 -
tacotron2 Public
Forked from NVIDIA/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Feb 25, 2024