- San Francisco, CA
-
rectified-flow-pytorch Public
Forked from lucidrains/rectified-flow-pytorch
Implementation of rectified flow and some of its followup research / improvements in Pytorch
Python MIT License Updated Dec 20, 2025
-
mlx-audio Public
Forked from Blaizzy/mlx-audio
A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.
Python MIT License Updated Dec 17, 2025
-
nanospeech Public
A simple, hackable text-to-speech system in PyTorch and MLX
-
f5-tts-mlx Public
Implementation of F5-TTS in MLX
-
vector-quantize-pytorch Public
Forked from lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
-
f5-tts-swift Public
Implementation of F5-TTS in Swift using MLX
-
mlx Public
Forked from ml-explore/mlx
MLX: An array framework for Apple silicon
C++ MIT License Updated Dec 11, 2024
-
vocos-mlx Public
Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX
-
descript-mlx Public
Implementation of the Descript Audio Codec in MLX
-
mlx-swift Public
Forked from ml-explore/mlx-swift
Swift API for MLX
Swift MIT License Updated Oct 17, 2024
-
vocos-swift Public
Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in Swift using MLX
-
e2-tts-pytorch Public
Forked from lucidrains/e2-tts-pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
-
mlx-data Public
Forked from ml-explore/mlx-data
Efficient framework-agnostic data loading
C++ MIT License Updated Oct 8, 2024
-
e2-tts-mlx Public
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX
-
spear-tts-pytorch Public
Forked from lucidrains/spear-tts-pytorch
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch
-
voicebox-pytorch Public
Forked from lucidrains/voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
-
exporters Public
Forked from huggingface/exporters
Export Hugging Face models to Core ML and TensorFlow Lite
Python Apache License 2.0 Updated Oct 11, 2023
-
best-rq-pytorch Public
Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.
-
soundstorm-pytorch Public
Forked from lucidrains/soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Python MIT License Updated Aug 24, 2023
-
audiolm-pytorch Public
Forked from lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch