Lists (2)
Sort Name ascending (A-Z)
Starred repositories
Monolingual Finetuning for Chatterbox Multilingual
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Fine-tune Chatterbox TTS model for custom voice cloning with multi-language support. Includes training pipeline, inference tools, and VAD-based audio processing.
chatterbox TTS + Voice Clone using onnx
Lightning-Fast, On-Device TTS — running natively via ONNX.
The development kit for over a hundred z80 family machines - c compiler, assembler, linker, libraries.
A curated list of awesome Taichi applications, courses, demos and features.
Repo containing various physics simulations in python
An open-source, GPU-accelerated physics simulation engine built upon NVIDIA Warp, specifically targeting roboticists and simulation researchers.
SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀
Continuous Thought Machines, because thought takes time and reasoning is a process.
JarodMica / index-tts
Forked from index-tts/index-ttsAn Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
VoiceHub: A Unified Inference Interface for TTS Models
SynthAether / T5Voice
Forked from MuyangDu/T5VoiceT5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech synthesis with zero-shot capabilities.
T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech synthesis with zero-shot capabilities.
Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages
Official code for"DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptation"
kyutai-labs / nanoGPTaudio
Forked from karpathy/nanoGPTCode for the blog "Neural audio codecs: how to get audio into LLMs"
Trainging, inference, and testing of the SAC speech codec model.
This is the official code for ACM CIKM 2025 Paper: ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive Text-to-Speech Generation
A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.