Highlights
- Pro
Stars
Kotlin Multiplatform Mobile Text-to-speak SDK for using either native TTS or Supertonic
100% in-browser, hands-free AI voice chat using Whisper, WebLLM, and Supertonic TTS
Lightning-Fast, On-Device TTS β running natively via ONNX.
Python library for audio and music analysis
Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages
Lightning-Fast, On-Device, Multilingual TTS β running natively via ONNX.
MFCC-based LipSync plug-in for Unity using Job System and Burst Compiler
TTS model capable of streaming conversational audio in realtime.
Plug-and-play TTS integration toolkit powered by Kokoro-82M. Python + CLI interface. Lightweight, open-source, and ready for real-world use.
Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
Text-audio foundation model from Boson AI
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
π¬ Chat with anyone on any website.
(NeurIPS 2024 Oral π₯) Improved Distribution Matching Distillation for Fast Image Synthesis
[ICLR 2024] Continual Momentum Filtering on Parameter Space for Online Test-time Adaptation.
An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)
Generative models for conditional audio generation