Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
-
Updated
May 25, 2026 - Python
Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
🔊 Kokoro Web: Free AI text-to-speech, online or self-hosted, OpenAI compatible!
🎙️ VoxSherpa TTS Offline Neural Text-to-Speech Engine for Android ⚡ Sherpa-ONNX powered 🔊 Natural voice synthesis 📱 Fully offline processing 🚀 No cloud • No limits
Natural-sounding Text-to-Speech App that fits anywhere. Fast, Real-Time and flexible.
From-scratch voice agents in Python: end-to-end speech pipelines, runnable chapters, and a small shared library. Local models, explicit streaming behavior.
A Docker container for running Kokoro Text-to-Speech engine v.1, providing high-quality speech synthesis
A Python package that makes it easy to use the Kokoro voice synthesis library.
TTS toolkit built on Kokoro-82M with librosa audio enhancement, MCP server for Claude/Cursor, CLI & Python API. Free & open-source for YouTube creators.
Production-ready RunPod serverless endpoint for Kokoro TTS. Features high-quality text-to-speech, voice mixing, word-level timestamps, and phoneme generation. Optimized for fast cold starts and auto-scaling.
This tool allows users to create Anki cards with words, meanings, examples, and IPA pronunciations, and convert text to speech for audio files.
An advanced AI mental health assistant that combines voice interaction, fine-tuned psychology models, and intelligent knowledge retrieval to provide comprehensive psychological support.
🚀 Aperture is a modern, feature-rich desktop EPUB reader built with Python 🐍 and PyQt6. It focuses on a clean reading experience ✨ and powerful, integrated Kokoro Text-to-Speech (TTS) capabilities 🗣️🔊
📚 Index and enrich your PDFs and Markdown files locally for a powerful, unified knowledge base with semantic search capabilities.
A powerful, local-first AI orchestration layer that unifies advanced LLM reasoning, real-time voice synthesis, and local system automation. Featuring a Flask-based web interface, this system uses a smart Gemini API rotation strategy with a Groq fallback.
create audio books from pdfs with one click , available on windows , linux, mac
TTS Fast Web,一个简单优雅的本地文字转语音的前端与API接口。A localized, cross-platform, multi-language supported, OpenAI API format compatible, full-stack, ready-to-deploy TTS (Text to Speech) model
Offline Kokoro-82M text-to-speech for Python — library, CLI, and a unix-socket daemon for ~13ms speech from shell scripts. Apache 2.0, CPU real-time, macOS and Linux.
A lightweight, offline Rust inference library for Kokoro TTS - an 82M-parameter open-weights text-to-speech model.
Text-to-speech web application built with React, FastAPI, and Kokoro-82M. Runs locally via start.bat.
Local Kokoro-82M text-to-speech CLI
Add a description, image, and links to the kokoro-82m topic page so that developers can more easily learn about it.
To associate your repository with the kokoro-82m topic, visit your repo's landing page and select "manage topics."