A fast, local neural text-to-speech system
Robust Speech Recognition via Large-Scale Weak Supervision
Speech-to-text, text-to-speech, and speaker recognition
Speech to Text to Speech, sends text as OSC messages
A free, open source, and extensible speech-to-text application
Comprehensive Gradio WebUI for audio processing
A modern ebook manager and reader with sync and backup
Speech recognition module for Python
A deep learning toolkit for Text-to-Speech, battle-tested in research
Browser extension and cross-platform desktop app based on ChatGPT API
High-quality multi-lingual text-to-speech library by MyShell.ai
Featuring powerful AI capabilities and supporting e-book formats
Subtitle Creation Assistant
Toolkit for conversational AI
Transcribe any audio to text, translate and edit subtitles 100% locally
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
The behavior guidance framework for customer-facing LLM agents
Every programmer needs a rubberduck. COM add-in for the VBA & VB6 IDE
State-of-the-art TTS model under 25MB
Anki flashcards on Android
A block-style editor with clean JSON output
A generative speech model for daily dialogue
Models for the spaCy Natural Language Processing (NLP) library
Open source personal AI Assistant for Linux, Windows and Mac
Repo of Qwen2-Audio chat & pretrained large audio language model