🔍 Enhance image generation with ITO guidance for StableDiffusionXL. Test outputs seamlessly and optimize quality in a streamlined interface.
-
Updated
Dec 13, 2025 - Python
🔍 Enhance image generation with ITO guidance for StableDiffusionXL. Test outputs seamlessly and optimize quality in a streamlined interface.
Your offline, privacy-first voice assistant framework. Transform speech into commands and actions with a powerful, scriptable rule engine.
Free, open-source, real-time dictation for Windows. Runs locally (no cloud!), uses AI, and types directly into any application via a user-friendly GUI.
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
Real-time AI voice transcription in any active app on Windows OS
Advanced speech-to-text application with high-accuracy transcription and intelligent context correction. Features multiple AI backends (OpenAI GPT-4o, Groq Whisper, Anthropic Claude), screenshot-based visual context enhancement, persistent recording indicators, and seamless workflow integration.
A user-friendly voice dictation application for Linux that supports multiple languages.
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Short code for dictation using OpenAI Whisper for transcription.
VOXD is a speech-to-text, voice-typing, dictation software for linux distributions. It is an open-source, free of charge, USER-FRIENDLY software, for as many linux distros as possible.
Voice input for M-series Mac, a modification of mlx-whisper-dictation.
Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-text dictation.
GPU-accelerated speech-to-text service that types what you say, powered by OpenAI's Whisper AI
🎤 Privacy-first local speech-to-text dictation for NixOS - Whisper.cpp powered push-to-talk with real-time feedback
A nearly-live implementation of OpenAI's Whisper.
Integrate Talon voice dictation commands with TTS, screen readers, braille, and more!
Fast local push-to-talk dictation for macOS with Apple Silicon optimization
Add a description, image, and links to the dictation topic page so that developers can more easily learn about it.
To associate your repository with the dictation topic, visit your repo's landing page and select "manage topics."