🎤 Enable seamless audio processing with "sherpa-onnx," supporting speech recognition, synthesis, and more across multiple platforms.
-
Updated
Nov 11, 2025 - C++
🎤 Enable seamless audio processing with "sherpa-onnx," supporting speech recognition, synthesis, and more across multiple platforms.
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Port of OpenAI's Whisper model in C/C++
OBS plugin for local speech recognition and captioning using AI
Speech Processing Projects
Facebook AI Research's Automatic Speech Recognition Toolkit
Fully Local Push-to-Transcribe
Speech-to-text server framework with next-gen Kaldi
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
React Native binding of whisper.cpp.
A 100% private AI voice transcription app that converts speech to text in 50+ languages. Built with Compose Multiplatform for Android & iOS using Whisper AI - no cloud uploads, all processing happens on-device for complete privacy.
Local ML voice chat using high-end models.
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.
Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2
Kroko ASR - Speech-to-text
Fork of Whisper.cpp, Much Speech Recognition.
JVM library for speech-to-text recognition, written in Kotlin and based on the C++ library whisper.cpp
A module for Garry's Mod that provides speech recognition interfaces to developers.
Real-time speech recognition using CNN
🗣️ Arduino voice-controlled robot that uses speech recognition module and motor driver for autonomous movement.
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."