kaldi-asr/kaldi is the official location of the Kaldi project.
Updated Sep 22, 2025 - Shell
Offline private voice assistant for many human languages
Phonetisaurus G2P
Kaldi-based Korean ASR (한국어 음성인식) open-source project
Dockerfile for compiling Kaldi for Android.
A list of publicly available audio data that anyone can download for ASR or other speech activities
How to create your own model for Vosk
Code related to the Dutch instance and user groups of the KALDI speech recognition toolkit
Code samples to get started quickly with Symbl's Voice SDK and APIs: Node.js, JavaScript, WebSockets, & PSTN.
Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux
Top level code to transcribe English audio/video files into text/subtitles
Non-blocking Asterisk modules for accessing VoiceKit services for speech recognition and speech synthesis.
Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp OFFLINE. Speak with local LLMs via llama.cpp.
This repository contains my attempt to use two well-known speech recognition frameworks (Kaldi, CMU Sphinx4) for the Arabic language, using the publicly available dataset "Arabic Corpus of Isolated Words"
Long audio alignment using Kaldi
State-of-the-art offline (or networked) voice typing everywhere + text terminals (Linux or WFL session on Windows) with a simple bash script. Usable with X. Does not require X.
☕🇧🇷 Kaldi scripts for Brazilian Portuguese
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
Video Summarization - Summarizes a video lecture and converts it to a slideshow using speech-to-text, keyword extraction, and OpenCV shot detection.