kaldi-asr/kaldi is the official location of the Kaldi project.
-
Updated
Sep 22, 2025 - Shell
kaldi-asr/kaldi is the official location of the Kaldi project.
Offline private voice assistant for many human languages
Phonetisaurus G2P
Kaldi-based Korean ASR (한국어 음성인식) open-source project
A list of publically available audio data that anyone can download for ASR or other speech activities
Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp OFFLINE. Speak with local LLMs via llama.cpp.
State-of-the-art offline ( or networked) voice typing everywhere + text terminals (Linux or WFL sesson on Windows.) with a simple bash script. Usable with X. Does not require X.
Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux
How to create your own model for vosk
Code related to the Dutch instance and user groups of the KALDI speech recognition toolkit
Dockerfile for compiling Kaldi for Android.
☕🇧🇷 Scripts para o Kaldi em Português Brasileiro
the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
Non-blocking Asterisk modules for accessing VoiceKit services for speech recognition and speech synthesis.
This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).
This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"
Voice memos recorded from the microphone, transcribed offline to text and converted to Joplin notes
Scripts for training Kaldi for German speech recognition (ASR).
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."