#

speech-recognition

Here are 71 public repositories matching this topic...

traceypooh / audio2text

creates text from audio of A/V input file, using docker, sphinx. extracts keywords and NLP entities. leverages OpenNews, Stanford, Oxford, CMU and more

audio docker natural-language-processing video offline entities speech-recognition transcript srt extract-keywords

Updated Mar 4, 2017
Shell

QuantiusBenignus / speedo

Speak right into your todo.txt with speedo. Set tags, priority, due dates directly from speech.

linux shell zsh cli todo command-line date natural-language speech-recognition todolist speech-to-text todotxt reminders whisper todoapp date-parser txt-based todotxt-cli

Updated Mar 29, 2023
Shell

tjysdsg / aidatatang_force_align

Perform force alignment on Mandarin data using aidatatang pretrained model at https://kaldi-asr.org/models/m10

speech-recognition chinese kaldi mandarin kaldi-asr force-alignment

Updated Jun 13, 2021
Shell

slegroux / slgKaldi

Resources for easily building ASR systems with Kaldi

speech-recognition kaldi asr diarization

Updated Nov 6, 2020
Shell

tjysdsg / ali_to_phone

Extract phone-level alignment and phonemic transcript from kaldi ali.*.gz files

speech-recognition kaldi kaldi-asr force-alignment phonemic-transcription

Updated Dec 24, 2021
Shell

orbxball / timit-preprocessor

Extract mfcc vectors and phones from TIMIT dataset

deep-learning phone speech-recognition data-preprocessing mfcc timit-dataset timit

Updated Mar 23, 2023
Shell

techiaith / docker-coqui-stt-cy

Hyfforddi a defnyddio modelau adnabod lleferydd Cymraeg coqui-stt a KenLM // Train and use coqui-stt and KenLM based Welsh language speech recognition models.

training speech api-server speech-recognition welsh cymraeg coqui-ai commonvoice

Updated Oct 25, 2022
Shell

maggieezzat / kaldi-egy-asr

A Kaldi-Recipe for Egyptian Arabic Speech Recognition

speech-recognition kaldi arabic nnet3 kaldi-asr egyptian asr-model

Updated Jan 3, 2021
Shell

cadia-lvl / althingi-asr

An ASR recipe and speech corpus of Icelandic parliamentary speeches

speech-recognition icelandic text-normalization kaldi-asr althingi

Updated Feb 24, 2021
Shell

loop333 / realtime_stream_sr

Realtime internet radio stream speech recognition with Julius & ffmpeg

shell bbc stream ffmpeg realtime cnn speech-recognition gmm julius internet-radio

Updated Nov 2, 2018
Shell

falabrasil / htk-br

Scripts para treino de modelos acústicos

speech-recognition asr htk brazilian-portuguese

Updated Oct 26, 2020
Shell

asrajeh / kaldi-arabic

HHM-based Arabic ASR using Kaldi engine

speech-recognition speech-to-text kaldi arabic asr

Updated Nov 3, 2021
Shell

begemotv2718 / recipes

speech-recognition kaldi-asr russian-support

Updated Jan 31, 2018
Shell

alifarrokh / kaldi-dnn-hmm-asr

A Kaldi recipe for training a hybrid DNN-HMM speech recognition model

speech-recognition kaldi asr dnn-hmm

Updated Sep 29, 2024
Shell

mydroidandi / commbase-recorder-transmitter-s

A voice recorder and recording transmitter for Commbase

android shell ios-app dash speech-recognition smartphone-interaction speech-to-text stt iphone-app smartphone-app voice-recorder commbase commbase-stt-whisper-reactive-p

Updated May 19, 2024
Shell

jjlee0802cu / open-set-lid

Open-set speech language identification https://arxiv.org/abs/2205.10397

pytorch speech-recognition language-identification

Updated May 26, 2022
Shell

mende237 / Nda-Nda-Force-Aligner

Forced alignment of Nda‘ Nda’ a Cameroonian language

bash deep-learning python-script dnn speech-recognition shell-script automatic-speech-recognition kaldi bash-script language-model speech-processing asr hmm-model bash-scripting cameroon asr-model speech-recognition-model nda-nda cameroon-language

Updated Jun 19, 2025
Shell

mydroidandi / commbase-recorder-transmitter-b

A voice recorder and recording transmitter for Commbase

android bash ios-app speech-recognition smartphone-interaction speech-to-text stt iphone-app smartphone-app voice-recorder commbase commbase-stt-whisper-reactive-p

Updated May 26, 2024
Shell

tjysdsg / kaldi-align-to-phones

Use kaldi pretrained nnet3 model to align individual sentences and get phone-level transcripts

speech-recognition kaldi nnet3 force-alignment phonemic-transcription

Updated Jul 1, 2021
Shell

acaibowlz / Taiwanese-ASR-with-ESPNET

Report of an end-to-end speech recognition task

kaggle speech-recognition espnet espnetv2

Updated Aug 20, 2024
Shell

Improve this page

Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."