Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)
Updated Nov 3, 2025 - HTML
whisper-cpp-serve: Real-time speech recognition and C++ port of OpenAI's Whisper model in C/C++
A TypeScript Chrome extension that uses Deepgram to provide live transcription and translation
The dataset comprises a 5000-hour speech corpus in Akan, Ewe, Dagbani, Daagare, and Ikposo. Each language includes 1000 hours of audio from indigenous speakers of the language, of which 100 hours are transcribed.
An implementation of a DNN speech recognizer as part of the Udacity NLP NanoDegree program
A Deepgram Flux Audio Streaming Demo
WhisperX ASR is a FastAPI-based application for automatic speech recognition. It transcribes audio files to text using WhisperX, supports multiple languages, batch processing, and offers both a web UI and REST API.
Thesis Project
Speakr is a personal, self-hosted web application designed for transcribing audio recordings
Udacity NLP Nanodegree projects
IF4072 Natural Language & Text Processing
An AI-powered interactive Chinese couplet system based on FastAPI, Vue3, Whisper, and the DeepSeek API. It supports voice input, automatic generation of the couplet's second line, and intelligent scoring.
Ed-Fed: A generic federated learning framework for edge devices
Implement a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline
Automatic Speech Recognition (ASR) Assisted Online Text Editor