whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++
Updated May 9, 2024 - HTML
Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)
A TypeScript chrome extension that uses Deepgram to provide live transcription and translation
The dataset comprises a 5,000-hour speech corpus in Akan, Ewe, Dagbani, Daagare, and Ikposo. Each language includes 1,000 hours of audio from indigenous speakers of the language, of which 100 hours are transcribed.
A Deepgram Flux Audio Streaming Demo
Automatic Speech Recognition (ASR) Assisted Online Text Editor
Thesis Project
WhisperX ASR is a FastAPI-based application for automatic speech recognition. It transcribes audio files to text using WhisperX, supports multiple languages, batch processing, and offers both a web UI and REST API.
Speakr is a personal, self-hosted web application for transcribing audio recordings.
Udacity NLP Nanodegree projects
An implementation of a DNN speech recognizer as part of the Udacity NLP NanoDegree program
IF4072 Natural Language & Text Processing
Ed-Fed: A generic federated learning framework for edge devices
An AI-powered interactive Chinese couplet system based on FastAPI, Vue3, Whisper, and the DeepSeek API. Supports voice input, automatic generation of the second line of a couplet, and intelligent scoring.
Implement a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline