whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++
Updated May 9, 2024 - HTML
Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)
A speaker-diarization-based speech recognition API implemented using FunASR.
MLX Local Serving (MLS) - Unified ASR, TTS, and Translation on Apple Silicon
A TypeScript Chrome extension that uses Deepgram to provide live transcription and translation
The dataset comprises a 5,000-hour speech corpus in Akan, Ewe, Dagbani, Daagare, and Ikposo. Each language includes 1,000 hours of audio from indigenous speakers of the language, of which 100 hours are transcribed.
Automatic Speech Recognition (ASR) Assisted Online Text Editor
WhisperX ASR is a FastAPI-based application for automatic speech recognition. It transcribes audio files to text using WhisperX, supports multiple languages, batch processing, and offers both a web UI and REST API.
🎤 Enable real-time speech recognition with WhisperX using FastAPI for efficient, scalable audio processing.
Thesis Project
A lightweight automatic speech recognition (ASR) and speech translation web application
An AI-powered interactive Chinese couplet system based on FastAPI, Vue3, Whisper, and DeepSeek API. Supports voice input, automatic couplet generation, and intelligent evaluation.
GPU-accelerated Japanese → English subtitle generator using faster-whisper and Streamlit
Speakr is a personal, self-hosted web application designed for transcribing audio recordings