asr

An AI-powered interactive Chinese couplet system based on FastAPI, Vue3, Whisper, and the DeepSeek API. Supports voice input, automatic generation of the second line of a couplet, and intelligent scoring.

nlp ai speech-to-text whisper asr vue3 fastapi deepseek chinese-couplet

Updated Jun 4, 2025
HTML

tugas-itb-erick / IF4072-Pemrosesan-Teks-dan-Suara-Bahasa-Alami

IF4072 Natural Language & Text Processing

nlp speech-recognition text-recognition asr htk aspect-based-sentiment-analysis if4072

Updated Oct 25, 2020
HTML

HMByteSensei / WhisperAI-Evaluation

Comprehensive benchmark of OpenAI Whisper models for Bosnian, Croatian, and Serbian languages. Includes pipelines for audio transcription, rigorous text normalization, Levenshtein distance evaluation, and LLM-based post-processing.

python nlp benchmarking machine-learning natural-language-processing serbian levenshtein-distance speech-to-text asr bosnian croatian graph-representation wer text-normalization accuracy-evaluation openai-whisper

Updated Dec 7, 2025
HTML

chesterXalan / Web-HakkaTourGame

A web game based on Taiwanese tourism, with a Python backend.

javascript css game python html api web tourism google-maps tts asr taiwanese hakka chatgpt

Updated Jun 21, 2024
HTML

romanyn36 / whisperx-asr-with-fastapi

WhisperX ASR is a FastAPI-based application for automatic speech recognition. It transcribes audio files to text using WhisperX, supports multiple languages, batch processing, and offers both a web UI and REST API.

python cuda transformers torch speech-recognition openai whisper asr fastapi ctranslate2 whisperx

Updated Oct 27, 2025
HTML

boned-fruitwood759 / whisperx-asr-with-fastapi

🎤 Enable real-time speech recognition with WhisperX using FastAPI for efficient, scalable audio processing.

python cuda transformers torch speech-recognition openai whisper asr fastapi ctranslate2 whisperx

Updated Dec 17, 2025
HTML

tino1b2be / LARMAS

Thesis Project

nlp data django rest-api web-api web-application data-collection corpora uct resource-manager asr resource-management language-resources larmas univeristy-of-cape-town

Updated May 11, 2018
HTML

Ajmain-Inqiad / Udacity-NLP-nanodegree

python nlp course machine-translation speech-recognition udacity-nanodegree asr vui hmm-tagger

Updated May 7, 2020
HTML

j-ranasinghe / speech-to-text-webapp

A lightweight automatic speech recognition (ASR) and speech translation web application

lightweight webapp speech-to-text asr

Updated Apr 19, 2024
HTML

praveenmunagapati / speechrecognitionoffline

tfjs

asr tfjs

Updated Apr 7, 2020
HTML

jnatale11 / Audio-Transcriber

Automatic Speech Recognition (ASR) Assisted Online Text Editor

audio-analysis audio-streaming text-editor asr

Updated Sep 14, 2017
HTML

deepgram-devs / deepgram-demos-flux-streaming

nodejs streaming speech-to-text asr deepgram voice-agent

Updated Nov 27, 2025
HTML

biraj21 / llm-server-from-scratch

FastAPI server for locally serving Gemma 3 270M & OpenAI Whisper with batched inference and streaming support.

transformers inference whisper gemma asr llm

Updated Sep 25, 2025
HTML

HCI-LAB-UGSPEECHDATA / speech_data_ghana_ug

The dataset comprises a 5,000-hour speech corpus in Akan, Ewe, Dagbani, Daagare, and Ikposo. Each language includes 1,000 hours of audio from indigenous speakers of the language, of which 100 hours are transcribed.

data-science data ml tts ug asr ghana llm legon ugspeechdata

Updated May 2, 2025
HTML

asr

Here are 24 public repositories matching this topic...

cuiyuheng / speakr

FuningC / NLP-projects

foteinipapadopoulou / ASR-bias

ammarasmro / DNN-speech-recognizer

zithasasindran / Ed-fed-FL-framework

tnakatani / dnn_speech_recognition

zyovo-0829 / LingxiCouplet
