#

asr

Here are 652 public repositories matching this topic...

BScUniversityCollaborations / automatic-speech-recognition

Created an ASR (Automatic Speech Recognition) system that takes in individual recordings. Each recording represents a sentence composed of 5-10 English language digits, separated by adequate pauses. The system involves segmenting the sentence using a classifier, differentiating between background and foreground sounds.

python classifier automatic-speech-recognition asr openslr mel-spectrogram recognition-algorithms

Updated Sep 12, 2023
Python

Forced-Alignment-and-Vowel-Extraction / fave-asr

Interface for automated transcription and time alignment of conversational interview data

linguistics asr sociolinguistics

Updated Apr 22, 2024
Python

TIAGo-WE-COBOT / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation.

hri asr ros-noetic

Updated Mar 4, 2025
Python

udit-rawat / whisper-space

An ASR Gradio GUI based project that transcript the audion and provides NLP based analysis.

nlp spacy gradio whisper asr

Updated Jul 27, 2024
Python

anilkeshwani / speech-text-alignment

Functionality for speech data processing including time alignment, encoding with speech encoders (tokenizers) and data preprocessing of common datasets

speech speech-recognition data-pipeline asr hubert uroman

Updated Sep 19, 2025
Python

FlosMume / AI-Research-Assistant-Starter

Prototype of an intelligent research agent capable of literature retrieval, summarization, and contextual reasoning — a foundation for scientific automation tools.

python text-to-speech asr document-summarization rag fastapi ai-agent llm retrieval-augmented-generation knowledge-retrieval multimodal-ai research-assistan

Updated Oct 20, 2025
Python

HeyHera / Hera

This project presents Hera, an Operating System level voice recognition package that understands voice commands to perform actions to simplify the user’s workflow. We propose a modernistic way of interacting with Linux systems, where the latency of conventional physical inputs are minimized through the use of natural language speech recognition.

python scikit-learn nlu spacy kivy tts asr wake-word-detection sgd-classifier vosk nix-tts

Updated Jul 12, 2022
Python

Ailurus1 / ASR-bot

ASR telegram assistant for voice/video messages transcribing

telegram-bot transformers speech-recognition speech-to-text whisper asr

Updated Dec 19, 2024
Python

Sang-Buster / AeroLex-Editor

A powerful web-based editor for transcription and subtitle files with real-time audio/video sync capabilities

aviation nlp asr llm

Updated Apr 26, 2025
Python

thibault-roux / metric-evaluator

Metric evaluator for Automatic Speech Recognition using the HATS dataset

evaluation speech speech-recognition speech-to-text metric speech-processing evaluation-metric asr

Updated Apr 22, 2025
Python

cucumberian / gigaam-api

api for speach recognithion with gigaam model

api stt asr russian-language fastapi sber stt-api gigaam

Updated Dec 11, 2025
Python

danielrosehill / Whisper-WPM-Background-Noise-Eval

Quick eval to try answer a question: how much does speaking pace affect WER/accuracy in ASR?

evaluations stt asr

Updated Dec 9, 2025
Python

jx1100370217 / LAS_Tensorflow_jack

Tensorflow implement of LAS model

tensorflow asr listen-attend-and-spell

Updated Feb 2, 2023
Python

chz816 / gtranscribe

Generate transcript for Google ASR json files

asr

Updated May 7, 2020
Python

ArenAcikgoz / Whisper-Alignment

Forced alignment decoder for Whisper.

speech-recognition whisper asr forced-alignment

Updated Mar 13, 2024
Python

jp1924 / transformer-transducer

Huggingface로 구현한 Transformer-Transducer

asr steraming

Updated Jul 27, 2024
Python

kaminoer / ScrAIbe-Assistant

ScrAIbe Assistant is designed to leverage Whisper for precise audio processing and local LLMs via Ollama for efficient summarization. This tool is perfect for tasks such as taking notes from team meetings or lectures, offering a secure environment where no data—be it text, audio, or otherwise—leaves your local machine.

python productivity ai self-hosted audio-recorder transcription summarizer asr llm llms whisper-ai localllm ollama llama3

Updated Apr 21, 2024
Python

soroush-zendedel / persian-asr

This project involves building a gradio website that accepts user audio input. It then transcribes the audio into Persian text and analyzes the speech to label its sentiment as positive or negative.

python machine-learning sentiment-analysis vad gradio persian-nlp asr silero-vad openai-whisper

Updated Oct 24, 2025
Python

mocomoco-inc / mocovoice-mcp-server

mocoVoice MCP Server

ai mcp voice-recognition automatic-speech-recognition asr mcp-server

Updated Jun 23, 2025
Python

tristan-mcinnis / rt-transcription-system

Real-time audio transcription monitoring system with AI-powered note generation using DeepSeek API

python shell api notes openai transcription mac-os whisper asr meeting-notes mlx qualitative-research ai-notes llm deepseek mlx-whisper realtime-transcription

Updated Aug 21, 2025
Python

Improve this page

Add a description, image, and links to the asr topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the asr topic, visit your repo's landing page and select "manage topics."