asr

Here are 1,442 public repositories matching this topic...

lizunowa / project-asr-metrics

🧑🏻‍🎓 📑 October 2020 to April 2021. Group university project on a Speech-to-Text Assessment Tool. It is a research-type project; most of the documentation is in a private GitLab repository.

asr asr-benchmark

Updated Jun 8, 2021
Jupyter Notebook

BScUniversityCollaborations / automatic-speech-recognition

Created an ASR (Automatic Speech Recognition) system that takes in individual recordings. Each recording represents a sentence composed of 5-10 English language digits, separated by adequate pauses. The system involves segmenting the sentence using a classifier, differentiating between background and foreground sounds.

python classifier automatic-speech-recognition asr openslr mel-spectrogram recognition-algorithms

Updated Sep 12, 2023
Python

Forced-Alignment-and-Vowel-Extraction / fave-asr

Interface for automated transcription and time alignment of conversational interview data

linguistics asr sociolinguistics

Updated Apr 22, 2024
Python

kingabzpro / hindiSpeechPro-Automatic-Speech-Recognization

This project, the final project for the KaggleX BIPOC Mentorship Program, aims to train two separate Hindi ASR models using the Facebook Wav2Vec2 (300M parameters) and OpenAI Whisper-Small models, respectively. The goal is to compare their performance across various Hindi accents and dialects, with a target word error rate (WER) below 13%.

transformer speech-recognition whisper asr hindi-language wav2vec2

Updated Nov 18, 2023
Jupyter Notebook

TIAGo-WE-COBOT / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation.

hri asr ros-noetic

Updated Mar 4, 2025
Python

alekseevskaia / audio_attack

asr adversarial-attacks carlini-wagner

Updated Jan 29, 2024
Jupyter Notebook

marks038 / Test

Test Repo

test test1 calculators asr

Updated Feb 23, 2024

Nexdata-AI / 162-Hours-French-Children-Spontaneous-Speech-Data

French Child's Spontaneous Speech Data

audio machine-translation speech-recognition asr children-speech

Updated Aug 8, 2024

Nexdata-AI / 87166-Minnan-Dialect-Pronunciation-Dictionary

Dialect-Pronunciation-Dictionary

text lexicon speech-to-text pronunciation-dictionary asr

Updated Aug 8, 2024

koudounasalkis / CSI-MIT

This repo contains the code for "Privacy Preserving Data Selection for Bias Mitigation in Speech Models"

privacy speech-recognition asr data-selection spoken-language-understanding bias-mitigation

Updated May 22, 2025
Jupyter Notebook

udit-rawat / whisper-space

A Gradio GUI-based ASR project that transcribes audio and provides NLP-based analysis.

nlp spacy gradio whisper asr

Updated Jul 27, 2024
Python

Nexdata-AI / 557-Hours-Kazakh-Spontaneous-Speech-Data

557-Hours-Kazakh-Spontaneous-Speech-Data

speech-recognition speech-to-text asr kazakh spontaneous-speech-recognition

Updated Aug 8, 2024

anilkeshwani / speech-text-alignment

Functionality for speech data processing including time alignment, encoding with speech encoders (tokenizers) and data preprocessing of common datasets

speech speech-recognition data-pipeline asr hubert uroman

Updated Sep 19, 2025
Python

FlosMume / AI-Research-Assistant-Starter

Prototype of an intelligent research agent capable of literature retrieval, summarization, and contextual reasoning — a foundation for scientific automation tools.

python text-to-speech asr document-summarization rag fastapi ai-agent llm retrieval-augmented-generation knowledge-retrieval multimodal-ai research-assistan

Updated Oct 20, 2025
Python

Nexdata-AI / 500-Hours-Korean-Conversational-Speech-Data-by-Mobile-Phone

The dataset of Korean conversational speech

audio machine-learning text-to-speech deep-learning dataset wav speech-recognition automatic-speech-recognition speech-to-text speech-processing asr asr-model

Updated Aug 8, 2024

Kaljurand / ai-ai-ai

Text and speech AI playground. The first version was vibe-coded with OpenAI Codex in the browser in ~10 days of 2 h/day sessions.

text tts estonian asr llm

Updated Aug 7, 2025
JavaScript

HeyHera / Hera

This project presents Hera, an operating-system-level voice recognition package that understands voice commands and performs actions to simplify the user's workflow. We propose a modern way of interacting with Linux systems, where the latency of conventional physical inputs is minimized through the use of natural language speech recognition.

python scikit-learn nlu spacy kivy tts asr wake-word-detection sgd-classifier vosk nix-tts

Updated Jul 12, 2022
Python

Nexdata-AI / 203-Hours-Korean-Medical-Entities-Real-world-Casual-Conversation-and-Monologue-speech-dataset

speech-recognition speech-to-text asr

Updated Jan 22, 2025

Nexdata-AI / 105-Hours-Italian-Gaming-Real-world-Casual-Conversation-and-Monologue-speech-dataset

speech-recognition speech-to-text speech-processing asr

Updated Jan 22, 2025

Ailurus1 / ASR-bot

ASR Telegram assistant for transcribing voice and video messages

telegram-bot transformers speech-recognition speech-to-text whisper asr

Updated Dec 19, 2024
Python
