🤖 AI Chatbot with Voice Interface - A Flask web app featuring Groq-powered chat, voice input/output, and theme support. Combines natural language processing with speech synthesis for an interactive chat experience. #Python #Flask #AI #VoiceInterface
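A minimal sketch of how such a Flask chat endpoint might call Groq. The route path, model id, and the GROQ_API_KEY environment variable are assumptions for illustration, not details taken from this project.

```python
# Minimal Flask + Groq chat endpoint (illustrative sketch, not this project's code).
# Assumes the groq SDK is installed and GROQ_API_KEY is set in the environment.
import os
from flask import Flask, request, jsonify
from groq import Groq

app = Flask(__name__)
client = Groq(api_key=os.environ["GROQ_API_KEY"])

@app.route("/chat", methods=["POST"])
def chat():
    user_message = request.json.get("message", "")
    # The model name is an assumption; any Groq-hosted chat model would work here.
    completion = client.chat.completions.create(
        model="llama-3.1-8b-instant",
        messages=[
            {"role": "system", "content": "You are a helpful voice assistant."},
            {"role": "user", "content": user_message},
        ],
    )
    return jsonify({"reply": completion.choices[0].message.content})

if __name__ == "__main__":
    app.run(debug=True)
```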
This project implements a speech emotion classification system using neural networks and genetic algorithms for optimization. The system classifies emotions such as calm, happy, sad, angry, fearful, surprised, and disgusted from speech audio using the RAVDESS dataset.
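A rough sketch of the kind of pipeline such a system uses: mean MFCC features per clip plus a small neural classifier. The dataset layout, feature choice, and classifier are assumptions, and the project's genetic-algorithm optimization (for example, evolving hyperparameters) is only hinted at in comments.

```python
# Illustrative sketch of speech emotion classification on RAVDESS-style files.
# Paths, features, and the classifier are assumptions, not this project's code.
import glob
import os
import numpy as np
import librosa
from sklearn.neural_network import MLPClassifier
from sklearn.model_selection import train_test_split

def extract_features(path, n_mfcc=40):
    """Load a clip and summarize it as a fixed-length mean-MFCC vector."""
    audio, sr = librosa.load(path, sr=None)
    mfcc = librosa.feature.mfcc(y=audio, sr=sr, n_mfcc=n_mfcc)
    return mfcc.mean(axis=1)

# RAVDESS file names encode the emotion as the third dash-separated field.
files = glob.glob("ravdess/Actor_*/*.wav")  # hypothetical dataset layout
X = np.array([extract_features(f) for f in files])
y = np.array([int(os.path.basename(f).split("-")[2]) for f in files])

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# A genetic algorithm could search over hidden_layer_sizes / alpha; fixed values here.
clf = MLPClassifier(hidden_layer_sizes=(256, 128), max_iter=500, random_state=42)
clf.fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))
```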
AI MOM is an AI-powered meeting intelligence platform that delivers real-time transcription, speaker recognition, and multi-LLM summaries using FastAPI, Whisper, Groq, and OpenRouter for intelligent meeting insights.
WhisperX ASR is a FastAPI-based application for automatic speech recognition. It transcribes audio files to text using WhisperX, supports multiple languages and batch processing, and offers both a web UI and a REST API.
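A minimal sketch of what a FastAPI transcription endpoint built on WhisperX can look like. The model size, device, compute type, and route path are assumptions, not this project's actual API.

```python
# Minimal FastAPI + WhisperX transcription endpoint (sketch, not this project's API).
import tempfile
import whisperx
from fastapi import FastAPI, UploadFile, File

app = FastAPI()
# Model size, device, and compute type are assumptions; adjust for GPU deployments.
model = whisperx.load_model("large-v2", device="cpu", compute_type="int8")

@app.post("/transcribe")
async def transcribe(file: UploadFile = File(...)):
    # Persist the upload to a temp file so WhisperX can load it from disk.
    with tempfile.NamedTemporaryFile(suffix=".wav", delete=False) as tmp:
        tmp.write(await file.read())
        tmp_path = tmp.name
    audio = whisperx.load_audio(tmp_path)
    result = model.transcribe(audio, batch_size=8)
    text = " ".join(seg["text"].strip() for seg in result["segments"])
    return {"language": result.get("language"), "text": text}
```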
An example project that provides a browser-based web interface for real-time speech-to-text, built on the Azure real-time speech-to-text service and Socket.IO.
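A sketch of the server-side pattern such a project uses: Azure continuous recognition pushing each finalized utterance to connected browsers over Socket.IO. It is simplified to use the server's default microphone rather than streaming browser audio, and flask-socketio, the event name, and the environment-variable names are assumptions.

```python
# Sketch of pushing Azure continuous-recognition results to browsers over Socket.IO.
import os
import azure.cognitiveservices.speech as speechsdk
from flask import Flask
from flask_socketio import SocketIO

app = Flask(__name__)
socketio = SocketIO(app, cors_allowed_origins="*")

speech_config = speechsdk.SpeechConfig(
    subscription=os.environ["AZURE_SPEECH_KEY"],
    region=os.environ["AZURE_SPEECH_REGION"],
)
# No audio_config given, so the recognizer listens on the default microphone.
recognizer = speechsdk.SpeechRecognizer(speech_config=speech_config)

def on_recognized(evt):
    # Forward each finalized utterance to all connected browser clients.
    socketio.emit("transcript", {"text": evt.result.text})

recognizer.recognized.connect(on_recognized)

if __name__ == "__main__":
    recognizer.start_continuous_recognition()
    socketio.run(app, port=5000)
```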
Jarvis AI Assistant is a comprehensive desktop application that transforms your computer into an intelligent, voice-controlled environment. Built with Python and modern web technologies, it provides hands-free access to system functions, health monitoring, information retrieval, and entertainment via natural voice commands and biometric security.
Pandore offers a set of tools that facilitate the most common corpus-processing tasks in digital humanities research. Automatic pipelines for several of these tasks are also available.
Medibot is a voice-enabled medical AI assistant using RAG for accurate healthcare conversations. Evolved from my text-based chatbot, it now understands spoken questions and responds with voice answers, making medical guidance more accessible through intuitive multimodal interaction.
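A minimal sketch of the retrieval step at the heart of such a RAG assistant: embed a small document set, embed the spoken question, and return the closest passages to ground the LLM's answer. The embedding model and the toy documents are assumptions, not Medibot's actual stack.

```python
# Minimal retrieval step for a RAG-style medical chatbot (illustrative only).
import numpy as np
from sentence_transformers import SentenceTransformer

documents = [
    "Ibuprofen is a nonsteroidal anti-inflammatory drug used for pain and fever.",
    "Adults are generally advised to drink water regularly throughout the day.",
    "Paracetamol overdose can cause serious liver damage and needs urgent care.",
]

model = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model
doc_vecs = model.encode(documents, normalize_embeddings=True)

def retrieve(question, k=2):
    """Return the k documents most similar to the question (cosine similarity)."""
    q_vec = model.encode([question], normalize_embeddings=True)[0]
    scores = doc_vecs @ q_vec
    top = np.argsort(scores)[::-1][:k]
    return [documents[i] for i in top]

# The retrieved passages would then be placed in the LLM prompt as grounding context.
print(retrieve("What happens if someone takes too much paracetamol?"))
```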
A web application for real-time voice transcription and speech-to-text conversion. Supports multiple languages and includes features like audio visualization, text-to-speech, word count, and easy export options.
A smart AI-powered platform that detects emotions from student voice input, classifies their intensity, prioritizes critical cases, and responds via an intelligent chatbot.
This project focuses on real-time Speech Emotion Recognition (SER) using the "ravdess-emotional-speech-audio" dataset. Leveraging standard audio-processing libraries and Long Short-Term Memory (LSTM) networks, it classifies the emotional states expressed in the dataset's 1440 audio files, which were recorded by 24 professional actors to ensure controlled representation.
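A sketch of an LSTM classifier over MFCC time sequences, which is a common setup for RAVDESS-based SER. The sequence length, layer sizes, and 8-class output are assumptions, not necessarily this project's configuration.

```python
# Sketch of an LSTM classifier over MFCC sequences for RAVDESS-style SER.
import numpy as np
import librosa
import tensorflow as tf

N_MFCC, MAX_FRAMES, N_CLASSES = 40, 200, 8  # assumed shapes and class count

def mfcc_sequence(path):
    """Return a (MAX_FRAMES, N_MFCC) MFCC sequence, padded or truncated."""
    audio, sr = librosa.load(path, sr=None)
    mfcc = librosa.feature.mfcc(y=audio, sr=sr, n_mfcc=N_MFCC).T  # (frames, n_mfcc)
    padded = np.zeros((MAX_FRAMES, N_MFCC), dtype=np.float32)
    padded[: min(len(mfcc), MAX_FRAMES)] = mfcc[:MAX_FRAMES]
    return padded

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(MAX_FRAMES, N_MFCC)),
    tf.keras.layers.LSTM(128, return_sequences=True),
    tf.keras.layers.LSTM(64),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(N_CLASSES, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])
# Training would stack mfcc_sequence() outputs into X and emotion labels into y:
# model.fit(X_train, y_train, validation_split=0.1, epochs=50, batch_size=32)
```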
WALL-E is a Python-based AI voice assistant that listens to commands and performs tasks like searching Wikipedia, opening websites, playing songs, telling jokes, and more. It uses speech recognition and text-to-speech to create a smooth, hands-free experience.
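A sketch of the listen/act/speak loop such an assistant runs, using speech_recognition, pyttsx3, wikipedia, and webbrowser. The command keywords and library choices are illustrative, not the project's exact code.

```python
# Sketch of a listen -> act -> speak loop for a WALL-E-style assistant.
import webbrowser
import wikipedia
import pyttsx3
import speech_recognition as sr

engine = pyttsx3.init()

def speak(text):
    engine.say(text)
    engine.runAndWait()

def listen():
    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        audio = recognizer.listen(source)
    try:
        return recognizer.recognize_google(audio).lower()
    except sr.UnknownValueError:
        return ""

while True:
    command = listen()
    if "wikipedia" in command:
        topic = command.replace("wikipedia", "").strip()
        speak(wikipedia.summary(topic, sentences=2))
    elif "open youtube" in command:
        webbrowser.open("https://www.youtube.com")
    elif "stop" in command:
        speak("Goodbye")
        break
```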
A simple web application to help users practice French pronunciation: record your voice, compare it against a reference phrase, get quick feedback, and iterate.
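One way the compare-and-feedback step could work: transcribe the recording with a French locale and score it against the reference phrase using a string-similarity ratio. The recognizer, locale string, and feedback threshold are assumptions, not this project's implementation.

```python
# Sketch of the record/compare/feedback step for French pronunciation practice.
import difflib
import speech_recognition as sr

def score_pronunciation(wav_path, reference):
    """Transcribe the recording in French and score similarity to the reference."""
    recognizer = sr.Recognizer()
    with sr.AudioFile(wav_path) as source:
        audio = recognizer.record(source)
    try:
        heard = recognizer.recognize_google(audio, language="fr-FR")
    except sr.UnknownValueError:
        return 0.0, ""
    ratio = difflib.SequenceMatcher(None, heard.lower(), reference.lower()).ratio()
    return ratio, heard

ratio, heard = score_pronunciation("attempt.wav", "Bonjour, comment allez-vous ?")
print(f"heard: {heard!r}, similarity: {ratio:.0%}")
feedback = "Great!" if ratio > 0.85 else "Try again, focusing on each syllable."
print(feedback)
```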