# speech-recognition

Here are 193 public repositories matching this topic...

AI MOM is an AI-powered meeting intelligence platform that delivers real-time transcription, speaker recognition, and multi-LLM summaries using FastAPI, Whisper, Groq, and OpenRouter for intelligent meeting insights.

  • Updated Nov 5, 2025
  • HTML

Jarvis AI Assistant is a comprehensive desktop application that transforms your computer into an intelligent, voice-controlled environment. Built with Python and modern web technologies, it provides hands-free access to system functions, health monitoring, information retrieval, and entertainment via natural voice commands and biometric security.

  • Updated Oct 1, 2025
  • HTML

Pandore offers a set of tools that facilitate the most common corpus processing tasks for digital humanities research. Automatic pipelines for a set of tasks are also available.

  • Updated Sep 30, 2025
  • HTML

Medibot is a voice-enabled medical AI assistant using RAG for accurate healthcare conversations. Evolved from my text-based chatbot, it now understands spoken questions and responds with voice answers, making medical guidance more accessible through intuitive multimodal interaction.

  • Updated Sep 27, 2025
  • HTML

This project focuses on real-time Speech Emotion Recognition (SER) using the "ravdess-emotional-speech-audio" dataset. Using Long Short-Term Memory (LSTM) networks, it processes the diverse emotional states expressed in 1440 audio files, recorded by 24 professional actors to ensure a controlled representation of each emotion.

  • Updated Sep 5, 2025
  • HTML
