#

speech-recognition

Here are 5,744 public repositories matching this topic...

omarsafti09 / sherpa-onnx

🎤 Enable seamless audio processing with "sherpa-onnx," supporting speech recognition, synthesis, and more across multiple platforms.

audio macos rust raspberry-pi ios csharp cpp dotnet embeddings speech-recognition mfc object-pascal supersocket onnx diarization semantic-kernel xiaozhi-esp32 xiaozhi-server

Updated Nov 11, 2025
C++

cubist38 / mlx-openai-server

A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-friendly solution for running MLX-based vision and language models locally with an OpenAI-compatible interface.

flux queue speech-recognition image-generation whisper vision-api mlx fastapi apple-silicon structured-outputs mlx-lm mlx-vlm openai-compatible mlx-openai-server

Updated Nov 11, 2025
Python

isothermal-capitalgainstax520 / Whisper-Transcriber

🎤 Transcribe audio and video files into text or subtitles effortlessly on Google Colab using OpenAI Whisper, with no installation needed.

audio python text-to-speech ffmpeg text jupyter-notebook pytorch speech-recognition automatic-speech-recognition speech-to-text transcription voice-activity-detection deepl transcriber audio-transcription whisper-api audio-to-text openai-whisper

Updated Nov 11, 2025
Jupyter Notebook

Ghalwash123 / MiMo-Audio-Training

🔊 Train audio models efficiently with MiMo-Audio-Training, a toolkit designed for straightforward implementation and enhanced performance in audio processing tasks.

python open-source machine-learning deep-learning signal-processing modeling dataset feature-extraction speech-recognition neural-networks data-analysis performance-evaluation mimo audio-research audio-training

Updated Nov 11, 2025
Python

Siambm / videoTranslatorExtenstion

🎥 Translate and generate subtitles for any video in real-time, enhancing your viewing experience across multiple platforms with privacy-focused processing.

javascript multilingual chrome-extension translation video-processing speech-recognition browser-extension web-audio-api video-translation subtitle-generation real-time-subtitles

Updated Nov 11, 2025
JavaScript

AriakaHS / transcriber

🎙️ Record and transcribe audio effortlessly using AI technologies for clear and accurate text output with this simple web application.

macos swift ios mobile cross-platform nextjs youtube-api speech-recognition automatic-speech-recognition flutter restful-api wearable whisper omi personas smartglasses youtube-transcripts aitool

Updated Nov 11, 2025
TypeScript

yosef-Ctrl / transcribr

🎙️ Transcribe podcast episodes quickly by pasting links and API keys, generating human-readable transcripts with timestamps for easy editing.

kotlin postgresql google-cloud kotlin-android desktop-application speech-recognition gpt transcription asr voice-to-text vite cloudflare-workers jetpack-compose zustand cloudflare-kv material-you whisper-ai daisyui-v5

Updated Nov 11, 2025
TypeScript

falvarop / jarvis

🎤 Control your world with Jarvis, a voice-activated AI assistant that simplifies tasks and enhances productivity.

linux chat bot home-automation raspberry-pi webpack ai deep-learning messenger python-programming assistant voice-recognition speech-recognition openai personal-assistant virtual-assistant tauri jarvis-ai

Updated Nov 11, 2025
Python

Damijan123 / TatvaX-AI-PROTOTYPE

📚 Transform learning with TatvaX, an AI platform providing personalized education in 8 Indian languages, breaking down language barriers for millions.

edtech speech-recognition flask-application voice-assistant smart-education smart-learning educational-technology translation-api python-chatbot indian-languages ai-education voice-enabled nlp-project multilingual-learning ai-chatbot-project student-learning ai-tutor educational-chatbot

Updated Nov 11, 2025
Python

kyugakai / NeuraVoice

🗣️ Elevate your workflow with NeuraVoice, an AI desktop assistant that combines speech recognition and local LLM responses for seamless task automation.

desktop-app python nlp text-to-speech ai speech-recognition tkinter voice-assistant voice-bot llm

Updated Nov 11, 2025

SANTANC / speech-to-owl

🎤 Transform spoken phrases into OWL ontologies, making it easy to create structured data from voice. Ideal for developers and researchers alike.

python flask rdf owl ontology rdflib speech-recognition openai speech-to-text whisper audio-processing rdfxml voice-interface

Updated Nov 11, 2025
Python

itssharmaXD / numbers-le

🔢 Extract numbers swiftly from JSON, YAML, CSV, TOML, INI, and ENV files at 1.5M numbers per second, 100x faster than manual searching.

android python c productivity machine-learning typescript cpp functional-programming analysis linked-data linkedin speech-recognition speech-to-text interview-preparation hms dsa ml-kit neetcode150

Updated Nov 11, 2025
TypeScript

visu123s / MimicKit

🤖 Learn motion imitation with MimicKit, a framework offering advanced methods to train motion controllers using state-of-the-art algorithms and techniques.

open-source machine-learning deep-learning signal-processing python-library speech-recognition neural-networks user-interface data-augmentation audio-processing generative-models voice-synthesis sound-design mimic-kit real-time-synthesis

Updated Nov 11, 2025
Python

Sasuke810 / quran-researcher

📖 Explore the Quran with an AI-powered Next.js app, offering semantic search, tafsir integration, and enhanced study features for deeper understanding.

json machine-learning research speech-recognition islam txt theology religious-studies quran-info gpt3 gpt3-prompts feristha jibrail mikail israfil raqib kafir islamic-teaching

Updated Nov 11, 2025
TypeScript

awesome-german / ai-tools

AI-powered tools and chatbots that personalize German learning and automate feedback.

Updated Nov 11, 2025

TeacherShoxrux / echo-zxx

🌟 Simplify language translation and communication with echo-zxx, a powerful tool for seamless text conversion across multiple languages.

python open-source machine-learning natural-language-processing text-to-speech deep-learning multimedia echo voice-commands speech-recognition command-line-tool audio-processing real-time-processing voice-assistant zxx

Updated Nov 11, 2025

aboda-dirbas / whisperclip

🎤 Enhance your voice-to-text transcriptions with WhisperClip, prioritizing privacy and featuring AI improvements for macOS users.

python macos productivity clipboard privacy local speech-recognition openai speech-to-text whisper audio-processing productivity-tools voice-to-text macos-application audio-transcription whisper-ai llm-inference voice-to-text-transcription

Updated Nov 11, 2025
Swift

Sam67xsaad / WWW-5

🎉 Kickstart your Web3 journey by showcasing your project from the Women Web3 Wave #5 Demoday. Join us to drive change and innovation together.

php google csharp neural-network compiler simulation speech deprecated recurrent-neural-networks speech-recognition redirect ensemble-learning rbm convolutional-neural-networks dotnet-framework audio-recognition rqc paper-code

Updated Nov 11, 2025
HTML

KOLE87 / transformers_dart

🤖 Run state-of-the-art Machine Learning models in Dart with transformers_dart—cross-platform, serverless, and based on Hugging Face's transformers.

audio python nlp machine-learning deep-learning pytorch transformer speech-recognition glm pretrained-models gemma vlm pytorch-transformers model-hub llm qwen deepseek

Updated Nov 11, 2025
Dart

diyuyuyu / Deep-Learning

🤖 Explore deep learning architectures like ANN, CNN, RNN, and LSTM to enhance your understanding of machine learning and neural networks.

data-science deep-neural-networks translation neural-network transformers deep-reinforcement-learning coursera kaggle speech-recognition neural-networks rnn ensemble-learning segmentation bayesian recommender-systems gcn optimizers large-language-models

Updated Nov 11, 2025
Jupyter Notebook

Improve this page

Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."