Audio/Video file transcription to text file
-
Updated
Nov 19, 2025 - JavaScript
Audio/Video file transcription to text file
🤖 AI Babel Fish 🐠 | 🎤 Voice-in → 🧠 LLM Translation → 🔊 Voice-out | Built with Python, Flask, & IBM Watson.
Practice IELTS, TOEFL, & PTE speaking online. This web app offers full test simulations and real-time performance analysis.
The TTS Engine is a sophisticated web-based platform designed to transform written text into speech that sounds remarkably natural.
[NVIDIA ONLY] A minimal Gradio interface for Automatic Speech Recognition. Transcribe Audio in Malayalam language. (Requirements 6GB VRAM / 16GB RAM)
VoAF : nodejs voice assistant framework using google text to speech and speech to text.
A futuristic personal portfolio featuring a voice-activated AI assistant powered by Google Gemini. Built with React and Three.js, it offers an immersive cyberpunk experience with real-time voice interaction, dynamic 3D visualizations, and context-aware responses about skills and projects. Users can naturally converse with the AI to explore.
Transcribe anything localy, fast and safe.
A TTS and STT tool for students to check their reading of specified text (pronunciation).
Super simple Voice based translation system with probablly the fastest speeds available
Beach Wreck Ignition: Challenges in opensource voice, linux.conf.au 2019 Christchurch, New Zealand #lca2019
A React-based chatbot integrating visual AI and voice technologies (STT/TTS) for an optimized user experience.
Elora is a voice-based virtual receptionist for salons powered by the LiveKit Agents Framework. It enables natural conversations with clients, retrieves precise answers from a knowledge base using RAG-based retrieval, and escalates unknown queries to human supervisors — all while continuously learning from resolved interactions.
Brina é um bot de acessibilidade que oferece funções de transcrição de texto.
Sara is a prompt that: listens for commands (keyboard or voice recognition), executes a built in command or a plugin based on regular expression string matching, then uses text-to-speech give the answer. Now with vision support through USB webcam (WIP).
Add a description, image, and links to the stt topic page so that developers can more easily learn about it.
To associate your repository with the stt topic, visit your repo's landing page and select "manage topics."