stt

A futuristic personal portfolio featuring a voice-activated AI assistant powered by Google Gemini. Built with React and Three.js, it offers an immersive cyberpunk experience with real-time voice interaction, dynamic 3D visualizations, and context-aware responses about skills and projects. Users can naturally converse with the AI to explore.

react javascript chatbot tts gemini stt conversational-ai prompt-engineering genrative-ai antigravity

Updated Nov 29, 2025
JavaScript

linto-ai / linto-studio

Star

Transcription and annotation interface for recorded audio or video files

subtitles caption subtitle stt asr captioning-videos audio-transcription video-transcription transcription-edition virtual-scribe

Updated Dec 17, 2025
JavaScript

Anishrkhadka / AetherChat

Star

Lightweight local voice-chat API using VOSK STT, Ollama LLM, and Kokoro TTS. Includes FastAPI backend, web UI, and optional Caddy HTTPS.

python docker text-to-speech tts caddy voice-chat speech-to-text stt realtime-audio fastapi vosk llm local-ai ollama-api

Updated Nov 24, 2025
JavaScript

jleboube / Diji-Scribe

Star

Audio/Video file transcription to text file

speech-to-text stt speech-to-text-transcription

Updated Nov 19, 2025
JavaScript

lalomorales22 / ai-visualization-voice-assistant

Star

AI voice assistant with visualization aspect

tts openai stt whisper groq

Updated Nov 18, 2025
JavaScript

0Shark / live-interview

Star

Chatbot with a 3D avatar that can answer interview questions in your behalf. It can speak and understand English, German and Albanian. Based on the GPT 4 model and Azure Speech to Text and Text to Speech, it is able to make realistic conversations about a job interview and save conversations.

threejs ai chatbot tts stt job-interview 3d-avatar gpt4

Updated Nov 16, 2025
JavaScript

Gyyyn / OpenWebTTS

Star

Open source Speechify alternative. Read PDFs and EPUBs with local models.

python ai self-hosted pytorch tts webapp stt whisper privacy-focused coqui-tts piper-tts kokoro-tts kitten-tts

Updated Nov 14, 2025
JavaScript

Hrishhii / AgroBot

Star

AI-powered farming assistant built with React + Flask that helps farmers through multilingual chat, voice, and image-based crop disease detection using Google Gemini and Groq APIs.

react flask tts stt gemini-api multimodal ai-assistant

Updated Nov 12, 2025
JavaScript

mahmud-r-farhan / voice-llm-assistant

Star

Talk to an LLM, using your voice and hear it talk back!

chat tts stt voice-assistant ai-agent llm

Updated Nov 4, 2025
JavaScript

Flux690 / Elora-AI-Receptionist

Star

Elora is a voice-based virtual receptionist for salons powered by the LiveKit Agents Framework. It enables natural conversations with clients, retrieves precise answers from a knowledge base using RAG-based retrieval, and escalates unknown queries to human supervisors — all while continuously learning from resolved interactions.

Updated Nov 2, 2025
JavaScript

pmbstyle / EchoTap

Star

Transcribe anything localy, fast and safe.

electron python speech-to-text transcription stt voice-activity-detection vue3 whisper-cpp

Updated Oct 17, 2025
JavaScript

FS-17 / SpeechDataBuilder

Star

Browser-based open-source tool for creating high-quality TTS/STT datasets. Features AI transcription, multiple export formats, and audio visualization with no server-side processing. Perfect for ML engineers, developers, and researchers.

tts stt dataset-maker