A ready-to-use, minimal app that converts any speech into text.
-
Updated
Jul 5, 2024 - JavaScript
A ready-to-use, minimal app that converts any speech into text.
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.
在线前端频率分析扒谱 front-end music transcription
VOXD is a speech-to-text, voice-typing, dictation software for linux distributions. It is an open-source, free of charge, USER-FRIENDLY software, for as many linux distros as possible.
OpenAI/ChatGPT library for Java - Requires JDK 11 at minimum.
This contains a practical guide for non-technical users on how to use OpenAI's Whisper for transcription and translation
🎙️ AI-powered Telegram bot for voice-to-text transcription using OpenAI Whisper. CPU-only, no GPU required, privacy-focused with local processing.
explore AMT from the perspective of timbre
Automatically generate accurate, per-word video captions with timestamps using Whisper ASR and FFmpeg, perfect for YouTube, social media, and accessibility.
Offline, privacy-first screen recorder with local AI transcription and smart summaries. Built with Electron, React, and TypeScript—capture desktop video, auto-generate transcripts, and get instant AI-powered meeting and lesson insights, all cross-platform and fully customizable.
🚀📜 Customized For Agentic AI: Enhanced the Whisper Assistant extension with improved setup scripts and documentation, ensuring seamless integration and functionality on Linux platforms.
"An offline video & audio transcription tool powered by OpenAI Whisper. Convert your tutorials, lectures, and podcasts into accurate text transcripts and use AI to generate summaries, notes, and mind maps — saving hours of time and boosting productivity."
a python script that can auto generate subtitle in YouTube Videos
Lightning-fast audio transcription (6x speed) with batch processing, Obsidian integration, and optimized real-time performance. Powered by faster-whisper and Distil-Whisper models.
Flick is a powerful AI-driven SaaS platform for real-time video sharing and collaboration, crafted for both web and desktop environments. Designed for seamless video recording, streaming, and sharing without third-party dependencies, Flick offers teams and individuals an integrated workspace to create, manage, and share video content in real-time.
🎬 AI-powered subtitle generator using OpenAI Whisper. Multi-language support, batch processing, GPU acceleration. Generate SRT/WebVTT subtitles instantly!
One-command audio transcription from any video platform Transform video URLs into text transcripts instantly with automatic audio download, AI transcription, and clipboard integration. Perfect for content creators, researchers, students, and anyone who needs quick video-to-text conversion.
An AI-powered voice notes app using React Native & Expo. Seamlessly record, transcribe, and organize thoughts with intelligent summarization and tagging. The ultimate mobile productivity companion.
Instagram content analysis, data insights
Open Video Transcribe - Open-source video transcription tool that emphasizes the primary use case: transcribing video files to text with support for multiple model types.
Add a description, image, and links to the ai-transcription topic page so that developers can more easily learn about it.
To associate your repository with the ai-transcription topic, visit your repo's landing page and select "manage topics."