a cli tool i made for myself.
one simple way to transcribe audio from terminal.
all in native python. no subscription. blazingly fast. free. enjoy.
- Supports Many Formats: Works with popular audio (MP3, WAV, M4A) and video (MP4, AVI, MOV) files.
- Automatic Audio Extraction: Extracts audio from video files on the fly using ffmpeg.
- Different Output Options: Save transcriptions in formats like TXT, JSON, SRT, VTT, and TSV.
- Python 3.x
- FFmpeg (for video file support)
pip install aa-whisperFFmpeg is required if you want to transcribe video files.
Linux:
sudo apt install ffmpegmacOS:
brew install ffmpegWindows (PowerShell):
winget install FFmpegSet your Groq API key as an environment variable.
Linux:
echo 'export GROQ_API_KEY="your_groq_api_key_here"' >> ~/.bashrc && source ~/.bashrcmacOS:
echo 'export GROQ_API_KEY="your_groq_api_key_here"' >> ~/.zshrc && source ~/.zshrcWindows:
[System.Environment]::SetEnvironmentVariable("GROQ_API_KEY", "your_groq_api_key_here", "User")whisper audio_or_video_file [options]--model: Whisper model to use (default: whisper-large-v3-turbo)--language: Language code (optional)--task: 'transcribe' or 'translate' (default: transcribe)--output-dir: Output directory (default: audio_filename_transcription)--response-format: 'verbose_json', 'json', 'text', 'srt', 'vtt' (default: verbose_json)--version: Show version info
Transcribe audio:
whisper recording.mp3Transcribe video:
whisper lecture.mp4Translate to English:
whisper interview.mp3 --task translateMultiple files at once:
whisper file1.mp3 file2.wav video1.mp4Each transcription will generate:
transcript.txt: Plain texttranscript.json: JSON metadatatranscript.srt: SubRip subtitlestranscript.vtt: WebVTT formattranscript.tsv: Timestamps and text
MIT