Skip to content

Starmel/OpenSuperWhisper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OpenSuperWhisper

OpenSuperWhisper is a macOS application that provides real-time audio transcription using the Whisper model. It offers a seamless way to record and transcribe audio with customizable settings and keyboard shortcuts.

Free alternative to paid services like:

Installation

brew update # Optional
brew install opensuperwhisper

Or from github releases page.

Features

  • 🎙️ Real-time audio recording and transcription
  • ⌨️ Global keyboard shortcuts for quick recording (use cmd + ` )
  • 🌍 Support for multiple languages with auto-detection (not tested, but probably works)
  • 🔄 Optional translation to English (for better translation add initial prompt with english sentences)
  • 💾 Local storage of recordings with transcriptions
  • 🎛️ Advanced transcription settings (not tested)
  • 🇯🇵🇨🇳🇰🇷 Support for Asian languages with auto-correct

Requirements

  • macOS (Apple Silicon/ARM64)

Support

If you encounter any issues or have questions, please:

  1. Check the existing issues in the repository
  2. Create a new issue with detailed information about your problem
  3. Include system information and logs when reporting bugs

Building locally

To build locally, you'll need:

git clone git@github.com:Starmel/OpenSuperWhisper.git
cd OpenSuperWhisper
git submodule update --init --recursive
brew install cmake libomp rust ruby
gem install xcpretty
./run.sh build

In case of problems, consult .github/workflows/build.yml which is our CI workflow where the app gets built automatically on GitHub's CI.

Contributing

Contributions are welcome! Please feel free to submit pull requests or create issues for bugs and feature requests.

Contribution TODO list

  • Streaming transcription (#22)
  • Custom dictionary (#20)
  • Intel macOS compatibility (#16)
  • Agent mode (#14)
  • Background app (#9)
  • Support long-press single key audio recording (#19)

License

OpenSuperWhisper is licensed under the MIT License. See the LICENSE file for details.

Whisper Models

You can download Whisper model files (.bin) from the Whisper.cpp Hugging Face repository. Place the downloaded .bin files in the app's models directory. On first launch, the app will attempt to copy a default model automatically, but you can add more models manually.

About

macOS whisper dictation app

Topics

Resources

License

Stars

Watchers

Forks

Contributors 9

Languages