jonasclick/Speech-to-Text-Script

Speech to Text with OpenAI Whisper

This simple script transcribes speech from an audio file into written text using OpenAI's Whisper model.

Requirements

  1. OpenAI API Account: To use this script, you need an OpenAI account with API access. API usage is billed separately from a ChatGPT Plus subscription ($20 per month), so a Plus subscription is not required and does not cover API calls.
  2. Available Balance: Ensure you have a sufficient API balance. You can check your balance here.
    • API billing works like the prepaid part of a phone plan: each transcription is charged against your balance, so make sure it has funds.

How to Use

  1. API Key: Generate a personal API key for yourself here. Do not share this key with anyone.
  2. Download the Script: Download the script to your local machine.
  3. Install Dependencies: Make sure you have the following dependencies installed:
pip install python-dotenv
pip install --upgrade openai
  4. Prepare Audio File: Place the audio file you want to transcribe in the same folder as the script and rename it to 'audio.wav'.
  5. Run the Script: Execute the script. On the first run you will be asked for your API key, which is then stored locally in a .env file in the same folder as the script. The script does not share your key; do not share it yourself, and keep the .env file private.
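
The flow described in the steps above can be sketched roughly as follows. This is not the repository's actual code, only an illustration under assumptions: the variable name OPENAI_API_KEY, the helper names get_api_key and transcribe, and the model name "whisper-1" are chosen here for the example. The .env file is read by hand so the helpers run without third-party packages; the line it writes is in the KEY=value form that python-dotenv also reads.

```python
from pathlib import Path

KEY_NAME = "OPENAI_API_KEY"  # assumed variable name; the real script may use another


def get_api_key(env_file: Path, ask=input) -> str:
    """Return the key stored in env_file, asking for it and saving it on first use."""
    existing = env_file.read_text(encoding="utf-8") if env_file.exists() else ""
    for line in existing.splitlines():
        name, sep, value = line.partition("=")
        if sep and name.strip() == KEY_NAME and value.strip():
            return value.strip()
    key = ask("Enter your OpenAI API key: ").strip()
    # Start on a fresh line if the file has content without a trailing newline.
    prefix = "" if not existing or existing.endswith("\n") else "\n"
    with env_file.open("a", encoding="utf-8") as f:
        f.write(f"{prefix}{KEY_NAME}={key}\n")
    return key


def transcribe(client, audio_path: Path, model: str = "whisper-1") -> str:
    """Upload the audio file to the transcription endpoint and return the text."""
    with audio_path.open("rb") as audio:
        result = client.audio.transcriptions.create(model=model, file=audio)
    return result.text


def main() -> None:
    # Imported here so the helpers above work without the openai package installed.
    from openai import OpenAI

    # Assumes the script is started from the folder that holds .env and audio.wav.
    client = OpenAI(api_key=get_api_key(Path(".env")))
    print(transcribe(client, Path("audio.wav")))
```

Calling main() runs the whole flow once. Passing the client into transcribe keeps the upload logic separate from key handling, which also makes it easy to try out with a stand-in client before spending any balance.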

You can monitor how much of your balance has been used here, so you always have an overview of what your transcriptions cost.

What Languages are Supported?

You can expect good results when transcribing the following languages:
Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Marathi, Maori, Nepali, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Thai, Turkish, Ukrainian, Urdu, Vietnamese, and Welsh.

Other languages might work, but the results may be of low quality.

You can read more in the OpenAI documentation here.
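
If you already know which language is spoken in the recording, the transcription endpoint accepts an optional language parameter as an ISO-639-1 code. The sketch below shows one way to pass it. The LANGUAGE_CODES table and the function name are illustrative assumptions and not part of this repository; the table covers only a handful of the languages listed above.

```python
from pathlib import Path
from typing import Optional

# A few ISO-639-1 codes for languages from the list above (illustrative subset).
LANGUAGE_CODES = {
    "English": "en",
    "German": "de",
    "French": "fr",
    "Spanish": "es",
    "Japanese": "ja",
}


def transcribe_in_language(
    client, audio_path: Path, language: Optional[str] = None, model: str = "whisper-1"
) -> str:
    """Transcribe audio_path, optionally telling the model which language to expect."""
    # Accept either a name from the table or a code such as "pl" directly.
    extra = {"language": LANGUAGE_CODES.get(language, language)} if language else {}
    with audio_path.open("rb") as audio:
        result = client.audio.transcriptions.create(model=model, file=audio, **extra)
    return result.text
```

Leaving language unset lets the model detect the language on its own, which is what a plain call without the parameter does.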

About

Python script to transcribe spoken text into written text using OpenAI Whisper.
