a streaming voice AI using OpenAI, Deepgram, and ElevenLabs
This project requires Python 3.9 and a few dependencies. Follow the steps below to set up your environment.
First, create a new Conda environment with Python 3.9. You can do this using the following command:
conda create --name goodlord python=3.9Activate the environment using:
conda activate goodlordNext, install the required dependencies. You can do this using the following command:
pip install -r requirements.txtYou may also need to install mpv a cross platform media player that we use to stream audio from the text to speech API. On mac os, you can do this with homebrew easily:
brew install mpvCreate a .env.secret file in the project root directory.
cp .env.example .env.secretAdd the following environment variables:
OPENAI_API_KEY=your_openai_api_key
ELEVENLABS_API_KEY=your_elevenlabs_api_key
DEEPGRAM_API_KEY=your_deepgram_api_key
ELEVEN_LABS_VOICE_ID=your_eleven_labs_voice_idReplace your_openai_api_key, your_elevenlabs_api_key, your_deepgram_api_key, and your_eleven_labs_voice_id with your actual API keys.
Finally, you can run the code using the following command:
python main.pyAfter running the code, you should see the output in your terminal. If there are any errors, they will also be displayed in the terminal.
To stop the program, use the following command:
CTRL+CThis will stop the execution of the program.
If you encounter any issues while running the program, please check the following:
- Ensure that all the dependencies are installed correctly.
- Verify that the API keys are correct.
- Make sure that the Python version is 3.9.
If the problem persists, please create an issue on Github!