Soniq Music DL - AI Karaoke Creator

🎵 AI-powered karaoke video creator using Docker Spleeter for ML-based vocal separation and OpenAI Whisper for transcription.

🚀 NEW: Daily Automated Karaoke Generation

Automatically download trending music videos and convert them to karaoke format every day!

# Quick start - Set up daily automation
./setup_daily_karaoke.sh

# Or run manually now
python3 daily_karaoke_generator.py

# Test your setup
./test_setup.sh

📚 Complete Daily Karaoke Guide → DAILY_KARAOKE_GUIDE.md

What it does

✅ Downloads trending music videos daily from YouTube
✅ Separates vocals from instrumentals using Docker Spleeter
✅ Creates karaoke videos with instrumental-only tracks
✅ Organizes output in timestamped folders
✅ Generates comprehensive reports and logs
✅ Cleans up old runs automatically

Perfect for: Daily karaoke content creation, music libraries, entertainment venues

Features

🤖 Docker Spleeter integration - True ML-based vocal/instrumental separation
🗣️ OpenAI Whisper transcription - Accurate speech-to-text with word-level timing
📝 Synchronized subtitles - Word highlighting with professional typography
🎚️ Multiple vocal levels - 0%, 5%, 10%, 15%, 25%, 50%, 75%
🌐 Bilingual support - Original language + transliteration
☁️ Cloud deployment - Google Cloud Run ready
⏰ Daily automation - Scheduled runs via cron or systemd
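
To illustrate how word-level timing can drive word highlighting, here is a minimal sketch that turns timed words into one ASS `Dialogue` line with `\k` karaoke tags. The `(text, start, end)` tuple shape and the function names are assumptions for illustration, not the project's actual code. Durations in `\k` tags are in centiseconds, and pauses between words are emitted as empty syllables.

```python
from typing import List, Tuple

# Assumed input shape: (text, start_seconds, end_seconds) for each word.
Word = Tuple[str, float, float]


def ass_time(seconds: float) -> str:
    """Format seconds as an ASS timestamp, H:MM:SS.cc (centiseconds)."""
    total_cs = int(round(seconds * 100))
    cs = total_cs % 100
    total_s = total_cs // 100
    return f"{total_s // 3600}:{(total_s % 3600) // 60:02d}:{total_s % 60:02d}.{cs:02d}"


def karaoke_dialogue(words: List[Word], style: str = "Default") -> str:
    """Build one ASS Dialogue line with a \\k tag per word."""
    if not words:
        raise ValueError("need at least one word")
    line_start, line_end = words[0][1], words[-1][2]
    text_field = ""
    cursor = line_start
    for index, (text, start, end) in enumerate(words):
        gap_cs = int(round((start - cursor) * 100))
        if gap_cs > 0:
            # Empty syllable: lets the highlight wait out the pause.
            text_field += f"{{\\k{gap_cs}}}"
        duration_cs = int(round((end - start) * 100))
        separator = " " if index < len(words) - 1 else ""
        text_field += f"{{\\k{duration_cs}}}{text}{separator}"
        cursor = end
    return (
        f"Dialogue: 0,{ass_time(line_start)},{ass_time(line_end)},"
        f"{style},,0,0,0,,{text_field}"
    )
```

The resulting line belongs in the `[Events]` section of an `.ass` file; the `Default` style name is a placeholder and must match a style defined in that file.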

Local Usage

Prerequisites

Python 3.9+
Docker (for Spleeter audio separation)
FFmpeg
yt-dlp (for video downloads)
OpenAI API key (optional, for Whisper transcription)

Quick Setup

# 1. Install dependencies
pip install -r requirements.txt
pip install yt-dlp

# 2. Pull Spleeter Docker image
docker pull researchdeezer/spleeter:3.8-2stems

# 3. Test setup
./test_setup.sh

# 4. Set up daily automation (optional)
./setup_daily_karaoke.sh

Environment Variables

Create a .env file:

# Optional: For Whisper transcription
export OPENAI_API_KEY="your-openai-api-key"

# Optional: For cloud storage
export BUCKET_NAME="your-gcs-bucket"

# Optional: Custom port for web service
export PORT=8080
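
As a rough sketch of how a Python service might read these variables once the file has been sourced, the helper below treats all three as optional. The `Settings` class, the `load_settings` name, and the fallback port of 8080 are assumptions for illustration; the project's own code may read them differently.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    openai_api_key: Optional[str]  # None means transcription is skipped
    bucket_name: Optional[str]     # None means no cloud upload
    port: int


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read the optional variables, treating empty strings as unset."""
    return Settings(
        openai_api_key=env.get("OPENAI_API_KEY") or None,
        bucket_name=env.get("BUCKET_NAME") or None,
        port=int(env.get("PORT") or "8080"),  # assumed default
    )
```

Passing a plain dictionary instead of `os.environ` makes the helper easy to exercise in tests.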

Daily Karaoke Generation

# Run daily karaoke generation manually
python3 daily_karaoke_generator.py

# Or use the wrapper script
./run_karaoke_now.sh

Output will be saved to: karaoke_daily_runs/run_YYYYMMDD_HHMMSS/
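
The folder naming above can be sketched in a few lines of Python, which also shows one way old runs might be selected for cleanup. The function names and the 7-day retention window are assumptions for illustration; the actual values live in the generator script.

```python
from datetime import datetime, timedelta
from typing import Iterable, List, Optional

RUN_PREFIX = "run_"
RUN_FORMAT = "%Y%m%d_%H%M%S"  # YYYYMMDD_HHMMSS


def run_dir_name(now: datetime) -> str:
    """Name a run folder after the time the run started."""
    return RUN_PREFIX + now.strftime(RUN_FORMAT)


def parse_run_time(name: str) -> Optional[datetime]:
    """Recover the start time from a folder name, or None if it is not a run."""
    if not name.startswith(RUN_PREFIX):
        return None
    try:
        return datetime.strptime(name[len(RUN_PREFIX):], RUN_FORMAT)
    except ValueError:
        return None


def stale_runs(names: Iterable[str], now: datetime, keep_days: int = 7) -> List[str]:
    """Return run folders older than keep_days (assumed retention window)."""
    cutoff = now - timedelta(days=keep_days)
    stale = []
    for name in names:
        started = parse_run_time(name)
        if started is not None and started < cutoff:
            stale.append(name)
    return sorted(stale)
```

Folders that do not match the `run_` pattern are ignored, so unrelated directories are never selected for deletion.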

Docker Compose Usage

# Start processing service
docker-compose up processing

# Run in background
docker-compose up -d processing

# View logs
docker-compose logs -f processing

# Test Spleeter directly
mkdir -p input output
docker-compose --profile tools run spleeter separate -i /input/video.mp4 -o /output -p spleeter:2stems
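
With the 2stems model and default settings, Spleeter writes `vocals.wav` and `accompaniment.wav` into a subfolder named after the input file. A small helper like the following (the function name is hypothetical) can compute where to look for the results of the command above:

```python
from pathlib import PurePosixPath
from typing import Dict


def spleeter_2stems_outputs(input_path: str, output_dir: str) -> Dict[str, str]:
    """Expected output files for a 2stems run with Spleeter's default naming."""
    # Spleeter groups stems in a folder named after the input file (no extension).
    base = PurePosixPath(output_dir) / PurePosixPath(input_path).stem
    return {
        "vocals": str(base / "vocals.wav"),
        "accompaniment": str(base / "accompaniment.wav"),
    }
```

For the example command, this points at `/output/video/vocals.wav` and `/output/video/accompaniment.wav` inside the container, which map to the local `output` folder if it is mounted as in the compose file.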

Create Karaoke Videos (Manual)

# Multiple vocal levels (0%, 5%, 10%, 15%, 25%, 50%, 75%)
python3 create_multi_vocal_karaoke.py

# Low vocal levels (5%, 10%, 15%)
python3 create_low_vocal_karaoke.py

# Download & process YouTube videos
python3 download_and_create_karaoke.py

Cloud Deployment

Deploy to Google Cloud Run

1. Set up Google Cloud Project

gcloud config set project YOUR_PROJECT_ID
gcloud services enable run.googleapis.com cloudbuild.googleapis.com storage.googleapis.com

2. Create Storage Bucket

gsutil mb gs://soniq-karaoke-videos

3. Deploy with Cloud Build

gcloud builds submit --config=cloudbuild.yaml

API Usage

Health Check

curl https://soniq-karaoke-HASH-uc.a.run.app/health

Process Video

curl -X POST https://soniq-karaoke-HASH-uc.a.run.app/process \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://www.youtube.com/watch?v=VIDEO_ID",
    "vocal_levels": [0.0, 0.25, 0.5]
  }'

Response:

{
  "job_id": "uuid",
  "title": "Video Title",
  "videos": [
    {
      "vocal_level": 0,
      "url": "https://storage.googleapis.com/bucket/file.mp4",
      "filename": "karaoke_0_vocal.mp4"
    }
  ]
}
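
The same call can be made from Python with only the standard library. This is a minimal sketch based on the request and response shapes shown above; the helper names are hypothetical, and the 0.0 to 1.0 range check on `vocal_levels` is an assumption inferred from the example values.

```python
import json
import urllib.request
from typing import Dict, Sequence


def build_process_request(
    base_url: str, video_url: str, vocal_levels: Sequence[float]
) -> urllib.request.Request:
    """Prepare the POST /process request without sending it."""
    for level in vocal_levels:
        if not 0.0 <= level <= 1.0:  # assumed valid range
            raise ValueError(f"vocal level out of range: {level}")
    body = json.dumps({"url": video_url, "vocal_levels": list(vocal_levels)})
    return urllib.request.Request(
        base_url.rstrip("/") + "/process",
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def video_urls_by_level(response_body: str) -> Dict[float, str]:
    """Map each vocal level in the response to its video URL."""
    payload = json.loads(response_body)
    return {video["vocal_level"]: video["url"] for video in payload["videos"]}
```

To send the request, pass the result to `urllib.request.urlopen(request)` and feed the decoded response body to `video_urls_by_level`.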

Architecture

YouTube URL → yt-dlp → Docker Spleeter → OpenAI Whisper → FFmpeg → Cloud Storage
                ↓             ↓                ↓            ↓
           Video File Vocal/Instrumental   Subtitles  Karaoke Video

Video Pipeline

1. Download - Extract video from YouTube URL
2. Separate - Use Docker Spleeter for ML-based audio separation
3. Transcribe - OpenAI Whisper for word-level timestamps
4. Subtitle - Create synchronized ASS subtitles with highlighting
5. Mix - Combine vocals/instrumentals at specified levels
6. Render - Generate final karaoke video with FFmpeg
7. Upload - Store in Google Cloud Storage
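
The Mix and Render steps can be sketched as a single FFmpeg invocation: scale the vocal stem to the requested level, sum it with the instrumental, and mux the result with the original video stream. This is an illustrative sketch, not the project's actual command; the function name and file arguments are hypothetical, subtitle burn-in is omitted, and the `normalize=0` option of `amix` assumes FFmpeg 4.4 or newer.

```python
from typing import List


def mix_command(
    video: str, instrumental: str, vocals: str, vocal_level: float, output: str
) -> List[str]:
    """Build an ffmpeg command that mixes vocals back in at vocal_level (0.0-1.0)."""
    if not 0.0 <= vocal_level <= 1.0:
        raise ValueError(f"vocal level out of range: {vocal_level}")
    filter_graph = (
        # Input 2 is the vocal stem: attenuate it to the requested level.
        f"[2:a]volume={vocal_level:.2f}[v];"
        # Sum with the instrumental (input 1); normalize=0 keeps amix from rescaling.
        "[1:a][v]amix=inputs=2:duration=longest:normalize=0[mix]"
    )
    return [
        "ffmpeg", "-y",
        "-i", video, "-i", instrumental, "-i", vocals,
        "-filter_complex", filter_graph,
        "-map", "0:v", "-map", "[mix]",  # original picture, mixed audio
        "-c:v", "copy", "-c:a", "aac",
        output,
    ]
```

Returning the command as a list keeps it safe to pass to `subprocess.run(cmd, check=True)` without shell quoting. Copying the video stream avoids a re-encode, which would no longer be possible once subtitles are burned in.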

Technologies

Python Spleeter - ML audio separation (replaced Docker-in-Docker)
OpenAI Whisper - Speech transcription
yt-dlp - YouTube video downloading
FFmpeg - Video/audio processing
Flask - Web API framework
Google Cloud Run - Serverless deployment
Google Cloud Storage - Video storage

GitHub Auto-Deploy

✅ Trigger "soniqpush" is active: pushes to the main branch deploy to Cloud Run automatically.

Examples

Created karaoke videos with various vocal levels:

punjaban_karaoke_0_vocal.mp4 - Pure instrumental
punjaban_karaoke_25_vocal.mp4 - Light vocal guide
punjaban_karaoke_50_vocal.mp4 - Balanced mix
punjaban_karaoke_75_vocal.mp4 - Strong vocal guide

Perfect for different karaoke preferences and skill levels!
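
The file names above follow a simple pattern: an optional song prefix, then the vocal level as a whole-number percentage. A small helper along these lines (the function name is hypothetical) reproduces that pattern from a 0.0 to 1.0 level:

```python
def karaoke_filename(vocal_level: float, prefix: str = "") -> str:
    """Name an output file after its vocal level, e.g. 0.25 -> karaoke_25_vocal.mp4."""
    percent = int(round(vocal_level * 100))
    name = f"karaoke_{percent}_vocal.mp4"
    return f"{prefix}_{name}" if prefix else name
```

Rounding before converting to an integer avoids off-by-one names caused by floating-point values such as `0.1 * 100`.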

License

MIT License - See LICENSE file for details.

