CAAL

Local voice assistant with n8n workflow integrations and Home Assistant control

Built on LiveKit Agents. Runs fully locally with Ollama + Speaches + Kokoro, or without a GPU using Groq + Piper.

Features

First-Start Wizard - Configure everything from the browser; only one edit in .env is required
Flexible Providers - Ollama (local) or Groq (cloud) for LLM/STT, Kokoro or Piper for TTS
Home Assistant - Native MCP integration with simplified hass_control and hass_get_state tools
n8n Workflows - Extensible LLM tooling: any n8n workflow can become a tool for CAAL
Wake Word Detection - "Hey Cal" activation via OpenWakeWord (server-side)
Web Search - DuckDuckGo integration for real-time information
Webhook API - External triggers for announcements and tool reload
Mobile App - Flutter client for Android and iOS

Quick Start

git clone https://github.com/CoreWorxLab/caal.git
cd caal
cp .env.example .env
nano .env  # Set CAAL_HOST_IP to your server's LAN IP

# GPU mode (Ollama + Kokoro)
docker compose up -d

# CPU-only mode (Groq + Piper) - no GPU required
docker compose -f docker-compose.cpu.yaml up -d

Open http://YOUR_SERVER_IP:3000 and complete the setup wizard.
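If you are unsure which address to put in CAAL_HOST_IP, the sketch below asks the operating system which local IPv4 address it would use for outbound traffic and formats the matching .env line. It is an illustration only: `guess_lan_ip` and `env_line` are hypothetical helpers, not part of CAAL, and on hosts with several interfaces (VPN, Tailscale, Docker bridges) the result may not be the address you want.

```python
import ipaddress
import socket


def guess_lan_ip(probe_host: str = "192.0.2.1", probe_port: int = 80) -> str:
    """Return the local IPv4 address the OS would pick for outbound traffic.

    Connecting a UDP socket sends no packets; it only makes the kernel choose
    a route, which reveals the source address. Falls back to 127.0.0.1 when
    no route exists (for example, on an offline machine).
    """
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        try:
            sock.connect((probe_host, probe_port))
            return sock.getsockname()[0]
        except OSError:
            return "127.0.0.1"


def env_line(ip: str) -> str:
    """Format the required .env entry, rejecting malformed addresses."""
    ipaddress.IPv4Address(ip)  # raises ValueError on invalid input
    return f"CAAL_HOST_IP={ip}"


if __name__ == "__main__":
    print(env_line(guess_lan_ip()))
```

Compare the printed address with your router's client list before copying it into .env.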

Mode	Hardware	Command or guide
GPU	Linux + NVIDIA GPU	`docker compose up -d`
CPU-only	Any Docker host	`docker compose -f docker-compose.cpu.yaml up -d`
Apple Silicon	M1/M2/M3/M4 Mac	See docs/APPLE-SILICON.md
Distributed	GPU Server + Mac	See docs/DISTRIBUTED-DEPLOYMENT.md

GPU Mode (NVIDIA Linux)

Full local stack with GPU-accelerated STT (Speaches), LLM (Ollama) and TTS (Kokoro).

Requirements

Docker with NVIDIA Container Toolkit
12GB+ VRAM recommended

Installation

git clone https://github.com/CoreWorxLab/caal.git
cd caal
cp .env.example .env
nano .env  # Set CAAL_HOST_IP

docker compose up -d

The setup wizard will guide you through LLM (Ollama), TTS, and integration configuration.

CPU-Only Mode (No GPU)

Run CAAL without a GPU using Groq for LLM/STT and Piper (CPU) for TTS.

docker compose -f docker-compose.cpu.yaml up -d

For HTTPS:

docker compose -f docker-compose.cpu.yaml --profile https up -d

In the setup wizard:

Select Groq as LLM provider and enter your free API key
Select Piper as TTS provider (models download automatically)

Note: Voice data is sent to Groq's API. For fully local operation, use GPU mode with Ollama.

Apple Silicon (macOS)

CAAL runs on Apple Silicon Macs using mlx-audio for Metal-accelerated STT/TTS.

./start-apple.sh

See docs/APPLE-SILICON.md for full setup instructions.

Distributed Deployment

Run the GPU-intensive backend on a Linux server while using the frontend on a Mac or another device.

See docs/DISTRIBUTED-DEPLOYMENT.md for full setup instructions.

Network Modes

CAAL supports three network configurations:

Mode	Voice From	Access URL	Command
LAN HTTP	Host machine only	`http://localhost:3000`	`docker compose up -d`
LAN HTTPS	Any LAN device	`https://192.168.1.100:3443`	`docker compose --profile https up -d`
Tailscale	Anywhere	`https://your-machine.tailnet.ts.net:3443`	`docker compose --profile https up -d`

Why? Browsers block microphone access on HTTP except from localhost. HTTPS is required for voice from other devices.

Note: When the mobile app is the client, LAN HTTP is sufficient; HTTPS is not required.
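The browser rule behind these modes can be sketched as a small check. This is a simplified approximation of how browsers decide whether a page may use the microphone, not code from CAAL; `mic_allowed` and `LOOPBACK_HOSTS` are hypothetical names, and real browsers also accept a few other loopback forms.

```python
from urllib.parse import urlparse

# Hosts that browsers treat as trustworthy even over plain HTTP (simplified).
LOOPBACK_HOSTS = {"localhost", "127.0.0.1", "::1"}


def mic_allowed(page_url: str) -> bool:
    """Return True if a browser would likely allow microphone access on this URL."""
    parts = urlparse(page_url)
    if parts.scheme == "https":
        return True
    return parts.scheme == "http" and parts.hostname in LOOPBACK_HOSTS


if __name__ == "__main__":
    for url in (
        "http://localhost:3000",
        "http://192.168.1.100:3000",
        "https://192.168.1.100:3443",
    ):
        print(url, mic_allowed(url))
```

This is why the LAN HTTP mode only supports voice from the host machine itself, while the HTTPS and Tailscale modes work from other devices.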

LAN HTTP (Default)

CAAL_HOST_IP=192.168.1.100  # Set in .env
docker compose up -d

LAN HTTPS

Self-signed certificates are auto-generated if none exist in ./certs/.

# Configure .env
CAAL_HOST_IP=192.168.1.100
HTTPS_DOMAIN=192.168.1.100

# Start with HTTPS profile (certs auto-generated)
docker compose --profile https up -d

Access: https://192.168.1.100:3443

Trusted certs: For browser-trusted certs without warnings, use mkcert:
mkcert -install && mkcert 192.168.1.100
mkdir -p certs && mv 192.168.1.100.pem certs/server.crt && mv 192.168.1.100-key.pem certs/server.key
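Before starting the HTTPS profile with your own certificates, it can help to confirm that both files ended up under the names used in the commands above. The sketch below only checks for `certs/server.crt` and `certs/server.key`; `missing_cert_files` is a hypothetical helper, and it does not verify that the certificate and key match each other.

```python
from pathlib import Path

# File names taken from the mv commands above.
REQUIRED_CERT_FILES = ("server.crt", "server.key")


def missing_cert_files(certs_dir: str = "certs") -> list[str]:
    """Return the required certificate files that are absent from certs_dir."""
    base = Path(certs_dir)
    return [name for name in REQUIRED_CERT_FILES if not (base / name).is_file()]


if __name__ == "__main__":
    missing = missing_cert_files()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("Both certificate files are present.")
```

If files are missing and you did not intend to supply your own, CAAL generates self-signed certificates on start, as described above.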

Tailscale (Remote Access)

# Generate Tailscale certs
tailscale cert your-machine.tailnet.ts.net
mkdir -p certs && mv your-machine.tailnet.ts.net.crt certs/server.crt && mv your-machine.tailnet.ts.net.key certs/server.key

# Configure .env
CAAL_HOST_IP=100.x.x.x                         # tailscale ip -4
HTTPS_DOMAIN=your-machine.tailnet.ts.net

# Start
docker compose --profile https up -d

Access: https://your-machine.tailnet.ts.net:3443

Configuration

Environment Variables

Only CAAL_HOST_IP is required. Everything else is configured via the web UI.

Variable	Description	Required
`CAAL_HOST_IP`	Your server's LAN/Tailscale IP	Yes
`HTTPS_DOMAIN`	Domain for HTTPS mode	No

See .env.example for additional options (ports, default models).
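A quick sanity check of .env can catch a placeholder left in CAAL_HOST_IP. The following is a minimal sketch with a deliberately naive parser (it treats everything after `#` as a comment and does not handle quoting); `parse_env` and `check_env` are hypothetical helpers, not part of CAAL.

```python
import ipaddress


def parse_env(text: str) -> dict[str, str]:
    """Parse KEY=VALUE lines, ignoring blank lines and '#' comments (naive)."""
    values: dict[str, str] = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()
        if "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


def check_env(text: str) -> list[str]:
    """Return a list of problems; an empty list means the required value looks valid."""
    problems = []
    host_ip = parse_env(text).get("CAAL_HOST_IP", "")
    try:
        ipaddress.ip_address(host_ip)
    except ValueError:
        problems.append("CAAL_HOST_IP must be set to an IP address")
    return problems


if __name__ == "__main__":
    with open(".env", encoding="utf-8") as handle:
        print(check_env(handle.read()) or "OK")
```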

Settings Panel

After setup, click the gear icon to access the settings panel:

Agent - Agent name, voice selection, wake greetings
Prompt - Default or custom system prompt
Providers - LLM provider (Ollama/Groq), TTS provider (Kokoro/Piper)
LLM Settings - Temperature, context size, max turns, turn detection settings
Integrations - Home Assistant and n8n connection configuration
Wake Word - Enable/disable, model selection, threshold, timeout

Integrations

Home Assistant

Control your smart home with voice commands. CAAL exposes two simplified tools:

hass_control(action, target, value) - Control devices
- Actions: turn_on, turn_off, set_volume, volume_up, volume_down, mute, unmute, pause, play, next, previous
- Value: 0-100 for set_volume
hass_get_state(target) - Query device states

Setup:

Create a Long-Lived Access Token in Home Assistant
In CAAL settings, enable Home Assistant and enter your host URL and token
Restart the agent - CAAL auto-discovers your devices

See docs/HOME-ASSISTANT.md for action mappings and examples.
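To make the argument rules above concrete, the sketch below validates a hass_control call against the documented action list and the 0-100 range for set_volume. It is not CAAL's implementation: `validate_hass_control` and the returned dict shape are hypothetical, and dropping `value` for actions other than set_volume is an assumption.

```python
from typing import Optional

# Actions listed in this README for hass_control.
ACTIONS = frozenset({
    "turn_on", "turn_off", "set_volume", "volume_up", "volume_down",
    "mute", "unmute", "pause", "play", "next", "previous",
})


def validate_hass_control(action: str, target: str, value: Optional[int] = None) -> dict:
    """Check a hass_control call against the documented rules (hypothetical helper)."""
    if action not in ACTIONS:
        raise ValueError(f"unknown action: {action!r}")
    if not target.strip():
        raise ValueError("target must not be empty")
    call = {"action": action, "target": target}
    if action == "set_volume":
        # The README documents value only for set_volume, as 0-100.
        if value is None or not 0 <= value <= 100:
            raise ValueError("set_volume needs a value from 0 to 100")
        call["value"] = value
    return call


if __name__ == "__main__":
    print(validate_hass_control("set_volume", "kitchen speaker", 40))
```

The target names used here ("kitchen speaker") are made up; real targets come from the devices CAAL discovers in your Home Assistant instance.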

n8n Workflows

Extend CAAL with any API, database, or service via n8n workflows exposed through MCP.

Set up n8n:

Enable MCP: Settings > MCP Access > Enable MCP
Set connection method to Access Token and copy the token
In CAAL settings, enable n8n and enter your MCP URL and token

Import example workflows:

cd n8n-workflows
cp config.env.example config.env
nano config.env  # Set your n8n IP and API key
python setup.py  # Creates all workflows

See docs/N8N-WORKFLOWS.md for how to create your own workflows.

Wake Word Detection

Enable the "Hey Cal" wake word in the settings panel. Two engines are available:

OpenWakeWord (Server-side) - Runs on the server, works with any client
Picovoice (Client-side) - Requires access key and trained model per device

Webhook API

The agent exposes a REST API on port 8889 for external integrations.

Core Endpoints:

Endpoint	Method	Description
`/announce`	POST	Make CAAL speak a message
`/wake`	POST	Trigger wake word greeting
`/reload-tools`	POST	Refresh MCP tool cache
`/health`	GET	Health check

Settings & Configuration:

Endpoint	Method	Description
`/settings`	GET/POST	Read/update settings
`/prompt`	GET/POST	Read/update system prompt
`/voices`	GET	List available TTS voices
`/models`	GET	List available Ollama models

Wake Word:

Endpoint	Method	Description
`/wake-word/status`	GET	Get wake word status
`/wake-word/enable`	POST	Enable wake word detection
`/wake-word/disable`	POST	Disable wake word detection
`/wake-word/models`	GET	List available wake word models

curl -X POST http://localhost:8889/announce \
  -H "Content-Type: application/json" \
  -d '{"message": "Package delivered"}'
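The same /announce call can be made from Python using only the standard library, which is convenient inside automation scripts. This is a sketch under the assumptions already visible in the curl example (port 8889, a JSON body with a `message` field); `build_announce_request` and `announce` are hypothetical helper names, and the response body format is not documented here, so only the HTTP status is returned.

```python
import json
import urllib.request


def build_announce_request(
    message: str, base_url: str = "http://localhost:8889"
) -> urllib.request.Request:
    """Build the POST /announce request without sending it."""
    body = json.dumps({"message": message}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/announce",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def announce(message: str, base_url: str = "http://localhost:8889", timeout: float = 5.0) -> int:
    """Send the announcement and return the HTTP status code."""
    request = build_announce_request(message, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status


if __name__ == "__main__":
    print(announce("Package delivered"))
```

Splitting request construction from sending keeps the payload easy to inspect or log before anything reaches the agent.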

Mobile App

The Android app is available from GitHub Releases. Download the APK and install it on your device.

Building from source:

cd mobile
flutter pub get
flutter build apk

See mobile/README.md for full documentation.

Development

# Install dependencies
uv sync

# Start infrastructure
docker compose up -d livekit speaches kokoro

# Run agent locally
uv run voice_agent.py dev

# Run frontend locally
cd frontend && pnpm install && pnpm dev

Commands:

uv run ruff check src/   # Lint
uv run mypy src/         # Type check
uv run pytest            # Test

Architecture

┌───────────────────────────────────────────────────────────────────────┐
│  Docker Compose Stack                                                 │
│                                                                       │
│  ┌────────────┐  ┌────────────┐  ┌────────────┐  ┌────────────┐       │
│  │  Frontend  │  │  LiveKit   │  │  Speaches  │  │Kokoro/Piper│       │
│  │  (Next.js) │  │   Server   │  │(STT, GPU)  │  │  (TTS)     │       │
│  │   :3000    │  │   :7880    │  │   :8000    │  │   :8880    │       │
│  └─────┬──────┘  └─────┬──────┘  └─────┬──────┘  └─────┬──────┘       │
│        │               │               │               │              │
│        │               └───────────────┼───────────────┘              │
│        └───────────────────────┐       │                              │
│                                │       │                              │
│                          ┌─────┴───────┴─────┐                        │
│                          │       Agent       │                        │
│                          │  (Voice Pipeline) │                        │
│                          │  :8889 (webhooks) │                        │
│                          └─────────┬─────────┘                        │
│                                    │                                  │
└────────────────────────────────────┼──────────────────────────────────┘
                                     │
            ┌────────────────────────┼────────────────────────┐
            │                        │                        │
      ┌─────┴─────┐           ┌──────┴──────┐          ┌──────┴──────┐
      │Ollama/Groq│           │     n8n     │          │    Home     │
      │   (LLM)   │           │  Workflows  │          │  Assistant  │
      └───────────┘           └─────────────┘          └─────────────┘
                       External Services (via MCP)

Troubleshooting

WebRTC Not Connecting

Check that CAAL_HOST_IP matches your network mode
Verify firewall ports: 3000, 7880, 7881, 50000-50100 (UDP)
Check logs: docker compose logs livekit | grep -i "ice\|error"
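To test the firewall from another machine on the LAN, a plain TCP connect is often enough. The sketch below assumes that 3000, 7880 and 7881 are TCP ports and that only the 50000-50100 range is UDP; it does not probe the UDP range, since a connect-style check cannot confirm UDP reachability. `tcp_port_open` and `closed_ports` are hypothetical helpers.

```python
import socket

# Assumed TCP ports from the checklist above; the UDP range is not probed.
TCP_PORTS = (3000, 7880, 7881)


def tcp_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def closed_ports(host: str, ports=TCP_PORTS) -> list[int]:
    """Return the ports on host that did not accept a TCP connection."""
    return [port for port in ports if not tcp_port_open(host, port)]


if __name__ == "__main__":
    # 192.168.1.100 is the example address used elsewhere in this README.
    print("Unreachable TCP ports:", closed_ports("192.168.1.100"))
```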

Ollama Connection Failed

# Ensure Ollama binds to network
OLLAMA_HOST=0.0.0.0 ollama serve

# From Docker, use host.docker.internal
OLLAMA_HOST=http://host.docker.internal:11434
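To confirm that Ollama is reachable at the address you configured, you can query its model list endpoint (`/api/tags`). The sketch below normalises an OLLAMA_HOST-style value into that URL and fetches the model names; `ollama_tags_url` and `list_models` are hypothetical helpers, the default port 11434 is taken from the example above, and IPv6 literals are not handled.

```python
import json
import urllib.request
from urllib.parse import urlparse

DEFAULT_OLLAMA_PORT = 11434


def ollama_tags_url(ollama_host: str) -> str:
    """Turn 'host', 'host:port' or 'http://host:port' into the /api/tags URL."""
    raw = ollama_host.strip()
    if "://" not in raw:
        raw = f"http://{raw}"
    parts = urlparse(raw)
    port = parts.port or DEFAULT_OLLAMA_PORT
    return f"{parts.scheme}://{parts.hostname}:{port}/api/tags"


def list_models(ollama_host: str, timeout: float = 5.0) -> list[str]:
    """Return the names of models the Ollama server reports as installed."""
    with urllib.request.urlopen(ollama_tags_url(ollama_host), timeout=timeout) as response:
        payload = json.load(response)
    return [model["name"] for model in payload.get("models", [])]


if __name__ == "__main__":
    print(list_models("http://host.docker.internal:11434"))
```

Run it from the same network context as the agent (for example, inside the container) so the result reflects what CAAL actually sees.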

First Start Is Slow

This is normal: models download on first run (about 2-5 minutes). Follow the download progress with:

docker compose logs -f speaches kokoro
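Instead of watching the logs, a script can poll the agent's /health endpoint until it responds. This sketch assumes that any 2xx status from /health means the agent is up; that does not necessarily mean every model has finished downloading. `http_ok` and `wait_until_healthy` are hypothetical helpers, and the defaults (60 attempts, 5 seconds apart) are chosen to cover roughly five minutes.

```python
import time
import urllib.error
import urllib.request
from typing import Callable


def http_ok(url: str, timeout: float = 3.0) -> bool:
    """Return True if a GET to url answers with a 2xx status."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return 200 <= response.status < 300
    except (urllib.error.URLError, OSError):
        return False


def wait_until_healthy(
    url: str = "http://localhost:8889/health",
    attempts: int = 60,
    delay: float = 5.0,
    probe: Callable[[str], bool] = http_ok,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll url until probe succeeds or attempts run out; sleep between tries."""
    for attempt in range(attempts):
        if probe(url):
            return True
        if attempt < attempts - 1:
            sleep(delay)
    return False


if __name__ == "__main__":
    print("ready" if wait_until_healthy() else "still not healthy")
```

The `probe` and `sleep` parameters exist so the loop can be exercised without a running agent or real delays.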

Integration Connection Errors

If Home Assistant or n8n fails to connect, you'll see a toast notification with the error. Check:

Host URL is reachable from the Docker container
Access token is valid and has correct permissions
For n8n: MCP Access is enabled in Settings

Related Projects

LiveKit Agents - Voice agent framework
Speaches - Faster-Whisper STT server (also includes Piper TTS)
Kokoro-FastAPI - Kokoro TTS server
Piper - Fast local TTS (CPU-friendly)
mlx-audio - STT/TTS for Apple Silicon
Ollama - Local LLM server
Groq - Fast cloud LLM inference (free tier available)
n8n - Workflow automation
Home Assistant - Smart home platform

Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines.

License

MIT License - see LICENSE for details.
