VideoDB is the modern agentic backend for video and audio.
It unifies storage, indexing, streaming, and editing into a single system built for continuous media.
With universal ingestion, it accepts files, live streams, and real-time capture from any source, and converts them into structured, searchable context.
It provides memory and recall at the scene level, allowing agents to retrieve precise moments instead of entire files.
VideoDB also enables real-time stream processing, where video can be modified, enriched, and delivered dynamically as a stream.
Your agents can read text and static images. But the real world is live, continuous, and always changing. To operate with real context, your agent needs real-time access to video calls, camera feeds, screen recordings, and live internet streams.

    npx skills add video-db/skills

Or try our interactive notebooks:
- See → Understand → Act Quickstart
- Multi-Camera Surveillance Demo
Build AI applications that understand and process video as structured data.
| Repository | Description |
|---|---|
| skills | Server-side video workflows for agents — capture, process, understand, generate, stream |
| videodb-python | Official Python SDK — upload, index, search, edit, and stream video programmatically |
| videodb-node | Official Node.js/TypeScript SDK |
| agent-toolkit | MCP server + LLM context files for seamless AI IDE integration |
Build intelligent video agents and orchestrate complex workflows.
| Repository | Description |
|---|---|
| Director ⭐ 1.4k | AI video agents framework — 20+ pre-built agents for summarization, search, clipping, script-to-movie, dubbing, and more |
VideoDB turns continuous media into real-time context, searchable memory, and event-driven actions, enabling agents to see, understand, and act. Check out these screen-aware, voice-aware AI agents built on VideoDB.
| Repository | Description |
|---|---|
| call.md | Transform meetings into agentic loops — record, transcribe, analyze with real-time AI |
| bloom | Open-source agentic Loom alternative — local-first recording with AI-powered workflows |
| pair-programmer | AI coding assistant that sees your screen and hears your voice — works with Claude Code, Cursor, Codex |
Open-source applications showcasing what's possible with VideoDB.
| Repository | Description |
|---|---|
| agentic-streams | Autonomous agents that research the web and generate personalized video briefings |
| PromptClip | Create video clips instantly from natural language prompts |
| videodb-capture-quickstart | Real-time desktop capture SDK for AI agent perception |
Explore our comprehensive collection of examples and use cases:
| Category | Description |
|---|---|
| Agentic Workflows | Pair Programmer, OpenClaw Monitoring, Call.md, Bloom, Focusd |
| Video RAG & Search | Keyword Search, Multimodal Search, Character Clips, Conference Slides |
| Live Intelligence | Intrusion Detection, Baby Monitoring, Road Monitoring, Sports Analytics |
| Content Factory | Faceless Videos, AI Dubbing, Voiceovers, Text-to-Video |
| Programmatic Editing | Intro/Outro, Brand Elements, Dynamic Ads, Audio Overlay |
| AI-Powered Safety | Content Moderation, Profanity Detection, Copyright Compliance |
Browse all examples → | VideoDB Cookbook →
Connect VideoDB to your existing workflows.
| Repository | Description |
|---|---|
| n8n-nodes-videodb | n8n workflow nodes |
| videodb-zapier-integration | Zapier integration |
- Real-time & streaming-first — process live streams, RTSP, desktop capture, not just files
- Playable outputs — retrieval returns video evidence, not just metadata
- Programmable understanding — indexes-as-code, evolving over time
- Built for agents — the perception, memory, and action layer between storage and reasoning
Read: Why AI Agents Are Blind →

    pip install videodb

    from videodb import connect

    # connect() reads the API key from the VIDEO_DB_API_KEY environment variable
    conn = connect()
    coll = conn.get_collection()

    # Upload and index
    video = coll.upload(url="https://www.youtube.com/watch?v=...")
    video.index_spoken_words()

    # Search with playable results
    results = video.search("key moments")
    results.play()

Read the docs → | API Reference →
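
Scene-level recall means a search yields time ranges inside a video rather than whole files. As a minimal sketch of what an agent might do with such ranges, the snippet below merges overlapping hits into a compact list of `(start, end)` segments. It assumes each hit carries a start and end time in seconds; `Moment`, `merge_moments`, and `padding` are hypothetical names used for illustration and are not part of the VideoDB SDK.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass
class Moment:
    """Hypothetical stand-in for one search hit: a time range in seconds."""
    start: float
    end: float


def merge_moments(
    moments: Iterable[Moment], padding: float = 0.0
) -> List[Tuple[float, float]]:
    """Merge overlapping or touching moments into sorted (start, end) ranges.

    `padding` (an illustrative parameter) widens each moment on both sides
    before merging, clamped so that no range starts before 0 seconds.
    """
    ranges = sorted(
        (max(0.0, m.start - padding), m.end + padding) for m in moments
    )
    merged: List[Tuple[float, float]] = []
    for start, end in ranges:
        if merged and start <= merged[-1][1]:
            # Overlaps (or touches) the previous range: extend it.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


# Example with made-up timestamps: the first two hits overlap.
hits = [Moment(12.0, 18.5), Moment(17.0, 25.0), Moment(40.0, 44.0)]
timeline = merge_moments(hits)
print(timeline)  # [(12.0, 25.0), (40.0, 44.0)]
```

The merged ranges could then be passed to whatever clip or stream generation call the SDK exposes; check the API Reference for the exact method and argument shape before relying on it.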