Skip to content
@video-db

VideoDB

The perception, memory, and action layer for AI agents

Logo

VideoDB is the modern agentic backend for video and audio.


It unifies storage, indexing, streaming, and editing into a single system built for continuous media.

With universal ingestion, it accepts files, live streams, and real-time capture from any source, and converts them into structured, searchable context.

It provides memory and recall at the scene level, allowing agents to retrieve precise moments instead of entire files.

VideoDB also enables real-time stream processing, where video can be modified, enriched, and delivered dynamically over video streams.


Quick Start

Your agents can read text and static images. But the real world is live, continuous, and always changing. To operate with real context, your agent needs real-time access to video calls, camera feeds, screen recordings, and live internet streams.

npx skills add video-db/skills

Or try our interactive notebooks:

Open Quickstart in Colab See → Understand → Act Quickstart

Open Multicam in Colab Multi-Camera Surveillance Demo


SDKs & Agent Skills

Build AI applications that understand and process video as structured data.

skills Server-side video workflows for agents — capture, process, understand, generate, stream
videodb-python Official Python SDK — upload, index, search, edit, and stream video programmatically
videodb-node Official Node.js/TypeScript SDK
agent-toolkit MCP server + LLM context files for seamless AI IDE integration

Agent Frameworks

Build intelligent video agents and orchestrate complex workflows.

Director ⭐ 1.4k AI video agents framework — 20+ pre-built agents for summarization, search, clipping, script-to-movie, dubbing, and more

Agents

VideoDB turns continuous media into real-time context, searchable memory, and event-driven actions — enabling agents to see, understand, and act. Checkout Screen-aware, voice-aware AI agents built on VideoDB.

call.md Transform meetings into agentic loops — record, transcribe, analyze with real-time AI
bloom Open-source agentic Loom alternative — local-first recording with AI-powered workflows
pair-programmer AI coding assistant that sees your screen and hears your voice — works with Claude Code, Cursor, Codex

Reference Systems

Open-source applications showcasing what's possible with VideoDB.

agentic-streams Autonomous agents that research the web and generate personalized video briefings
PromptClip Create video clips instantly from natural language prompts
videodb-capture-quickstart Real-time desktop capture SDK for AI agent perception

Examples & Tutorials

Explore our comprehensive collection of examples and use cases:

Category Description
Agentic Workflows Pair Programmer, OpenClaw Monitoring, Call.md, Bloom, Focusd
Video RAG & Search Keyword Search, Multimodal Search, Character Clips, Conference Slides
Live Intelligence Intrusion Detection, Baby Monitoring, Road Monitoring, Sports Analytics
Content Factory Faceless Videos, AI Dubbing, Voiceovers, Text-to-Video
Programmatic Editing Intro/Outro, Brand Elements, Dynamic Ads, Audio Overlay
AI-Powered Safety Content Moderation, Profanity Detection, Copyright Compliance

Browse all examples →   |   VideoDB Cookbook →


Integrations

Connect VideoDB to your existing workflows.

n8n-nodes-videodb n8n workflow nodes
videodb-zapier-integration Zapier integration


Why VideoDB?

  • Real-time & streaming-first — process live streams, RTSP, desktop capture, not just files
  • Playable outputs — retrieval returns video evidence, not just metadata
  • Programmable understanding — indexes-as-code, evolving over time
  • Built for agents — the perception, memory, and action layer between storage and reasoning

Read: Why AI Agents Are Blind →


Get Started

pip install videodb
from videodb import connect

conn = connect()
coll = conn.get_collection()

# Upload and index
video = coll.upload(url="https://www.youtube.com/watch?v=...")
video.index_spoken_words()

# Search with playable results
results = video.search("key moments")
results.play()

Read the docs →  |  API Reference →


WebsiteDocsDiscordTwitterLinkedInYouTube

Pinned Loading

  1. Director Director Public

    AI video agents framework for next-gen video interactions and workflows.

    Python 1.4k 222

  2. skills skills Public

    Server-side video workflows for agents: ingest, understand, search, edit, stream.

    Python 69 2

  3. pair-programmer pair-programmer Public

    Pair Programmer for Claude Code, Codex and Cursor

    JavaScript 116 21

  4. bloom bloom Public

    An open source, agentic Loom alternative. Record locally, make recordings AI-ready, and run workflows on top

    JavaScript 186 32

  5. call.md call.md Public

    Turn meetings into live agent loops. Record, transcribe, and analyze meetings with real-time AI intelligence — before, during, and after calls.

    TypeScript 255 26

  6. agentic-streams agentic-streams Public

    Create & stream personalized agentic video briefings on any topic.

    Python 34 6

Repositories

Showing 10 of 33 repositories

Top languages

Loading…

Most used topics

Loading…