A concept demonstration of how intelligent caching can dramatically accelerate browser automation agents by storing and reusing action plans, eliminating the need for repeated LLM "thinking" about common tasks.
The main files are in `/bday`.
Instead of having the LLM plan every action from scratch, this system:
- Caches Action Plans: Stores successful action sequences in a database
- Retrieves Similar Plans: Finds cached plans for similar tasks using semantic similarity
- Executes Directly: Skips the planning phase and goes straight to execution
- Accelerates Performance: Reduces latency and token costs by avoiding redundant LLM calls
- Action Plan Caching: Stores granular subgoals and browser actions for reuse
- Semantic Retrieval: Finds similar cached plans using vector similarity (see the retrieval sketch after this list)
- Zero-Thinking Execution: Bypasses LLM planning for cached action sequences
- Multi-Level Caching: LLM responses, subgoal plans, and execution results
- Browser Automation: Uses Playwright to navigate Wikipedia pages
- Provider Flexibility: Supports OpenAI, Lightpanda, and OpenLLM providers
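To make the retrieval step concrete, here is a minimal sketch of matching a new question against cached plans by embedding similarity. The `embed_text` helper, the in-memory `cache` structure, and the 0.85 threshold are illustrative assumptions, not the repo's actual implementation.

```python
import numpy as np
from openai import OpenAI  # assumed embedding provider; any embedding model would do

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def embed_text(text: str) -> np.ndarray:
    """Return a unit-length embedding for `text` (illustrative helper, not the repo's)."""
    resp = client.embeddings.create(model="text-embedding-3-small", input=text)
    vec = np.array(resp.data[0].embedding)
    return vec / np.linalg.norm(vec)

def find_cached_plan(question: str, cache: list[dict], threshold: float = 0.85):
    """Return the most similar cached action plan, or None on a cache miss.

    `cache` holds entries shaped like {"embedding": np.ndarray, "actions": [...]}.
    """
    query = embed_text(question)
    best, best_score = None, threshold
    for entry in cache:
        score = float(np.dot(query, entry["embedding"]))  # cosine similarity of unit vectors
        if score >= best_score:
            best, best_score = entry, score
    return best["actions"] if best else None
```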
- Install dependencies:

  ```bash
  pip install -r requirements.txt
  playwright install chromium
  ```

- Set up environment variables (create a `.env` file):

  ```bash
  # Required: Choose one LLM provider
  OPENAI_API_KEY=sk-your-key-here
  OPENAI_MODEL=gpt-4o-mini

  # Optional: Lightpanda browser service
  LIGHTPANDA_TOKEN=your-token-here
  ```

- Run the agent:

  ```bash
  cd bday
  python t_agent.py "When was Marie Curie born?"
  ```
More example invocations:

```bash
# Basic research question
python t_agent.py "What year was Einstein born?"

# Force new plan (bypass cache)
python t_agent.py "Compare Taylor Swift and Beyoncé's Grammy wins" --force-plan

# Preview stored plan without execution
python t_agent.py "When did World War II end?" --plan-preview

# Run in headless mode
python t_agent.py "Who invented the telephone?" --headless

# Show cache statistics
python t_agent.py "What is photosynthesis?" --show-counts
```

On the first run (cache miss), the agent works through the full pipeline (a Playwright execution sketch follows this list):

- Planning Phase: LLM breaks down the question into specific subgoals
- Action Generation: LLM creates concrete browser actions for each subgoal
- Execution: Playwright automates the browser to collect information
- Caching: Successful action sequences are stored in the database
- Answer Extraction: LLM synthesizes collected data into final answer
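The execution phase can be sketched with Playwright's sync API as below; the action schema (`op`, `url`, `selector`) is an illustrative format, not necessarily what `t_agent.py` stores.

```python
from playwright.sync_api import sync_playwright

def execute_actions(actions: list[dict], headless: bool = True) -> list[str]:
    """Replay a cached list of browser actions and return any extracted text."""
    collected = []
    with sync_playwright() as p:
        browser = p.chromium.launch(headless=headless)
        page = browser.new_page()
        for step in actions:
            if step["op"] == "goto":
                page.goto(step["url"])
            elif step["op"] == "click":
                page.click(step["selector"])
            elif step["op"] == "extract_text":
                collected.append(page.inner_text(step["selector"]))
        browser.close()
    return collected

# Illustrative cached plan for a birth-date question
plan = [
    {"op": "goto", "url": "https://en.wikipedia.org/wiki/Marie_Curie"},
    {"op": "extract_text", "selector": ".infobox"},
]
print(execute_actions(plan))
```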
On subsequent runs (cache hit), the agent short-circuits the planning work:

- Cache Lookup: System finds similar cached action plans using semantic similarity
- Direct Execution: Skips LLM planning, executes cached actions immediately
- Answer Extraction: LLM synthesizes collected data into final answer
Result: Dramatically faster execution with reduced token usage and lower costs
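Tying the two paths together, a cache-first loop might look like the sketch below. It reuses `client`, `embed_text`, `find_cached_plan`, and `execute_actions` from the earlier sketches; `plan_actions_with_llm` and the prompts are hypothetical stand-ins for the repo's planning calls.

```python
import json

def plan_actions_with_llm(question: str) -> list[dict]:
    """Hypothetical fallback planner: ask the LLM for a JSON list of browser actions."""
    resp = client.chat.completions.create(
        model="gpt-4o-mini",
        response_format={"type": "json_object"},
        messages=[{
            "role": "user",
            "content": f'Return JSON {{"actions": [...]}} with Wikipedia browser steps for: {question}',
        }],
    )
    return json.loads(resp.choices[0].message.content)["actions"]

def answer_question(question: str, cache: list[dict]) -> str:
    """Cache-first loop: reuse a similar stored plan, otherwise plan, execute, and cache."""
    actions = find_cached_plan(question, cache)      # semantic lookup (cache-hit path)
    if actions is None:                              # cache miss: pay for LLM planning once
        actions = plan_actions_with_llm(question)
        cache.append({"embedding": embed_text(question), "actions": actions})
    evidence = execute_actions(actions)              # Playwright replay of the plan
    final = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user",
                   "content": f"Question: {question}\nCollected text: {evidence}\nAnswer concisely."}],
    )
    return final.choices[0].message.content
```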
The system implements a three-tier caching strategy to maximize acceleration:
- LLM Cache: Stores and reuses LLM responses for similar prompts
- Subgoal Cache: Core innovation - Reuses complete action plans for similar research tasks
- Answer Cache: Stores final answers (currently disabled for fresh results)
- Cloud-Based Storage: Stores the cache in a structured Weaviate vector database (an illustrative schema sketch follows this list)
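To show how these tiers could be laid out in a simple embedded store (the `cachedb/` directory is described below as SQLite with vector embeddings; a Weaviate collection could hold the same fields), here is an illustrative schema. Table names, columns, and the file path are assumptions.

```python
import sqlite3

# Illustrative schema: one table per cache tier (names and path are assumptions).
SCHEMA = """
CREATE TABLE IF NOT EXISTS llm_cache (
    prompt_hash TEXT PRIMARY KEY,   -- exact or near-exact match key for repeated prompts
    response    TEXT NOT NULL
);
CREATE TABLE IF NOT EXISTS subgoal_cache (
    question  TEXT NOT NULL,
    embedding BLOB NOT NULL,        -- vector used for semantic similarity lookups
    actions   TEXT NOT NULL         -- JSON-encoded browser action plan
);
CREATE TABLE IF NOT EXISTS answer_cache (
    question  TEXT NOT NULL,
    embedding BLOB NOT NULL,
    answer    TEXT NOT NULL         -- tier currently disabled so results stay fresh
);
"""

conn = sqlite3.connect("cachedb/cache.sqlite3")  # hypothetical path
conn.executescript(SCHEMA)
conn.commit()
conn.close()
```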
Question: "When was Einstein born?"
Cache Lookup: Finds similar cached plan for "birth date research"
Result: Executes cached actions directly, skipping 2-3 LLM planning calls
Speed Improvement: ~70% faster execution, ~60% fewer tokens used
```bash
# Purge cache for specific question
python t_agent.py "Your question here" --purge

# Emergency cleanup of wrong answers
python t_agent.py --emergency-purge
```

The system automatically selects the best available LLM provider (see the sketch after the list below):
- OpenAI (recommended): Reliable JSON responses and usage tracking
- Lightpanda: Cloud-based browser automation
- OpenLLM: Self-hosted models
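A minimal sketch of what that auto-selection could look like: `FORCE_PROVIDER`, `OPENAI_API_KEY`, and `LIGHTPANDA_TOKEN` appear in this README, while the fallback order and `OPENLLM_ENDPOINT` are assumptions.

```python
import os

def select_provider() -> str:
    """Pick an LLM provider: honor FORCE_PROVIDER, otherwise fall back by availability."""
    forced = os.getenv("FORCE_PROVIDER")
    if forced:
        return forced
    if os.getenv("OPENAI_API_KEY"):      # recommended: reliable JSON and usage tracking
        return "openai"
    if os.getenv("LIGHTPANDA_TOKEN"):    # cloud-based browser service
        return "lightpanda"
    if os.getenv("OPENLLM_ENDPOINT"):    # assumed variable for a self-hosted OpenLLM server
        return "openllm"
    raise RuntimeError("No LLM provider configured")

print(select_provider())
```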
Force a specific provider:
```bash
export FORCE_PROVIDER=openai
export OPENAI_API_KEY=sk-...
export OPENAI_MODEL=gpt-4o-mini
```

Project structure:

- `t_agent.py` - Main automation script with caching logic
- `agent_core.py` - Core browser automation logic
- `llm_client.py` - Provider-agnostic LLM interface
- `cachedb/` - SQLite database with vector embeddings for semantic caching
- `cachedb_integrations/` - Cache adapters and integrations
This caching approach provides significant advantages for browser automation:
- Speed: 60-70% faster execution on cache hits
- Cost: 50-60% reduction in token usage
- Reliability: Proven action sequences reduce execution errors
- Scalability: Cache grows smarter with each successful execution
- Python 3.9+
- Playwright with Chromium
- LLM API access (OpenAI, Lightpanda, or OpenLLM)
- Internet connection for Wikipedia access