ARC-AGI-3 Agent Lab

My lab for learning AI agent tooling by building agents that play ARC-AGI-3 games.

The repo is based on the official ARC-AGI-3-Agents runner, with local adaptations for hands-on experimentation:

local/offline game runs
simple agent templates to modify
recordings for manual inspection
a small LLM smoke-test agent
tests for the runner and agent harness

Setup

Install dependencies with uv:

uv sync

Create local config:

cp .env.example .env

For local development, .env should usually contain:

OPERATION_MODE=offline

Add OPENAI_API_KEY to .env if you want to run LLM-based agents.

Run Agents

Run the baseline random agent:

uv run main.py --agent=random --game=ls20

Run the short LLM smoke-test agent:

uv run main.py --agent=smokellm --game=ls20

smokellm uses gpt-4o-mini and stops after 3 actions. It exists to verify that local execution and OpenAI credentials work before running longer experiments.

Local Mode

Local mode uses downloaded game files under environment_files/. It is faster and avoids online scorecard/replay calls.

If a game is not available locally yet, run once in normal or online mode to download it, then switch back to offline.

Available Agents

Print registered agents:

uv run python -c "from agents import AVAILABLE_AGENTS; print(sorted(AVAILABLE_AGENTS))"

Current useful starting points:

random
fastllm
llm
reasoningllm
guidedllm
langgraphrandom
langgraphfunc
langgraphtextonly
langgraphthinking
smokellm

Tests

uv run pytest

The tests cover the runner, agent contract, recordings, and basic template behavior. They do not evaluate agent quality.

Notes

Recordings are written to recordings/ and ignored by git.
Downloaded environments are written to environment_files/ and ignored by git.
Real secrets belong in .env; .env.example is only for placeholders.
Full frame dumps can be very token-heavy for LLM agents, so prompt compression is an important next area to explore.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
agents		agents
scripts		scripts
tests		tests
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
AGENTS.md		AGENTS.md
LICENSE		LICENSE
README.md		README.md
llms.txt		llms.txt
main.py		main.py
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ARC-AGI-3 Agent Lab

Setup

Run Agents

Local Mode

Available Agents

Tests

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ARC-AGI-3 Agent Lab

Setup

Run Agents

Local Mode

Available Agents

Tests

Notes

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages