cwc-workshops

Workshop materials. Not maintained and not accepting contributions.

Materials from Anthropic-run Code with Claude workshops.

Workshops

rightmodel/ — Picking the Right Model: use a Claude Code SKILL to audit an LLM eval suite and sweep it across models and inference parameters (extended thinking, effort) to find the best quality-per-dollar and quality-per-second configuration.
agent-decomposition/ — Compose Multi-Agent Systems with Skills and MCP: decompose a 400-line-prompt inventory agent into skills + code execution + callable_agents on Claude Managed Agents, with evals to verify each step.
how-we-claude-code/ — How We Claude Code: a three-phase walkthrough of an AI-assisted product workflow — interview to spec, four divergent design explorations as static HTML, and a Vite + React app whose components emit a machine-readable DOM contract so an agent (or CI) can verify them at runtime.
ship-your-first-managed-agent/ — Ship Your First Managed Agent: a Streamlit incident dashboard with an offline SRE Agent chat panel. You bring it online by implementing seven small functions in agent.py, each a single Claude Managed Agents API call — until it can grep a 70k-line log in its sandbox, call your local tools, and name the bad commit.
agent-battle/ — Agent Battle: a 45-minute competition to configure a Claude Managed Agent — system prompt, skills, MCP servers, model — that drives a local game bot over MCP. Most diamonds wins, fewest tokens breaks ties; a fast --eval decision-probe loop lets you test config changes in ~30s before committing to a 5-minute run.
agents-that-remember/ — Agents That Remember: start with a Managed Agent that's visibly amnesiac across sessions, then layer in memory primitives one at a time — a memory store for cross-session persistence, then the Dreaming Service to consolidate past transcripts — going "goldfish to colleague" in 45 minutes.
eval-driven-agent-development/ — Eval-Driven Agent Development: iterate a PPTX-generating Managed Agent through six variants (naive → visual → typography → palette → density → QA-loop), scoring each against a 10-task suite with a two-layer grader (programmatic .pptx XML metrics + LLM-as-judge on rendered slides) so every prompt change is measured, not vibed.
production-ready-agent/ — Deal Desk: a chat-first UI over a multi-agent M&A research team on Claude Managed Agents — a coordinator delegates to four parallel research sub-agents, reads prior-deal lessons from a memory store, reaches Linear via MCP, and emits a graded investment thesis while the UI streams every event and gated tool call.

License

Apache License 2.0. See LICENSE.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

cwc-workshops

Workshops

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
agent-battle		agent-battle
agent-decomposition		agent-decomposition
agents-that-remember		agents-that-remember
eval-driven-agent-development		eval-driven-agent-development
how-we-claude-code		how-we-claude-code
production-ready-agent		production-ready-agent
rightmodel		rightmodel
ship-your-first-managed-agent		ship-your-first-managed-agent
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

cwc-workshops

Workshops

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages