Skip to content

mckang/cwc-workshops

 
 

Repository files navigation

cwc-workshops

Workshop materials. Not maintained and not accepting contributions.

Materials from Anthropic-run Code with Claude workshops.

Workshops

  • rightmodel/Picking the Right Model: use a Claude Code SKILL to audit an LLM eval suite and sweep it across models and inference parameters (extended thinking, effort) to find the best quality-per-dollar and quality-per-second configuration.
  • agent-decomposition/Compose Multi-Agent Systems with Skills and MCP: decompose a 400-line-prompt inventory agent into skills + code execution + callable_agents on Claude Managed Agents, with evals to verify each step.
  • how-we-claude-code/How We Claude Code: a three-phase walkthrough of an AI-assisted product workflow — interview to spec, four divergent design explorations as static HTML, and a Vite + React app whose components emit a machine-readable DOM contract so an agent (or CI) can verify them at runtime.
  • ship-your-first-managed-agent/Ship Your First Managed Agent: a Streamlit incident dashboard with an offline SRE Agent chat panel. You bring it online by implementing seven small functions in agent.py, each a single Claude Managed Agents API call — until it can grep a 70k-line log in its sandbox, call your local tools, and name the bad commit.
  • agent-battle/Agent Battle: a 45-minute competition to configure a Claude Managed Agent — system prompt, skills, MCP servers, model — that drives a local game bot over MCP. Most diamonds wins, fewest tokens breaks ties; a fast --eval decision-probe loop lets you test config changes in ~30s before committing to a 5-minute run.
  • agents-that-remember/Agents That Remember: start with a Managed Agent that's visibly amnesiac across sessions, then layer in memory primitives one at a time — a memory store for cross-session persistence, then the Dreaming Service to consolidate past transcripts — going "goldfish to colleague" in 45 minutes.
  • eval-driven-agent-development/Eval-Driven Agent Development: iterate a PPTX-generating Managed Agent through six variants (naive → visual → typography → palette → density → QA-loop), scoring each against a 10-task suite with a two-layer grader (programmatic .pptx XML metrics + LLM-as-judge on rendered slides) so every prompt change is measured, not vibed.
  • production-ready-agent/Deal Desk: a chat-first UI over a multi-agent M&A research team on Claude Managed Agents — a coordinator delegates to four parallel research sub-agents, reads prior-deal lessons from a memory store, reaches Linear via MCP, and emits a graded investment thesis while the UI streams every event and gated tool call.

License

Apache License 2.0. See LICENSE.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • TypeScript 34.2%
  • Python 30.2%
  • JavaScript 17.0%
  • CSS 7.7%
  • Shell 5.7%
  • HTML 5.1%
  • Dockerfile 0.1%