Context Engineering Wiki

promptsengineeringbest-practices

Prompt Engineering Overview

Claude API Documentation

promptsreasoningchain-of-thought

Chain of Thought Prompting

Comprehensive guide to prompt engineering techniques for Claude's latest models, covering clarity, examples, XML structuring, thinking, and agentic systems.

Anthropiccontext management

Context Windows

Claude API Documentation

contextwindowstokens

Anthropiccontext management

Long Context Window Tips

Comprehensive guide to prompt engineering techniques for Claude's latest models, covering clarity, examples, XML structuring, thinking, and agentic systems.

contextlong-contextoptimization

Anthropictoken optimization

Token Counting

Claude API Documentation

tokenscountingusage

Use XML Tags in Prompts

Comprehensive guide to prompt engineering techniques for Claude's latest models, covering clarity, examples, XML structuring, thinking, and agentic systems.

promptsxmlstructure

reasoningthinkingchain-of-thought

Extended Thinking

Claude API Documentation

Google AIcaching

Context Caching

Saiba como usar o armazenamento em cache de contexto na API Gemini

cachingcontextoptimization

Google AIcontext management

Long Context

Learn about how to get started building with long context (1 million context window) on Gemini.

contextlong-contexttokens

Google AItoken optimization

Tokens

/ Styles inlined from /site assets/css/style.css / body theme="googledevai theme" { devsite background 0: var devsite background 1 ; devsite button border: 1px solid 747775; devsite...

tokenscountingusage

Google AIprompt engineering

Prompting Strategies

/ Styles inlined from /site assets/css/style.css / body theme="googledevai theme" { devsite background 0: var devsite background 1 ; devsite button border: 1px solid 747775; devsite...

promptsstrategiesengineering

Google AIprompt engineering

System Instructions

Gemini API ile sohbet ve metin oluşturma uygulamaları geliştirmeye başlayın

system-promptsinstructionsengineering

Google AItool use

Code Execution

Learn how to use the Gemini API code execution feature.

codeexecutiontools

Progressive Disclosure

Instead of loading an entire codebase—which would immediately overwhelm the attention budget—modern agents use JIT context. The assistant dynamically loads only the necessary data at runtime.

contextjitoptimization

contextreferencesefficiency

Lightweight Identifiers

The assistant maintains references (file paths, stored queries) and dynamically loads only the necessary data at runtime using tools like grep, head, or tail.

contextcompressionlong-horizon

Compaction

When a session nears its token limit, the assistant summarizes critical details—such as architectural decisions and unresolved bugs—while discarding redundant tool outputs.

Tool Result Clearing

A light touch form of compaction where the raw results of previous tool calls (like long terminal outputs) are cleared to save space.

contexttoolsoptimization

Structured Note-taking

The agent may maintain an external NOTES.md or a to-do list to track dependencies and progress across thousands of steps, which it can read back into its context after a reset.

contextpersistencenotes

contextpollutionrelevance

Distractors

Files or code snippets that are topically related to the query but do not contain the answer can cause the model to lose focus or hallucinate.

Built-inprompt engineering

Context Rot

As more tokens are added, the model's ability to accurately retrieve needles of information from the haystack of the codebase decreases.

contextdegradationtokens

XML Tagging

Use tags like <background_information>, <tool_guidance>, <constraints> to clearly separate different types of instructions in system prompts.

promptsxmlstructure

Built-intoken optimization

High-Signal Tokens

The objective is to provide the smallest possible set of high-signal tokens that maximize the likelihood of the correct code generation.

tokensoptimizationquality

Structural Patterns

Research suggests that models often perform better on shuffled or unstructured context than on logically structured haystacks, impacting how they process long files.

contextstructureresearch

Agent Skills

Reusable packages of domain expertise defined in SKILL.md files that provide specialized AI agent capabilities. Introduced as GA in VS Code 1.109, skills can be invoked as slash commands or loaded...

skillsagentsvscode+1

Deterministic shell commands that execute at key lifecycle points during agent sessions. Unlike instructions, hooks run code with guaranteed outcomes for security policies, quality checks, or audit...

hooksagentslifecycle+1

orchestrationmulti-agentsubagent+1

Agent Orchestration

A multi-agent pattern where specialized subagents collaborate on complex tasks, each operating in its own dedicated context window. Provides context efficiency, specialization with different models,...

Message Steering

An agent interaction pattern where follow-up messages redirect a running agent request. The agent yields after the active tool execution and processes the new message. Alternatives include request...

agentssteeringqueueing+1

securitysandboxterminal+1

Terminal Sandboxing

A security mechanism restricting file system and network access for agent-executed terminal commands. Sandboxed commands have read/write access only to the workspace directory, and network access can...

Built-intoken economics

Thinking Tokens

Tokens generated during a model's internal reasoning process before producing a visible response. Thinking tokens consume context budget but improve quality on complex tasks. Anthropic models support...

thinkingreasoningtokens+1