Glossary

52 terms across 5 domains. Master the technical vocabulary required for the Claude Certified Architect Foundations exam.

Jump:

52 of 52 terms

Agentic Loop

A repeating cycle where Claude receives input, reasons about it, optionally invokes tools, observes results, and decides whether to continue or stop. The loop is governed by stop_reason values returned by the API.

Exam Context

Know the three stop_reason values (end_turn, tool_use, max_tokens) and how each affects loop continuation. Expect questions on when to break the loop versus when to continue.

1.1 Agentic Loops

allowedTools

A settings array that specifies which tools Claude Code can use without asking for permission. Supports glob patterns for matching tool names. Tools not in this list require user approval before execution.

Exam Context

Know the glob syntax for matching tool families. Understand that allowedTools is a whitelist, not a blacklist. Combining with deniedTools creates a layered permission model.

3.4 Permissions & Security

cache_control

An API parameter that marks content as cacheable for prompt caching. Placed as a property on message content blocks with type: "ephemeral". Content up to and including the marked block is cached for reuse in subsequent requests.

Exam Context

Place cache_control breakpoints strategically: after the system prompt and after tool definitions. Up to 4 breakpoints allowed. Cache TTL is 5 minutes (refreshed on each hit).

5.2 Prompt Caching

Chain of Thought

A prompting technique that asks Claude to show its reasoning step-by-step before providing a final answer. Improves accuracy on complex reasoning tasks by forcing intermediate reasoning steps.

Exam Context

Use chain of thought for complex reasoning, math, and multi-step analysis. Can be combined with XML tags to separate thinking from the final answer. Extended thinking is the API feature for this.

4.5 Prompt Optimisation

CI/CD Integration

Running Claude Code in non-interactive mode within continuous integration pipelines. Uses the --print flag for single-shot execution and --dangerously-skip-permissions for automated environments where no human can approve tool use.

Exam Context

Know the flags: --print (non-interactive), --dangerously-skip-permissions (auto-approve all tools). Understand security implications of skipping permissions in CI.

3.6 CI/CD Integration

Claude Agent SDK

A Python framework for building production agentic systems with Claude. Provides primitives for agents, tools, guardrails, handoffs between agents, and tracing. Designed for multi-agent orchestration with built-in safety features.

Exam Context

Know the core primitives: Agent, Runner, tool decorator, guardrail decorator, and handoff. Understand how Runner.run() manages the agentic loop internally.

1.4 Claude Agent SDK

CLAUDE.md

A markdown file that provides persistent context and instructions to Claude Code. Can be placed at project root, user home directory (~/.claude/CLAUDE.md), or in subdirectories. Claude reads these files at session start to understand project conventions.

Exam Context

Know the hierarchy: user-level, project-level, and directory-level CLAUDE.md files. Project-level is most common. Content should include coding standards, project structure, and key conventions.

3.2 CLAUDE.md Files

Context Window

The maximum number of tokens Claude can process in a single API call, including both input and output. Claude's context window size varies by model. All content (system prompt, conversation history, tool definitions, and response) must fit within this limit.

Exam Context

Know that the context window is shared between input and output. Larger context windows cost more per token. Plan token budgets: system prompt + tools + history + expected output < window size.

5.1 Context Window Management

end_turn

A stop_reason value indicating Claude has voluntarily finished its response. In an agentic loop, this signals that Claude believes the task is complete and no further tool calls are needed.

Exam Context

Distinguish from max_tokens (involuntary stop). In agentic loops, end_turn is the primary signal to exit the loop.

1.1 Agentic Loops

Error Recovery

Strategies for handling failures in agentic systems, including retry with exponential backoff, graceful degradation, fallback tools, and providing error context back to Claude so it can self-correct.

Exam Context

Know that feeding tool errors back to Claude often enables self-correction. Understand retry strategies and when to escalate versus retry.

1.7 Error Recovery & Resilience

Fan-Out/Fan-In

A parallelization pattern where a task is split into independent subtasks (fan-out), each processed concurrently, and results are aggregated (fan-in). Reduces latency for tasks with independent components.

Exam Context

Know when parallelization is safe (independent subtasks with no shared state) versus when sequential processing is required (dependent steps).

1.2 Orchestration Patterns

Few-Shot Example

Concrete input-output pairs included in the prompt to demonstrate the desired behavior. Examples teach Claude the expected format, style, and reasoning pattern more effectively than abstract instructions alone.

Exam Context

Include 2-5 diverse examples that cover edge cases. Place examples after instructions but before the actual input. Use consistent formatting between examples and the real task.

4.4 Few-Shot & Examples

Guardrail

A safety mechanism that constrains Claude's behavior within defined boundaries. Guardrails can operate on input (pre-processing validation), output (post-processing filtering), or tool use (restricting which tools can be called and with what parameters).

Exam Context

Understand the difference between input guardrails, output guardrails, and tool-use guardrails. Know that guardrails in the Claude Agent SDK run as parallel checks that can halt execution.

1.3 Guardrails & Safety

Hook

A shell command that executes at specific lifecycle points in Claude Code. Hooks run before tool calls (PreToolUse), after tool calls (PostToolUse), and on notifications. They enable logging, validation, and custom automation.

Exam Context

Know the three hook types and their timing. PreToolUse hooks can block tool execution by returning a non-zero exit code. PostToolUse hooks can inspect results.

3.3 Hooks & Automation

Human-in-the-Loop

A pattern where the agentic system pauses execution to request human approval before proceeding with sensitive operations. The human can approve, reject, or modify the proposed action.

Exam Context

Understand when human-in-the-loop is required (irreversible actions, high-stakes decisions) versus when it adds unnecessary friction. Know how to implement approval gates in the agentic loop.

1.6 Human-in-the-Loop

inputSchema

The JSON Schema object within a tool definition that specifies the structure, types, and constraints of the tool's input parameters. Follows standard JSON Schema with properties, required fields, and descriptions.

Exam Context

Always mark required parameters explicitly. Use description fields on every property. Constrain with enum, minimum, maximum, and pattern where appropriate.

2.1 Tool Schema Design

Long Conversation

A conversation that approaches or exceeds the context window limit. Requires management strategies such as summarization, sliding window truncation, or context compression to continue meaningfully without losing critical information.

Exam Context

Know the three main strategies: summarization (compress old messages), truncation (drop oldest messages), and selective retention (keep important messages, compress others). Each has tradeoffs.

5.3 Long Conversations

MCP Client

A component that connects to one or more MCP servers, discovers their capabilities, and makes them available to the LLM. The client manages the lifecycle of server connections and translates between the MCP protocol and the LLM's tool format.

Exam Context

Understand that a single client can connect to multiple servers simultaneously. The client aggregates tools from all connected servers into a unified tool list for Claude.

2.3 MCP Client Integration

MCP Server

A process that exposes tools, resources, and prompt templates over the Model Context Protocol. MCP servers can be implemented in any language and communicate via stdio or HTTP (SSE) transport.

Exam Context

Know the three capability types: tools (executable actions), resources (data sources), and prompts (templates). Understand that servers declare capabilities during initialization.

2.2 MCP Server Implementation

MCP Transport

The communication layer between MCP clients and servers. Two standard transports: stdio (local processes communicating via stdin/stdout) and HTTP with Server-Sent Events (remote servers). Stdio is simpler; HTTP enables remote deployment.

Exam Context

Stdio is the default for local tools. HTTP+SSE is required for remote or shared servers. Know the tradeoffs: stdio has no network overhead but requires local execution.

2.2 MCP Server Implementation

Monitoring

Observing and measuring the behavior of Claude-powered systems in production. Key metrics include latency, token usage, error rates, tool call patterns, and output quality scores. Essential for maintaining reliability and controlling costs.

Exam Context

Track: latency (time-to-first-token, total), cost (input/output tokens), errors (rate, type), and quality (user feedback, automated evals). Set alerts on anomalies.

5.5 Monitoring & Observability

Multi-Agent System

An architecture where multiple specialized agents collaborate on a task. Each agent has its own system prompt, tools, and guardrails. Agents communicate through handoffs or a central orchestrator.

Exam Context

Know when to use multi-agent versus single-agent. Multi-agent adds complexity; use it when tasks require fundamentally different tool sets or expertise domains.

1.5 Multi-Agent Systems

Orchestration Pattern

An architecture for coordinating multiple Claude API calls or agents. Common patterns include prompt chaining (sequential), routing (conditional dispatch), parallelization (fan-out/fan-in), and evaluator-optimizer loops.

Exam Context

Be able to identify which pattern fits a given scenario. Routing is for classification-based dispatch; chaining is for sequential transformation; parallelization is for independent subtasks.

1.2 Orchestration Patterns

Output Validation

Verifying that Claude's response meets expected criteria before using it downstream. Includes JSON schema validation, content safety checks, format verification, and semantic correctness checks using a second LLM call.

Exam Context

Always validate structured output. Use tool_use for schema enforcement. For critical applications, add a separate validation step or LLM-as-judge evaluation.

4.6 Output Validation

Permission

A configuration that controls which tools Claude Code can use and which require approval. Permissions are set in settings files and follow a hierarchy from user-level to project-level configuration.

Exam Context

Understand the permission model: allow, deny, or ask. Know that allowedTools in settings controls the whitelist. Project-level settings cannot override user-level denials.

3.4 Permissions & Security

PostToolUse

A hook that fires after a tool completes execution in Claude Code. Receives the tool name, input, and output. Cannot prevent execution (already happened) but can log results, trigger notifications, or perform cleanup.

Exam Context

PostToolUse hooks are for observation, not prevention. Use for audit logging, notifications, or triggering dependent workflows after tool completion.

3.3 Hooks & Automation

Prefilling

Starting the assistant's response with specific text to guide Claude's output format or direction. Done by including a partial assistant message in the API call. Forces Claude to continue from the provided prefix.

Exam Context

Useful for forcing JSON output (prefill with opening brace), maintaining format consistency, or steering response direction. The prefilled content appears in the final response.

4.2 Structured Output

PreToolUse

A hook that fires before a tool is executed in Claude Code. Receives the tool name and input as environment variables. Can block execution by returning exit code 2, or modify the operation. Used for validation and logging.

Exam Context

Exit code 0 allows execution, exit code 2 blocks it with feedback to Claude. The hook receives TOOL_NAME and TOOL_INPUT environment variables.

3.3 Hooks & Automation

Production Reliability

Patterns for building robust Claude-powered systems: retry logic with exponential backoff, circuit breakers, fallback models, graceful degradation, health checks, and deployment strategies like canary releases.

Exam Context

Know the retry pattern (exponential backoff + jitter), circuit breaker pattern (fail fast after threshold), and fallback pattern (switch to simpler model or cached response).

5.6 Production Reliability

Prompt Caching

An API feature that caches repeated prompt prefixes to reduce cost and latency on subsequent calls. Marked with cache_control breakpoints. Cached content is reused when the prefix matches exactly, reducing input token costs by up to 90%.

Exam Context

Know the cache_control breakpoint placement strategy. Content before the breakpoint is cached. Any change to cached content invalidates the cache. Place breakpoints after stable content (system prompt, tool definitions).

5.2 Prompt Caching

Prompt Chain

A sequence of Claude API calls where the output of one call feeds into the input of the next. Each step performs a focused transformation, validation, or decision. Chains decompose complex tasks into manageable steps.

Exam Context

Know when to chain versus when to use a single call. Chain when steps have different requirements (model, temperature, tools). Each step should have a clear, verifiable output.

4.3 Prompt Chaining

Prompt Optimization

The iterative process of refining prompts to improve output quality, reduce cost, or increase reliability. Techniques include prompt compression, example selection, instruction clarification, and A/B testing against evaluation datasets.

Exam Context

Optimization is empirical: measure before and after with a consistent eval set. Common wins: clearer instructions, better examples, removing ambiguity, and adding output format constraints.

4.5 Prompt Optimisation

Prompt Template

An MCP capability that provides reusable prompt structures. Servers expose prompt templates with defined arguments that clients can fill in. Useful for standardizing interactions across different clients.

Exam Context

Prompt templates are the third MCP capability alongside tools and resources. They enable server-defined interaction patterns that clients can discover and use.

2.3 MCP Client Integration

Rate Limiting

API-enforced limits on request frequency and token throughput. Claude's API has rate limits per minute (RPM) and tokens per minute (TPM). Exceeding limits returns 429 status codes. Managed with queuing, backoff, and request batching.

Exam Context

Know the HTTP 429 response and retry-after header. Implement exponential backoff with jitter. Use request queuing for high-throughput applications. Different tiers have different rate limits.

5.4 Rate Limiting & Quotas

Resource

An MCP capability that exposes data to the LLM. Unlike tools (which perform actions), resources provide read-only access to content like files, database records, or API responses. Resources have a URI and return text or binary content.

Exam Context

Distinguish resources from tools: resources are for reading data, tools are for performing actions. Resources are identified by URI patterns.

2.3 MCP Client Integration

Retry Strategy

A pattern for handling transient API failures by retrying requests with increasing delays. Exponential backoff (doubling wait time) with random jitter prevents thundering herd problems. Only retry on retryable errors (429, 500, 529).

Exam Context

Know which status codes are retryable: 429 (rate limit), 500 (server error), 529 (overloaded). Do not retry 400 (bad request) or 401 (auth). Always add jitter to prevent synchronized retries.

5.6 Production Reliability

Routing Pattern

An orchestration pattern where an initial classification step determines which specialized agent or prompt chain handles the request. Useful when different input types require fundamentally different processing.

Exam Context

Routing is the right choice when you have distinct categories of input. Use a fast, cheap classification call to route before invoking specialized (potentially expensive) processing.

1.2 Orchestration Patterns

Settings Hierarchy

The order in which Claude Code resolves configuration: user-level (~/.claude/settings.json), project-level (.claude/settings.json), and enterprise-managed settings. More specific settings override general ones, with security exceptions.

Exam Context

User-level security settings (denied tools) cannot be overridden by project-level config. This prevents malicious repos from escalating permissions.

3.1 Configuration & Settings

Slash Command

A custom command invoked with a / prefix in Claude Code. Built-in commands include /help, /clear, and /compact. Custom slash commands can be defined via .claude/commands/ directory with markdown files that serve as prompt templates.

Exam Context

Know how to create custom slash commands: place a .md file in .claude/commands/ with the prompt template. The filename becomes the command name. Arguments use $ARGUMENTS placeholder.

3.5 Slash Commands & MCP

stop_reason

A field in the Claude API response indicating why generation stopped. Values: end_turn (Claude finished voluntarily), tool_use (Claude wants to call a tool), max_tokens (output limit reached).

Exam Context

Critical for agentic loop implementation. tool_use means continue the loop with tool results. end_turn means the task is complete. max_tokens may require continuation logic.

1.1 Agentic Loops

Structured Output

Techniques for getting Claude to respond in a specific format like JSON, XML, or a defined schema. Achieved through tool use (forces JSON matching the schema), system prompt instructions, or prefilling the assistant response.

Exam Context

Tool use is the most reliable method for structured output. Prefilling with an opening brace or tag is a useful technique. Always validate output against the expected schema.

4.2 Structured Output

Summarization

A context management technique where older conversation messages are compressed into a concise summary. The summary replaces the original messages, freeing context window space while preserving key information for continued conversation.

Exam Context

Summarization trades fidelity for space. Summarize when conversation exceeds a token threshold. Keep the summary in a dedicated system or user message. Re-summarize periodically for very long conversations.

5.3 Long Conversations

System Prompt

The initial instruction block sent to Claude that establishes behavior, persona, constraints, and context for the entire conversation. System prompts are processed first and influence all subsequent responses.

Exam Context

Know that system prompts set the behavioral foundation. Keep them focused: role, constraints, output format. Avoid putting dynamic content in system prompts when it should be in user messages.

4.1 System Prompts

Temperature

A parameter (0.0 to 1.0) controlling randomness in Claude's responses. Lower values produce more deterministic, focused output. Higher values increase creativity and variation. Default is 1.0.

Exam Context

Use temperature 0 for structured output, classification, and extraction. Use higher temperature for creative tasks. Temperature does not affect the quality of reasoning, only the diversity of outputs.

4.5 Prompt Optimisation

Token Counting

Measuring the number of tokens in prompts and responses to manage costs and stay within context window limits. Anthropic provides a token counting API endpoint. Accurate counting is essential for budget management and context window planning.

Exam Context

Use the token counting API for accurate estimates before sending expensive requests. Tokens are not characters or words: a token is roughly 3-4 characters for English text.

5.1 Context Window Management

Tool Approval

A mechanism for requiring user confirmation before a tool executes. Configured through permission settings that classify tools as allowed, denied, or requiring approval based on the tool name or server.

Exam Context

Know how to configure tool permissions in Claude Code. Understand the difference between allowedTools (whitelist) and tools that require explicit user approval.

2.4 Tool Error Handling

Tool Error Handling

The practice of returning structured error information from tools so Claude can understand what went wrong and attempt recovery. Tool results should include is_error: true and a descriptive message rather than throwing exceptions.

Exam Context

Never throw unhandled exceptions from tools. Return errors as structured content so Claude can reason about them. Include actionable information in error messages.

2.4 Tool Error Handling

Tool Schema

A JSON Schema definition that describes a tool's name, purpose, and input parameters. Claude uses the schema to understand what the tool does and how to construct valid inputs. Well-designed schemas include clear descriptions and constrained types.

Exam Context

Schema quality directly affects tool selection accuracy. Include descriptions on every parameter. Use enums to constrain values. Avoid overly complex nested schemas.

2.1 Tool Schema Design

Tool Selection

Claude's process of choosing which tool to invoke based on the user's request and available tool schemas. Affected by tool names, descriptions, parameter schemas, and system prompt guidance.

Exam Context

Tool names and descriptions are the primary signals for selection. Avoid ambiguous or overlapping tool names. Use the system prompt to provide tool selection guidance when needed.

2.5 Tool Selection & Routing

tool_use

A content block type in the Claude API response indicating Claude wants to invoke a tool. Contains the tool name, a unique ID, and the input parameters matching the tool's JSON schema.

Exam Context

Know the structure: each tool_use block has id, name, and input fields. The tool result must reference the same id when returning results to Claude.

1.1 Agentic Loops

Worktree

A Git feature that allows multiple working trees attached to the same repository. Claude Code supports worktrees for parallel development: run multiple Claude Code sessions on different branches of the same repo simultaneously.

Exam Context

Know that worktrees enable parallel Claude Code sessions without conflicting file changes. Each worktree has its own working directory but shares the Git history.

3.1 Configuration & Settings

XML Tags

A prompting technique using XML-style tags to structure input and output. Claude responds well to tags like <instructions>, <context>, <example>, and <output> for clearly delineating sections of complex prompts.

Exam Context

XML tags improve Claude's ability to follow complex multi-part prompts. Use them to separate instructions from data, mark example boundaries, and define output sections.

4.1 System Prompts