Glossary
52 terms across 5 domains. Master the technical vocabulary required for the Claude Certified Architect Foundations exam.
52 of 52 terms
Agentic Loop
D1A repeating cycle where Claude receives input, reasons about it, optionally invokes tools, observes results, and decides whether to continue or stop. The loop is governed by stop_reason values returned by the API.
Exam Context
Know the three stop_reason values (end_turn, tool_use, max_tokens) and how each affects loop continuation. Expect questions on when to break the loop versus when to continue.
allowedTools
D3A settings array that specifies which tools Claude Code can use without asking for permission. Supports glob patterns for matching tool names. Tools not in this list require user approval before execution.
Exam Context
Know the glob syntax for matching tool families. Understand that allowedTools is a whitelist, not a blacklist. Combining with deniedTools creates a layered permission model.
cache_control
D5An API parameter that marks content as cacheable for prompt caching. Placed as a property on message content blocks with type: "ephemeral". Content up to and including the marked block is cached for reuse in subsequent requests.
Exam Context
Place cache_control breakpoints strategically: after the system prompt and after tool definitions. Up to 4 breakpoints allowed. Cache TTL is 5 minutes (refreshed on each hit).
Chain of Thought
D4A prompting technique that asks Claude to show its reasoning step-by-step before providing a final answer. Improves accuracy on complex reasoning tasks by forcing intermediate reasoning steps.
Exam Context
Use chain of thought for complex reasoning, math, and multi-step analysis. Can be combined with XML tags to separate thinking from the final answer. Extended thinking is the API feature for this.
CI/CD Integration
D3Running Claude Code in non-interactive mode within continuous integration pipelines. Uses the --print flag for single-shot execution and --dangerously-skip-permissions for automated environments where no human can approve tool use.
Exam Context
Know the flags: --print (non-interactive), --dangerously-skip-permissions (auto-approve all tools). Understand security implications of skipping permissions in CI.
Claude Agent SDK
D1A Python framework for building production agentic systems with Claude. Provides primitives for agents, tools, guardrails, handoffs between agents, and tracing. Designed for multi-agent orchestration with built-in safety features.
Exam Context
Know the core primitives: Agent, Runner, tool decorator, guardrail decorator, and handoff. Understand how Runner.run() manages the agentic loop internally.
CLAUDE.md
D3A markdown file that provides persistent context and instructions to Claude Code. Can be placed at project root, user home directory (~/.claude/CLAUDE.md), or in subdirectories. Claude reads these files at session start to understand project conventions.
Exam Context
Know the hierarchy: user-level, project-level, and directory-level CLAUDE.md files. Project-level is most common. Content should include coding standards, project structure, and key conventions.
Context Window
D5The maximum number of tokens Claude can process in a single API call, including both input and output. Claude's context window size varies by model. All content (system prompt, conversation history, tool definitions, and response) must fit within this limit.
Exam Context
Know that the context window is shared between input and output. Larger context windows cost more per token. Plan token budgets: system prompt + tools + history + expected output < window size.
end_turn
D1A stop_reason value indicating Claude has voluntarily finished its response. In an agentic loop, this signals that Claude believes the task is complete and no further tool calls are needed.
Exam Context
Distinguish from max_tokens (involuntary stop). In agentic loops, end_turn is the primary signal to exit the loop.
Error Recovery
D1Strategies for handling failures in agentic systems, including retry with exponential backoff, graceful degradation, fallback tools, and providing error context back to Claude so it can self-correct.
Exam Context
Know that feeding tool errors back to Claude often enables self-correction. Understand retry strategies and when to escalate versus retry.
Fan-Out/Fan-In
D1A parallelization pattern where a task is split into independent subtasks (fan-out), each processed concurrently, and results are aggregated (fan-in). Reduces latency for tasks with independent components.
Exam Context
Know when parallelization is safe (independent subtasks with no shared state) versus when sequential processing is required (dependent steps).
Few-Shot Example
D4Concrete input-output pairs included in the prompt to demonstrate the desired behavior. Examples teach Claude the expected format, style, and reasoning pattern more effectively than abstract instructions alone.
Exam Context
Include 2-5 diverse examples that cover edge cases. Place examples after instructions but before the actual input. Use consistent formatting between examples and the real task.
Guardrail
D1A safety mechanism that constrains Claude's behavior within defined boundaries. Guardrails can operate on input (pre-processing validation), output (post-processing filtering), or tool use (restricting which tools can be called and with what parameters).
Exam Context
Understand the difference between input guardrails, output guardrails, and tool-use guardrails. Know that guardrails in the Claude Agent SDK run as parallel checks that can halt execution.
Hook
D3A shell command that executes at specific lifecycle points in Claude Code. Hooks run before tool calls (PreToolUse), after tool calls (PostToolUse), and on notifications. They enable logging, validation, and custom automation.
Exam Context
Know the three hook types and their timing. PreToolUse hooks can block tool execution by returning a non-zero exit code. PostToolUse hooks can inspect results.
Human-in-the-Loop
D1A pattern where the agentic system pauses execution to request human approval before proceeding with sensitive operations. The human can approve, reject, or modify the proposed action.
Exam Context
Understand when human-in-the-loop is required (irreversible actions, high-stakes decisions) versus when it adds unnecessary friction. Know how to implement approval gates in the agentic loop.
inputSchema
D2The JSON Schema object within a tool definition that specifies the structure, types, and constraints of the tool's input parameters. Follows standard JSON Schema with properties, required fields, and descriptions.
Exam Context
Always mark required parameters explicitly. Use description fields on every property. Constrain with enum, minimum, maximum, and pattern where appropriate.
Long Conversation
D5A conversation that approaches or exceeds the context window limit. Requires management strategies such as summarization, sliding window truncation, or context compression to continue meaningfully without losing critical information.
Exam Context
Know the three main strategies: summarization (compress old messages), truncation (drop oldest messages), and selective retention (keep important messages, compress others). Each has tradeoffs.
MCP Client
D2A component that connects to one or more MCP servers, discovers their capabilities, and makes them available to the LLM. The client manages the lifecycle of server connections and translates between the MCP protocol and the LLM's tool format.
Exam Context
Understand that a single client can connect to multiple servers simultaneously. The client aggregates tools from all connected servers into a unified tool list for Claude.
MCP Server
D2A process that exposes tools, resources, and prompt templates over the Model Context Protocol. MCP servers can be implemented in any language and communicate via stdio or HTTP (SSE) transport.
Exam Context
Know the three capability types: tools (executable actions), resources (data sources), and prompts (templates). Understand that servers declare capabilities during initialization.
MCP Transport
D2The communication layer between MCP clients and servers. Two standard transports: stdio (local processes communicating via stdin/stdout) and HTTP with Server-Sent Events (remote servers). Stdio is simpler; HTTP enables remote deployment.
Exam Context
Stdio is the default for local tools. HTTP+SSE is required for remote or shared servers. Know the tradeoffs: stdio has no network overhead but requires local execution.
Monitoring
D5Observing and measuring the behavior of Claude-powered systems in production. Key metrics include latency, token usage, error rates, tool call patterns, and output quality scores. Essential for maintaining reliability and controlling costs.
Exam Context
Track: latency (time-to-first-token, total), cost (input/output tokens), errors (rate, type), and quality (user feedback, automated evals). Set alerts on anomalies.
Multi-Agent System
D1An architecture where multiple specialized agents collaborate on a task. Each agent has its own system prompt, tools, and guardrails. Agents communicate through handoffs or a central orchestrator.
Exam Context
Know when to use multi-agent versus single-agent. Multi-agent adds complexity; use it when tasks require fundamentally different tool sets or expertise domains.
Orchestration Pattern
D1An architecture for coordinating multiple Claude API calls or agents. Common patterns include prompt chaining (sequential), routing (conditional dispatch), parallelization (fan-out/fan-in), and evaluator-optimizer loops.
Exam Context
Be able to identify which pattern fits a given scenario. Routing is for classification-based dispatch; chaining is for sequential transformation; parallelization is for independent subtasks.
Output Validation
D4Verifying that Claude's response meets expected criteria before using it downstream. Includes JSON schema validation, content safety checks, format verification, and semantic correctness checks using a second LLM call.
Exam Context
Always validate structured output. Use tool_use for schema enforcement. For critical applications, add a separate validation step or LLM-as-judge evaluation.
Permission
D3A configuration that controls which tools Claude Code can use and which require approval. Permissions are set in settings files and follow a hierarchy from user-level to project-level configuration.
Exam Context
Understand the permission model: allow, deny, or ask. Know that allowedTools in settings controls the whitelist. Project-level settings cannot override user-level denials.
PostToolUse
D3A hook that fires after a tool completes execution in Claude Code. Receives the tool name, input, and output. Cannot prevent execution (already happened) but can log results, trigger notifications, or perform cleanup.
Exam Context
PostToolUse hooks are for observation, not prevention. Use for audit logging, notifications, or triggering dependent workflows after tool completion.
Prefilling
D4Starting the assistant's response with specific text to guide Claude's output format or direction. Done by including a partial assistant message in the API call. Forces Claude to continue from the provided prefix.
Exam Context
Useful for forcing JSON output (prefill with opening brace), maintaining format consistency, or steering response direction. The prefilled content appears in the final response.
PreToolUse
D3A hook that fires before a tool is executed in Claude Code. Receives the tool name and input as environment variables. Can block execution by returning exit code 2, or modify the operation. Used for validation and logging.
Exam Context
Exit code 0 allows execution, exit code 2 blocks it with feedback to Claude. The hook receives TOOL_NAME and TOOL_INPUT environment variables.
Production Reliability
D5Patterns for building robust Claude-powered systems: retry logic with exponential backoff, circuit breakers, fallback models, graceful degradation, health checks, and deployment strategies like canary releases.
Exam Context
Know the retry pattern (exponential backoff + jitter), circuit breaker pattern (fail fast after threshold), and fallback pattern (switch to simpler model or cached response).
Prompt Caching
D5An API feature that caches repeated prompt prefixes to reduce cost and latency on subsequent calls. Marked with cache_control breakpoints. Cached content is reused when the prefix matches exactly, reducing input token costs by up to 90%.
Exam Context
Know the cache_control breakpoint placement strategy. Content before the breakpoint is cached. Any change to cached content invalidates the cache. Place breakpoints after stable content (system prompt, tool definitions).
Prompt Chain
D4A sequence of Claude API calls where the output of one call feeds into the input of the next. Each step performs a focused transformation, validation, or decision. Chains decompose complex tasks into manageable steps.
Exam Context
Know when to chain versus when to use a single call. Chain when steps have different requirements (model, temperature, tools). Each step should have a clear, verifiable output.
Prompt Optimization
D4The iterative process of refining prompts to improve output quality, reduce cost, or increase reliability. Techniques include prompt compression, example selection, instruction clarification, and A/B testing against evaluation datasets.
Exam Context
Optimization is empirical: measure before and after with a consistent eval set. Common wins: clearer instructions, better examples, removing ambiguity, and adding output format constraints.
Prompt Template
D2An MCP capability that provides reusable prompt structures. Servers expose prompt templates with defined arguments that clients can fill in. Useful for standardizing interactions across different clients.
Exam Context
Prompt templates are the third MCP capability alongside tools and resources. They enable server-defined interaction patterns that clients can discover and use.
Rate Limiting
D5API-enforced limits on request frequency and token throughput. Claude's API has rate limits per minute (RPM) and tokens per minute (TPM). Exceeding limits returns 429 status codes. Managed with queuing, backoff, and request batching.
Exam Context
Know the HTTP 429 response and retry-after header. Implement exponential backoff with jitter. Use request queuing for high-throughput applications. Different tiers have different rate limits.
Resource
D2An MCP capability that exposes data to the LLM. Unlike tools (which perform actions), resources provide read-only access to content like files, database records, or API responses. Resources have a URI and return text or binary content.
Exam Context
Distinguish resources from tools: resources are for reading data, tools are for performing actions. Resources are identified by URI patterns.
Retry Strategy
D5A pattern for handling transient API failures by retrying requests with increasing delays. Exponential backoff (doubling wait time) with random jitter prevents thundering herd problems. Only retry on retryable errors (429, 500, 529).
Exam Context
Know which status codes are retryable: 429 (rate limit), 500 (server error), 529 (overloaded). Do not retry 400 (bad request) or 401 (auth). Always add jitter to prevent synchronized retries.
Routing Pattern
D1An orchestration pattern where an initial classification step determines which specialized agent or prompt chain handles the request. Useful when different input types require fundamentally different processing.
Exam Context
Routing is the right choice when you have distinct categories of input. Use a fast, cheap classification call to route before invoking specialized (potentially expensive) processing.
Settings Hierarchy
D3The order in which Claude Code resolves configuration: user-level (~/.claude/settings.json), project-level (.claude/settings.json), and enterprise-managed settings. More specific settings override general ones, with security exceptions.
Exam Context
User-level security settings (denied tools) cannot be overridden by project-level config. This prevents malicious repos from escalating permissions.
Slash Command
D3A custom command invoked with a / prefix in Claude Code. Built-in commands include /help, /clear, and /compact. Custom slash commands can be defined via .claude/commands/ directory with markdown files that serve as prompt templates.
Exam Context
Know how to create custom slash commands: place a .md file in .claude/commands/ with the prompt template. The filename becomes the command name. Arguments use $ARGUMENTS placeholder.
stop_reason
D1A field in the Claude API response indicating why generation stopped. Values: end_turn (Claude finished voluntarily), tool_use (Claude wants to call a tool), max_tokens (output limit reached).
Exam Context
Critical for agentic loop implementation. tool_use means continue the loop with tool results. end_turn means the task is complete. max_tokens may require continuation logic.
Structured Output
D4Techniques for getting Claude to respond in a specific format like JSON, XML, or a defined schema. Achieved through tool use (forces JSON matching the schema), system prompt instructions, or prefilling the assistant response.
Exam Context
Tool use is the most reliable method for structured output. Prefilling with an opening brace or tag is a useful technique. Always validate output against the expected schema.
Summarization
D5A context management technique where older conversation messages are compressed into a concise summary. The summary replaces the original messages, freeing context window space while preserving key information for continued conversation.
Exam Context
Summarization trades fidelity for space. Summarize when conversation exceeds a token threshold. Keep the summary in a dedicated system or user message. Re-summarize periodically for very long conversations.
System Prompt
D4The initial instruction block sent to Claude that establishes behavior, persona, constraints, and context for the entire conversation. System prompts are processed first and influence all subsequent responses.
Exam Context
Know that system prompts set the behavioral foundation. Keep them focused: role, constraints, output format. Avoid putting dynamic content in system prompts when it should be in user messages.
Temperature
D4A parameter (0.0 to 1.0) controlling randomness in Claude's responses. Lower values produce more deterministic, focused output. Higher values increase creativity and variation. Default is 1.0.
Exam Context
Use temperature 0 for structured output, classification, and extraction. Use higher temperature for creative tasks. Temperature does not affect the quality of reasoning, only the diversity of outputs.
Token Counting
D5Measuring the number of tokens in prompts and responses to manage costs and stay within context window limits. Anthropic provides a token counting API endpoint. Accurate counting is essential for budget management and context window planning.
Exam Context
Use the token counting API for accurate estimates before sending expensive requests. Tokens are not characters or words: a token is roughly 3-4 characters for English text.
Tool Approval
D2A mechanism for requiring user confirmation before a tool executes. Configured through permission settings that classify tools as allowed, denied, or requiring approval based on the tool name or server.
Exam Context
Know how to configure tool permissions in Claude Code. Understand the difference between allowedTools (whitelist) and tools that require explicit user approval.
Tool Error Handling
D2The practice of returning structured error information from tools so Claude can understand what went wrong and attempt recovery. Tool results should include is_error: true and a descriptive message rather than throwing exceptions.
Exam Context
Never throw unhandled exceptions from tools. Return errors as structured content so Claude can reason about them. Include actionable information in error messages.
Tool Schema
D2A JSON Schema definition that describes a tool's name, purpose, and input parameters. Claude uses the schema to understand what the tool does and how to construct valid inputs. Well-designed schemas include clear descriptions and constrained types.
Exam Context
Schema quality directly affects tool selection accuracy. Include descriptions on every parameter. Use enums to constrain values. Avoid overly complex nested schemas.
Tool Selection
D2Claude's process of choosing which tool to invoke based on the user's request and available tool schemas. Affected by tool names, descriptions, parameter schemas, and system prompt guidance.
Exam Context
Tool names and descriptions are the primary signals for selection. Avoid ambiguous or overlapping tool names. Use the system prompt to provide tool selection guidance when needed.
tool_use
D1A content block type in the Claude API response indicating Claude wants to invoke a tool. Contains the tool name, a unique ID, and the input parameters matching the tool's JSON schema.
Exam Context
Know the structure: each tool_use block has id, name, and input fields. The tool result must reference the same id when returning results to Claude.
Worktree
D3A Git feature that allows multiple working trees attached to the same repository. Claude Code supports worktrees for parallel development: run multiple Claude Code sessions on different branches of the same repo simultaneously.
Exam Context
Know that worktrees enable parallel Claude Code sessions without conflicting file changes. Each worktree has its own working directory but shares the Git history.
XML Tags
D4A prompting technique using XML-style tags to structure input and output. Claude responds well to tags like <instructions>, <context>, <example>, and <output> for clearly delineating sections of complex prompts.
Exam Context
XML tags improve Claude's ability to follow complex multi-part prompts. Use them to separate instructions from data, mark example boundaries, and define output sections.