Implement Zero Trust for AI Agents - Anthropic Framework Crosswalk - SWT3 Protocol

Audience: Enterprise architects implementing Anthropic's Zero Trust framework, CISOs evaluating independent compliance layers for Claude agent deployments, security engineers building agentic systems in regulated industries (healthcare, finance, defense), and auditors assessing AI governance controls.

1. What Anthropic's Framework Requires

On May 27, 2026, Anthropic published Zero Trust for AI Agents, a framework for deploying autonomous AI agents in enterprise environments. The paper argues that AI-accelerated offense has compressed attack timelines from months to hours, requiring a fundamental shift from traditional perimeter security to agent-level controls.

The framework defines a three-tier architecture:

Foundation Tier: Basic identity and access controls. Cryptographic identity rooting. Entry-level Zero Trust.
Advanced Tier: Task-scoped permissions. Enhanced memory protections. Strengthened input/output controls.
Optimized Tier: AI-speed defensive operations. Comprehensive breach assumption. Full agentic SOAR integration.

The paper outlines an eight-phase implementation workflow covering identity, access scoping, sandboxing, input and output controls, and memory safeguards. It addresses five core threats: prompt injection, tool poisoning, identity and privilege abuse, memory poisoning, and supply chain attacks.

2. The Independent Witness Gap

Anthropic's framework mandates cryptographic identity, full traceability, and breach-assumption auditing. However, Anthropic is the model provider. If Claude experiences a reasoning failure, prompt injection, or tool misuse, the audit trail cannot credibly come from Anthropic's own infrastructure.

The structural problem: A foundation model provider cannot simultaneously be the AI brain and the independent compliance witness. Regulatory bodies under the EU AI Act, NIST AI RMF, and CMMC frameworks require evidence from a system that is architecturally independent of the model runtime. This is not a criticism of Anthropic's framework. It is an inherent constraint of any system where the entity being audited provides the audit evidence.

SWT3 fills this gap as an independent, out-of-band witness protocol. It does not inspect, filter, or modify inference. It records what happened, whether it matched declared policy, and produces tamper-evident evidence that survives model compromise.

3. Requirement-to-Procedure Mapping

Anthropic Requirement	SWT3 Procedure	What SWT3 Witnesses	Coverage
Cryptographic agent identity	`AI-ID.1`	Agent ID binding, HMAC-SHA256 signing per anchor	Full
Task-scoped permissions	`AI-ACC.1`, `AI-TOOL.1`	Access control decisions, tool authorization records	Full
Prompt-to-action traceability	`AI-INF.1`, `AI-CHAIN.1`	Inference fingerprints, chain-of-custody across agents	Full
Tool containment	`AI-TOOL.1`	Tool name, input/output hashes, latency, success/failure	Full
Prompt injection defense	`AI-GRD.1`, `AI-GRD.2`	Guardrail presence, threat scores, output classification	Partial
Supply chain security	`AI-SUPPLY.1`, `AI-SBOM.1`	Dependency attestation, software bill of materials	Full
Memory poisoning defense	`AI-SKILL.2`	Memory context state, persistence boundaries	Partial
Multi-agent coordination	`AI-MULTI.1`, `AI-CHAIN.2`	Agent handoff records, chain trust verification	Full
Agentic SOAR	`AI-INCIDENT.1`	Incident classification, response attestation	Partial
Breach assumption / audit	`AI-AUDIT.1`	Audit trail integrity, tamper-evident logging	Full
Identity/privilege abuse	`AI-ACC.1`, `AI-SEC.1`	Access decisions, security scan results	Full

Coverage key: Full = SWT3 directly witnesses the control through procedures and produces auditable evidence. Partial = SWT3 provides supporting attestation; the primary defense mechanism requires additional tooling.

4. Cryptographic Agent Identity

AI-ID.1

Agent Identity Binding

Anthropic requires: Identities that are cryptographically rooted, with per-agent identity isolation.

How SWT3 addresses it: Every witness anchor includes an agent_id field that binds the anchor to a specific agent instance. The HMAC-SHA256 payload_signature cryptographically proves which SDK instance minted the anchor. The signing_key is unique per agent and never appears in any anchor payload. Combined with the deterministic fingerprint formula, this creates a non-repudiable identity chain for every inference.

What to show the examiner: Query the witness ledger for a specific agent_id and verify the payload_signature using the agent's signing key. Each anchor is independently verifiable without trusting the model provider's infrastructure.

5. Task-Scoped Permissions

AI-ACC.1 + AI-TOOL.1

Access Control and Tool Authorization Witnessing

Anthropic requires: Task-scoped access instead of standing permissions. Agents should only have the permissions they need for the current task.

How SWT3 addresses it: wrapAccess() records every access control decision with the resource accessed, the authorization result, and the authorization method. wrapTool() records every tool invocation with tool name, input hash, output hash, and whether the call was permitted by the declared .swt3.yaml policy. The authorization_id field links each inference to a pre-approved authorization token, creating evidence that the agent operated within its approved scope.

What to show the examiner: Present the .swt3.yaml policy file alongside the witness anchors. The policy_version hash in each anchor proves which permission set was active at the time of execution.

6. Prompt-to-Action Traceability

AI-INF.1 + AI-CHAIN.1

Inference Witnessing and Chain of Custody

Anthropic requires: Full traceability from prompt to action to outcome. Every step in an agent's reasoning chain must be auditable.

How SWT3 addresses it: witness() records every inference with a deterministic fingerprint: SHA256("WITNESS:{tenant}:{proc}:{fa}:{fb}:{fc}:{ts_ms}").hex()[:12]. Factor A captures the model, Factor B captures the prompt hash, Factor C captures the response hash. For multi-step chains, cycle_id links related inferences into a single reasoning trace. The chain witnessing procedures (AI-CHAIN.1, AI-CHAIN.2) record handoffs between agents with trust level verification at each boundary.

What to show the examiner: Query all anchors with a specific cycle_id to reconstruct the full reasoning chain. Each step's fingerprint is independently verifiable via the /verify endpoint.

7. Tool Containment

AI-TOOL.1

Tool Call Witnessing

Anthropic requires: Sandboxed tool execution with explicit allow-listing.

How SWT3 addresses it: wrapTool() intercepts every tool call and records the tool name, input hash, output hash, execution latency, and success/failure status. The .swt3.yaml policy file declares mcp_policy.tool_allowlist and mcp_policy.tool_blocklist with glob pattern matching. Tools not on the allowlist produce witness anchors flagged as policy violations. Velocity limits (max_velocity) cap the rate of tool execution to prevent automated exploitation loops.

What to show the examiner: Export tool call anchors filtered by procedure_id = AI-TOOL.1. Each record shows exactly which tool was called, whether it was on the allowlist, and what the outcome was.

8. Prompt Injection Defense

AI-GRD.1 + AI-GRD.2

Guardrail Presence and Output Classification

Anthropic requires: Defense against prompt injection and context spoofing attacks.

How SWT3 addresses it: SWT3 does not perform prompt filtering. That is the role of guardrail tools (Prompt Guard, NeMo Guardrails, Claude's native safety layer). What SWT3 does is witness whether guardrails were active and what they detected. AI-GRD.1 records whether required guardrails were present at inference time. AI-GRD.2 records output classification results. AI-SEC.1 records security scan scores and threat types. In gatekeeper mode (strict: true), the SDK will not proceed with inference if required guardrails are absent.

What to show the examiner: Present AI-GRD.1 anchors showing guardrail presence across all inferences. Any inference without guardrails will have a corresponding witness record documenting the gap.

9. Supply Chain Security

AI-SUPPLY.1 + AI-SBOM.1

Supply Chain Attestation

Anthropic requires: Resilience against supply chain attacks on agent components and dependencies.

How SWT3 addresses it: AI-SUPPLY.1 witnesses the supply chain attestation for the agent's deployment. AI-SBOM.1 records a software bill of materials with dependency hashes, versions, and license information. AI-SKILL.1 witnesses the loaded skill manifest at initialization, creating a cryptographic snapshot of what was loaded before any skill executes. AI-MDL.5 provides file-level model weight hashing to detect unauthorized model changes.

What to show the examiner: Compare AI-SBOM.1 anchors across deployments. Any dependency change produces a different fingerprint, making unauthorized modifications detectable without inspecting the model provider's infrastructure.

10. Memory Poisoning Defense

AI-SKILL.2

Memory Context Witnessing

Anthropic requires: Defense against memory poisoning -- adversarial inputs that persist in agent memory and alter future behavior.

How SWT3 addresses it: AI-SKILL.2 witnesses the memory context state, recording what the agent's persistent memory contained at the time of inference. This creates a forensic trail showing whether memory was altered between sessions. The clearing engine (Levels 1-3) strips sensitive content from memory attestation records before transmission, preventing the witness layer itself from becoming a memory exfiltration vector.

What to show the examiner: Compare AI-SKILL.2 anchors across sessions for the same agent. Unexpected changes in memory context hashes indicate potential memory poisoning.

11. Multi-Agent Coordination

AI-MULTI.1 + AI-CHAIN.2

Multi-Agent Handoff and Chain Trust

Anthropic requires: Secure coordination between multiple agents with maintained security boundaries.

How SWT3 addresses it: AI-MULTI.1 records when an agent delegates work to another agent, capturing the source agent, destination agent, task description, and trust level at the handoff boundary. AI-CHAIN.2 verifies chain trust by recording the trust level, verification method, and peer credential at each inter-agent boundary. The Trust Mesh protocol (trust_mesh.mode: strict) requires bilateral credential exchange before any data crosses agent boundaries. The chainMinTrustLevel setting enforces a minimum trust floor for all chain handoffs.

What to show the examiner: Query AI-CHAIN.2 anchors to reconstruct the full agent collaboration graph. Each edge in the graph includes the trust level and credential verification status.

12. Agentic SOAR and Incident Response

AI-INCIDENT.1

Incident Classification and Response Attestation

Anthropic requires: Defensive operations that match AI-accelerated attacker speed. Agentic SOAR capabilities.

How SWT3 addresses it: AI-INCIDENT.1 records incident classifications, severity levels, and response actions. Combined with the OTel exporter, witness anchors can feed directly into existing SIEM/SOAR platforms (Splunk, Datadog, Elastic) for automated response orchestration. The Regulatory Webhook API pushes compliance events to external GRC tools in real time. SWT3 provides the evidence layer for SOAR, not the orchestration engine itself.

What to show the examiner: Present the OTel export configuration and show incident anchors flowing into the organization's SIEM. The witness evidence stream provides the audit trail that SOAR playbooks act upon.

13. Breach Assumption and Audit

AI-AUDIT.1

Audit Trail Integrity

Anthropic requires: Comprehensive breach assumption posture. All actions auditable even after a compromise.

How SWT3 addresses it: The witness ledger is append-only and stored outside the model runtime. Even if the AI agent is fully compromised, the witness records have already been transmitted to an independent infrastructure. The Merkle accumulator computes daily rollup roots, allowing any historical anchor to be proven consistent with all others. The Write-Ahead Log (WAL) provides local forensic recovery with sequence numbers and checkpoint verification. The deterministic fingerprint formula means any anchor can be independently recomputed and verified without trusting the original minting system.

What to show the examiner: Use the /verify endpoint to verify any anchor's fingerprint. Request the Merkle proof via /api/v1/merkle/proof?fingerprint=xxx to demonstrate tamper-evident consistency with the daily rollup.

14. Quick Start

Add SWT3 witnessing to an Anthropic Claude agent deployment:

# Python
pip install swt3-ai

from swt3_ai import SWT3Witness
witness = SWT3Witness(
    tenant_id="YOUR_TENANT",
    agent_id="claude-agent-prod",
    signing_key_env="SWT3_SIGNING_KEY"
)

# Witness every inference
result = witness.witness(
    model="claude-sonnet-4-6",
    prompt_hash=sha256(prompt),
    response_hash=sha256(response),
    procedure="AI-INF.1"
)

# Witness tool calls
result = witness.wrap_tool("search_database", input_data, output_data)

# TypeScript
npm install @tenova/swt3-ai

import { SWT3Witness } from '@tenova/swt3-ai';
const witness = new SWT3Witness({
  tenantId: 'YOUR_TENANT',
  agentId: 'claude-agent-prod',
  signingKeyEnv: 'SWT3_SIGNING_KEY'
});

// Witness every inference
const result = await witness.witness({
  model: 'claude-sonnet-4-6',
  promptHash: sha256(prompt),
  responseHash: sha256(response),
  procedure: 'AI-INF.1'
});

// Witness tool calls
const result = await witness.wrapTool('search_database', inputData, outputData);

Positioning note: SWT3 is an independent witness protocol. It does not modify, filter, or intercept Claude's inference. It records what happened and whether it matched declared policy. Anthropic provides the AI agent. SWT3 provides the independent evidence that the agent operated within its authorized boundaries. Both are necessary. Neither can replace the other.

15. Coverage Summary

Anthropic Requirement	SWT3 Procedures	Coverage
Cryptographic agent identity	`AI-ID.1`	Full
Task-scoped permissions	`AI-ACC.1`, `AI-TOOL.1`	Full
Prompt-to-action traceability	`AI-INF.1`, `AI-CHAIN.1`	Full
Tool containment	`AI-TOOL.1`	Full
Prompt injection defense	`AI-GRD.1`, `AI-GRD.2`	Partial
Supply chain security	`AI-SUPPLY.1`, `AI-SBOM.1`	Full
Memory poisoning defense	`AI-SKILL.2`	Partial
Multi-agent coordination	`AI-MULTI.1`, `AI-CHAIN.2`	Full
Agentic SOAR	`AI-INCIDENT.1`	Partial
Breach assumption / audit	`AI-AUDIT.1`	Full
Identity/privilege abuse	`AI-ACC.1`, `AI-SEC.1`	Full

8 of 11 requirements: Full coverage through direct witnessing and evidence production.

3 of 11 requirements: Partial coverage. SWT3 witnesses the control state but the primary defense mechanism (prompt filtering, memory isolation, SOAR orchestration) requires additional tooling.