SWT3 Design Rationale

Why Every Protocol Decision Was Made

Version:

1.0.0

Date:

2026-04-24

Companion to:

SWT3 Protocol Specification v1.3

License:

Apache 2.0

Purpose. The SWT3 Protocol Specification defines what the protocol does. This document explains why each design decision was made, what alternatives were considered, and why they were rejected. It is intended for patent examiners evaluating non-obviousness, standards reviewers assessing design rigor, technical diligence teams evaluating the protocol's depth, and implementors who want to understand the constraints that shaped the design space.

Cryptographic Choices
Formula Design
Factor Architecture
Clearing Protocol
Anchor Format
UCT Taxonomy
Alternatives Considered and Rejected
Infrastructure-Layer Witnessing
Evidence Custody and Policy Witnessing

1. Cryptographic Choices

Why SHA-256?

Decision: SHA-256 is the sole hash primitive used in SWT3 for fingerprinting, prompt/response hashing, model hashing, and signing (via HMAC-SHA256).

FIPS 140-2 validated. Required for federal and defense adoption. Any protocol targeting GovCon, FedRAMP, or CMMC must use FIPS-approved algorithms. SHA-256 is approved under FIPS 180-4.
Universal availability. SHA-256 is in the standard library of every major language: Python hashlib, Node.js crypto, Rust sha2, C# System.Security.Cryptography, Ruby OpenSSL, Go crypto/sha256, Java MessageDigest. No external dependency required in any language.
Hardware acceleration. Modern x86 CPUs include the SHA-NI instruction set, making SHA-256 computation effectively free in terms of latency overhead.

Rejected: SHA-3 (Keccak). Less ubiquitous in standard libraries. Some embedded platforms and older runtimes do not include SHA-3. No meaningful security advantage over SHA-256 for this use case (hashing structured compliance data, not resisting quantum attacks).

Rejected: BLAKE3. Faster than SHA-256 on software-only benchmarks, but not FIPS validated. Not available in most language standard libraries (requires external crate/package). Speed is not the bottleneck for compliance anchoring; the inference call dominates latency by orders of magnitude.

Rejected: MD5 and SHA-1. Both have demonstrated collision attacks. Disqualified by NIST SP 800-131A for new applications. Using either would undermine the integrity guarantee that is the core value of SWT3.

Why 12 hex characters (48 bits) for fingerprint truncation?

Decision: The anchor fingerprint is the first 12 characters of the lowercase hexadecimal SHA-256 digest. The full 64-character digest SHOULD be stored alongside the anchor.

Collision resistance. 48 bits gives a birthday paradox threshold of approximately 2²⁴ (~16.7 million) anchors before a 50% collision probability. A typical compliance ledger produces tens of thousands of anchors per tenant per year. Even at 100,000 anchors/year for 100 years, the collision probability remains negligible.
Human readability. 12 characters fit in a terminal column, a spreadsheet cell, and a printed assessment report without wrapping. Assessors compare fingerprints visually during site visits.
Anchor token length. The full anchor (SWT3-E-AWS-AI-AIINF1-PASS-1774800000-c059eb5938c0) is already 50+ characters. A 64-character fingerprint would push it past 100 characters, breaking copy-paste usability.

Rejected: 8 characters (32 bits). Birthday threshold drops to ~65,000. A moderately active enterprise ledger could hit collisions within a few years. Insufficient.

Rejected: 16 characters (64 bits). Birthday threshold rises to ~4 billion. No practical security gain over 12 characters for compliance ledgers. Wastes 4 characters in every anchor token for no measurable benefit.

Why HMAC-SHA256 for payload signing?

Decision: Payload signing uses HMAC-SHA256 with a shared symmetric key. The signature message is {fingerprint}:{agent_id} if agent identity is present, or just {fingerprint} otherwise.

Symmetric key simplicity. No PKI infrastructure, no certificate rotation, no revocation checking. The signing key is shared between the agent and the witness endpoint. This matches the trust model: the agent and the platform already share an API key.
Same primitive. SHA-256 is already required for fingerprinting. HMAC-SHA256 adds no new cryptographic dependency.
FIPS 198-1 validated. HMAC is a FIPS-approved construction.

Rejected: Ed25519 (asymmetric). Requires public/private key pair management. Overkill for the agent-to-platform trust model where both parties already share credentials. Implementations that require asymmetric guarantees can layer Ed25519 on top of the base signing.

2. Formula Design

Why the WITNESS: domain separator prefix?

Decision: The fingerprint input string begins with the literal WITNESS: prefix, followed by colon-separated fields.

Cross-protocol collision prevention. Without a domain separator, a SHA-256 hash of unrelated data could accidentally match a valid SWT3 fingerprint. The WITNESS: prefix makes this effectively impossible.
Unambiguous legacy detection. Pre-v1.1.0 anchors were minted without the prefix. Verifiers try the canonical formula first; if verification fails, they fall back to the legacy formula. The prefix is the disambiguation signal.
Standard practice. Domain separation is a well-established pattern in cryptographic protocols. TLS, HKDF, and Merkle trees all use domain separators to prevent cross-context hash collisions.

Why colon-separated plaintext?

Decision: The fingerprint input is a colon-separated string: WITNESS:{tenant}:{proc}:{fa}:{fb}:{fc}:{ts_ms}

Deterministic. The bytes of a colon-separated UTF-8 string are identical in every language and on every platform. No ambiguity.
No serialization dependency. Constructing the input requires only string concatenation. No JSON library, no protobuf compiler, no binary encoder.
Debuggable. A developer can construct the input by hand and verify with echo -n "WITNESS:..." | sha256sum | cut -c1-12. This is the single most important property for developer trust.

Rejected: JSON. JSON objects have no guaranteed key ordering in most languages. {"a":1,"b":2} and {"b":2,"a":1} are semantically identical JSON but produce different SHA-256 hashes. This would break cross-language fingerprint parity. Canonical JSON (RFC 8785) exists but adds complexity and is not widely implemented.

Rejected: Binary / Protocol Buffers. Requires schema definition, compilation step, and deserialization tooling. Opaque to human inspection. Cannot be debugged with shell one-liners. Adds a dependency that provides no benefit for a 7-field input string.

Why millisecond precision?

Collision prevention. Two observations in the same second with identical factors would produce identical fingerprints if the timestamp used seconds. Millisecond precision makes same-second collisions extremely unlikely.
Universal availability. Millisecond timestamps are reliably available on every platform including browsers, embedded systems, and air-gapped environments. Microseconds and nanoseconds are not available on all platforms (notably, some JavaScript runtimes truncate to milliseconds for privacy reasons).
Dual representation. The anchor token uses epoch seconds (human-readable: 10 digits). The fingerprint uses milliseconds (cryptographic precision: 13 digits). This separation serves both audiences without compromise.

Why integer-valued floats stringify as integers?

Cross-language parity rule: If a factor value equals its integer representation (e.g., 1.0, 5000.0), it MUST be formatted without a decimal point ("1", "5000"). True fractional values retain their decimal representation ("1.5").

JSON number identity. In JSON, 1 and 1.0 are the same number. But str(1.0) in Python produces "1.0", while String(1.0) in JavaScript produces "1". Without the canonical rule, the same factors would produce different fingerprints in Python vs. JavaScript.
The rule eliminates divergence. By requiring integer-valued floats to format as integers, all SDK languages (Python, TypeScript, Rust, C#, Ruby) produce identical fingerprint input strings. This is verified by cross-language test vectors.

3. Factor Architecture

Why exactly three factors?

Decision: Every SWT3 witness observation produces exactly three numeric factors: factor_a (baseline/threshold), factor_b (observation), factor_c (context/delta).

Minimum viable evidence triple. Three factors cover every compliance check pattern observed in practice:
- Threshold comparison: "expected X, observed Y, delta Z"
- Presence check: "required 1, found 1, delta 0"
- Count verification: "minimum 3 guardrails, 4 active, all passed"
Fixed-width fingerprint input. The number of factors is part of the fingerprint formula. Adding or removing factors changes the formula, breaking all existing anchors. Three is the design point; it cannot change.
Storage efficiency. Three numeric fields fit in a single database row, a single JSON object, and a single log line. No nested structures required.

Rejected: Two factors. Loses the context dimension. "Latency was 8,000ms against a 5,000ms threshold" requires three values: threshold (5000), observation (8000), and whether this constitutes a violation (1). With two factors, the violation flag must be inferred, which introduces ambiguity.

Rejected: Five or more factors. Every additional factor that enters the fingerprint formula increases the cross-language parity surface (more values to format consistently). Any information beyond three factors can be encoded in the existing three through scaling or composition. The marginal value of a fourth factor does not justify the parity risk and backward compatibility cost.

Why integer representation?

Deterministic stringification. The integer 42 has exactly one string representation in every programming language: "42". Floating-point numbers do not have this property (IEEE 754 edge cases, locale-dependent formatting).
Natural fit. Real-world compliance factors are naturally integer: open port counts, milliseconds of latency, token counts, guardrail counts, boolean flags (0/1).
Fractional values via scaling. A 95.0% accuracy score is represented as 950 (per mille). This avoids floating-point entirely while preserving precision.

4. Clearing Protocol

Why exactly four levels?

Decision: SWT3 defines four discrete clearing levels: 0 (Retain), 1 (Factor-Only), 2 (Anchor-Only), 3 (Sovereign).

Maps to existing classification tiers. Regulated industries already operate with tiered data handling:
- Level 0 (Retain): Unclassified / internal analytics. Full forensic capability.
- Level 1 (Factor-Only): Standard production. Raw evidence purged, factors retained. RECOMMENDED default.
- Level 2 (Anchor-Only): Sensitive / PII / healthcare. Factors purged after fingerprinting. Satisfies HIPAA minimum necessary.
- Level 3 (Sovereign): Classified / defense / air-gapped. Nothing persists on the minting system. Satisfies ITAR and national sovereignty mandates.
Discrete, not continuous. Clearing is an auditable commitment, not a configuration slider. An auditor can verify: "this system operates at Level 2." A continuous scale would require defining what 63% clearing means.

Rejected: Three levels. Loses the distinction between "factors visible but raw data purged" (Level 1) and "factors also purged, only anchor remains" (Level 2). This distinction matters: in healthcare, retained factors may reveal PHI (factor_b = patient count). Level 2 addresses this. Three levels would force healthcare into Level 1 (too much data) or Level 3 (too aggressive).

Why irreversibility as a hard requirement?

The value proposition depends on it. SWT3's clearing guarantee is: "the data is gone, and the proof that it was correct survives." If clearing is reversible, the guarantee is hollow.
Regulatory alignment. GDPR Article 17 (right to erasure) requires actual deletion, not soft deletion or archiving. HIPAA minimum necessary requires that data not be retained beyond its operational purpose. Both mandate irreversibility.
Audit clarity. An assessor must be able to confirm: "at Clearing Level 2, the factors no longer exist on this system." If "soft delete" were allowed, the assessor would need to verify the deletion mechanism, the backup policy, the retention schedule, and the recovery procedure. Irreversibility eliminates this entire class of audit questions.

Why clearing is defined at the protocol level?

Portability. The clearing level is recorded in the anchor metadata. Any SWT3-conformant system can read and understand what clearing level was applied, regardless of which platform minted the anchor.
Vendor neutrality. If clearing were implementation-defined, each vendor could redefine "Level 1" to mean something different. Protocol-level definition prevents this. "Level 1" means the same thing everywhere.
Interoperability. An anchor cleared at Level 2 on Platform A is understood by Platform B without coordination, negotiation, or mapping tables.

5. Anchor Format

Why a self-describing string token?

Decision: An SWT3 anchor is a hyphen-separated string: SWT3-{TIER}-{PROVIDER}-{UCT}-{PROCEDURE}-{VERDICT}-{EPOCH}-{FINGERPRINT}

Human-readable. An auditor can read an anchor during a site visit without any tooling. The tier, provider, domain, procedure, verdict, time, and fingerprint are all visible in the token.
Greppable. grep SWT3- finds every anchor in a log file, a database dump, or a compliance report. No parsing required for discovery.
Copy-pasteable. Fits in an email subject line, a Slack message, a spreadsheet cell, a terminal column, and a printed report.
Self-contained verification. The fingerprint is embedded in the token. Given the factors, anyone can recompute the fingerprint and compare without database access.

Rejected: UUID. Opaque. 550e8400-e29b-41d4-a716-446655440000 tells you nothing about what it represents. Requires a database lookup to understand the content. Provides no self-verification capability.

Rejected: JSON blob. Multi-line. Requires JSON parsing to extract fields. Breaks in log files and terminal output where newlines are significant. Cannot be grepped with a simple pattern.

Why normalized procedure IDs in the anchor but original IDs in the fingerprint?

Anchor is for humans. SC76 is easier to read in a token than SC-7.6. Hyphens and dots in procedure IDs would conflict with the anchor's hyphen-separated format.
Fingerprint is for math. The original format SC-7.6 is the canonical identifier used in compliance frameworks (NIST 800-53, CMMC). The cryptographic input must use the canonical form to ensure the fingerprint is tied to the real control ID, not a derived abbreviation.
Lossless round-trip. The anchor can be parsed to extract the normalized ID, and the normalized ID can be expanded back to the original format using the UCT Registry. No information is lost.

6. UCT Taxonomy

Why framework-agnostic codes?

Decision: The Universal Control Taxonomy (UCT) assigns 3-letter domain codes (ACC, AUD, CFG, CRY, etc.) that are independent of any specific compliance framework.

Portability. A CRY anchor means "cryptography" whether the underlying check came from NIST SC-13, PCI-DSS Requirement 4, HIPAA encryption requirements, or ISO 27001 A.8.24. The UCT code is the bridge.
Longevity. If NIST renames a control family in a future revision, the UCT code remains stable. Anchors minted under the current NIST mapping do not become orphaned.
International scope. The EU AI Act, ISO 27001, SOC 2, and HIPAA do not use NIST family codes. A framework-specific taxonomy would require a separate code set for each jurisdiction. UCT provides one neutral layer that maps to all of them.

Why 3-letter codes?

Short enough to fit in the anchor token without bloating it.
Long enough to be semantically distinct. 2-letter codes would collide with existing identifiers (NIST families: AC, AU, CM are 2 letters; ISO clauses use 2-digit numbering).
Alphabetic only. No digits, no special characters, no case ambiguity. Always uppercase. Unambiguous in any font, any terminal, any printed report.

Why additive-only governance?

Anchor permanence. An anchor minted with UCT code ACC under Registry v1.0 must remain valid and meaningful under Registry v2.0, v3.0, and every future version. If a code were removed or redefined, historical anchors would lose their meaning.
Deprecation without removal. A code may be marked as deprecated with a recommended successor, but it MUST remain valid for parsing and verification indefinitely. This allows the taxonomy to evolve without breaking the historical record.
Precedent. This mirrors the IANA registry model: port numbers are assigned permanently. DNS record types are added but never removed. The stability of the registry is a prerequisite for the stability of the protocol.

7. Alternatives Considered and Rejected

Blockchain-based anchoring

Considered: Writing anchors to a public blockchain (Ethereum, Solana, or a purpose-built chain) for immutability guarantees.

Latency. Block confirmation times (seconds to minutes) are incompatible with real-time inference witnessing. A compliance check that takes 15 seconds to anchor is unusable in production.
Cost. Gas fees per anchor make high-volume witnessing (thousands of inferences per day) prohibitively expensive.
Dependency. A blockchain dependency means the protocol cannot function in air-gapped environments, which is a core requirement for Clearing Level 3 (Sovereign).
Unnecessary. SWT3's fingerprint achieves the same integrity guarantee using SHA-256 alone. The fingerprint is deterministic: given the same inputs, any party produces the same output. Immutability comes from the mathematics, not from a ledger.
Not prohibited. An implementation MAY additionally anchor to a blockchain as a supplementary immutability layer. The protocol does not require or prevent it.

Certificate-based signing (X.509 / PKI)

Considered: Signing each anchor with an X.509 certificate, creating a verifiable chain of trust from a Certificate Authority through the issuing system to the anchor.

Infrastructure overhead. PKI requires certificate issuance, rotation, revocation checking (CRL/OCSP), and trust store management. This is a significant operational burden for a compliance SDK that aims for zero-friction adoption.
Trust model mismatch. SWT3's signing is agent-to-platform, where both parties already share an API key. Asymmetric signing solves a problem (untrusted counterparties) that does not exist in this trust model.
Not prohibited. Implementations that require certificate-grade assurance (e.g., cross-organization evidence exchange) can layer X.509 signing on top of the base HMAC-SHA256 signature. The protocol is extensible without being dependent.

OpenTelemetry as the native evidence format

Considered: Using OpenTelemetry spans as the primary evidence format instead of a custom anchor token and factor structure.

Mutability. OTel spans allow attributes to be added after creation. An SWT3 anchor is immutable once minted. The fingerprint seals the factors at the moment of observation. Post-creation modification breaks the fingerprint.
No self-verification. OTel spans have no built-in mechanism for a third party to independently verify that the span data has not been altered. SWT3 fingerprints provide this by design.
No clearing semantics. OTel has no concept of data sovereignty or controlled purging. Spans are retained or dropped, with no protocol-level guarantee about what persists and what is destroyed.
Complementary, not competing. SWT3 exports TO OTel via the optional OTel exporter. Anchors appear as OTel spans in Datadog, Grafana, Jaeger, and other observability tools. SWT3 is the evidence source; OTel is a distribution channel.

8. Infrastructure-Layer Witnessing

Why a two-layer architecture for NVIDIA Dynamo?

Decision: The Dynamo adapter provides two layers: Layer 1 (decorator-based, zero Dynamo dependencies) and Layer 2 (service-graph integration via Dynamo's depends() and @service patterns).

Adoption layer (Layer 1). A Python decorator (@witness_endpoint()) that works with any async generator endpoint. Zero imports from NVIDIA code. Duck-types OpenAI-compatible request/response formats. Users add one decorator, set one environment variable (SWT3_DSN), and witnessing begins. This minimizes integration friction and works with or without the Dynamo runtime.
Infrastructure layer (Layer 2). A Dynamo-native @service class (WitnessInterceptor) that registers via depends(), injects compliance metadata into Dynamo's nvext request extensions, and publishes metrics through Dynamo's MetricsRegistry (swt3_witness_total, swt3_clearing_level). This provides infrastructure-team visibility without modifying application code.
Patent defensibility. Layer 1 proves the pattern works without any Dynamo-specific code. Layer 2 demonstrates deep integration with the infrastructure runtime. Together they cover both the general method and the specific embodiment.

Why the SWT3_DSN connection string?

Decision: Configuration via a single environment variable: SWT3_DSN=https://axm_live_xxx@sovereign.tenova.io/TENANT_ID

Single-env-var deployment. Infrastructure teams managing hundreds of endpoints need exactly one configuration touch point. The DSN pattern (used by Sentry, PostgreSQL, and database drivers) is well-understood by DevOps teams.
Fallback to individual variables. When DSN is not set, the adapter falls back to SWT3_ENDPOINT, SWT3_API_KEY, and SWT3_TENANT_ID. This supports environments where URL-embedded credentials are not permitted.
Graceful no-op. If neither DSN nor individual variables are set, the decorator passes through without error. This allows developers to add the decorator in development and activate witnessing in production by setting the environment variable.

Rejected: Configuration file. A YAML or TOML configuration file adds a deployment artifact that must be managed, versioned, and distributed alongside the application. Environment variables are universally available in containerized, serverless, and bare-metal environments without additional tooling.

Why duck-typing over explicit Dynamo imports?

Testability. Layer 1 can be tested without GPUs, without the Dynamo runtime, and without NVIDIA hardware. Tests use plain Python dictionaries that match the expected interface shape.
Portability. The same decorator works on vLLM, TGI, or any other inference server that exposes async generator endpoints. The witnessing pattern is not locked to a single vendor's runtime.
Dependency hygiene. Zero external imports means zero supply chain risk from the witnessing layer. The adapter adds no transitive dependencies to the user's environment.

9. Evidence Custody and Policy Witnessing

A cryptographic witness protocol is only as credible as the independence of the witness. If the agent being witnessed controls the witness process, the evidence is self-reported. Self-reported evidence has value, but it has a ceiling: any auditor, regulator, or opposing counsel will ask "what prevented the system from suppressing unfavorable records?" SWT3 addresses this through two mechanisms that exist not to provide security, but to ensure the trustworthiness of the evidence chain.

Why policy witnessing instead of policy enforcement?

The ChainEnforcer evaluates every tool call against the organization's declared policy: velocity limits, chain depth, token budgets, tool blocklists, and allowlists. When a tool call violates policy, the ChainEnforcer halts the call and mints a violation anchor recording what was attempted, which rule applied, and the tool name. The halt is a side effect. The anchor is the product.

This distinction matters because the valuable output is not the prevention of a bad action -- that is the responsibility of the guardrail, the firewall, and the access control layer. The valuable output is the proof that the organization had a policy, the policy was active, and the policy was either satisfied or violated at a specific moment in time. An auditor reviewing an evidence chain of PASS anchors will ask "were there any violations?" Without policy witnessing, there is no answer. With it, the violation anchors are the answer: here are the 3 times policy was triggered, here is what was attempted, here is the rule that applied.

The fail_secure: true configuration means "halt and record" rather than "allow and record." Both modes produce anchors. The halt mode produces anchors and prevents the unwitnessed action from proceeding. The organization chooses the mode based on their risk posture, not SWT3's opinion about what should be allowed.

Why an independent evidence custody daemon?

Without the sentinel daemon, SWT3 operates as a library inside the agent process. The agent calls witness.wrap(), the SDK hashes the inputs and outputs, and the anchor is written to the write-ahead log (WAL) in the same process space. This works. But it means the agent process has write access to its own evidence. A negligent developer could disable witnessing. A compromised agent could suppress violation anchors. A sophisticated adversary could modify the WAL.

The sentinel daemon resolves this by moving evidence custody outside the agent's trust boundary:

Separate process with its own key material. The signing key lives in the daemon, not in the agent. The agent cannot sign anchors directly -- it sends witness requests over IPC, and the daemon signs and stores them. The agent never touches the private key.
Protected write-ahead log. The WAL file is owned by the daemon process. The agent has no write access. Evidence cannot be altered or deleted by the agent.
Cross-process policy enforcement. Token budgets and velocity limits are maintained in the daemon's shared state. An agent cannot circumvent budget limits by restarting its own process, because the budget lives in the daemon.
Gap detection. If an agent stops sending witness requests, the gap in the evidence chain is itself evidence that witnessing was interrupted. The daemon's continuous operation makes silence visible.

This is not a security boundary in the traditional sense. The daemon does not detect intrusions, filter traffic, or block attacks. It ensures that the evidence chain is maintained by a process the agent does not control -- the same principle that requires financial audits to be conducted by an independent firm, not by the company being audited. The sentinel is the independent custodian of the witness record.

Why this is accountability infrastructure, not security tooling

Security tools prevent bad outcomes. Accountability tools prove what happened. Both are necessary. SWT3 is the second kind. A firewall blocks malicious traffic. A SIEM aggregates alerts. An EDR detects intrusions. SWT3 does none of these things. What SWT3 does is produce a cryptographic record that survives an audit, a regulatory inquiry, or a courtroom -- regardless of whether the outcome was good or bad.

The chain enforcer does not block tool calls because they are malicious. It halts them because they violate the organization's declared policy, and it records the violation as evidence. The sentinel daemon does not protect the system from attackers. It protects the evidence from the system. The distinction is not semantic. It determines which budget line pays for the tool, which team operates it, and what claims can honestly be made about what it does.

SWT3 is a notary, not a bodyguard. The notary does not prevent fraud. The notary proves what was signed, by whom, and when. That proof has value precisely because the notary is independent of the parties involved.

This guide is provided for informational purposes only and does not constitute legal, regulatory, or compliance advice. Regulatory mappings and crosswalk interpretations reflect the publisher's analysis and may not address all obligations applicable to your organization. Consult qualified legal counsel before making compliance decisions based on this content.