SOC 2 for AI Agents
What SOC 2 actually is
SOC 2 (Service Organization Control 2) is an attestation framework published by the AICPA. An independent CPA firm audits the vendor against the AICPA's Trust Services Criteria and produces a report. The report is confidential and shared by the vendor under NDA.
Two kinds of report:
- Type I — point-in-time. "On 1 March 2026, the vendor had these controls in place and they were designed appropriately." Useful for small vendors getting started; insufficient for enterprise procurement.
- Type II — operating effectiveness over a window (typically 6-12 months). The auditor not only confirms the control existed but tested samples to prove it actually ran. This is what enterprise security teams ask for.
The five Trust Services Criteria, mapped to AI agent risk
- Security — access controls, network controls, change management, vendor risk, incident response. For agents: who can deploy a new agent, what reviews are needed, how quickly are vulnerabilities patched. The most-tested principle.
- Availability — system uptime SLAs, monitoring, failover. For agents: what happens when the LLM provider has an outage, do agents queue or fail, can you roll back to a previous version.
- Processing integrity — does the system process data completely, accurately, validly. For agents: this maps to deterministic post-processing, schema validation, tool-call safety. Importantly: SOC 2 processing-integrity does not test LLM hallucination — that's a separate concern.
- Confidentiality — non-public information stays non-public. For agents: prompts, retrieved RAG content, agent execution logs, model weights are all confidential data. Encryption at rest, encryption in transit, key management, access logging.
- Privacy — personally identifiable information handled per the vendor's privacy notice. For agents: GDPR-adjacent. The privacy principle is optional for SOC 2 reports; many vendors include only the first four.
Why SOC 2 matters more in 2026 than it did pre-LLM
Three changes specific to AI agents push the SOC 2 conversation harder:
- Prompts are sensitive. An agent's prompt history can contain customer names, contract details, source code, financial data. The Confidentiality principle now covers a broader, more sensitive surface.
- Agents execute tools that affect external state. An agent can send emails, file refunds, modify CRM records. The Security principle now needs to address tool-call governance, not just data access.
- Sub-processor sprawl. A single AI agent platform routes through OpenAI / Anthropic / a vector DB / monitoring vendor / customer-support tools. SOC 2's vendor-risk requirements now demand more transparency about which sub-processor sees what.
What SOC 2 does NOT cover
The honest 2026 reality: SOC 2 was written before LLMs. It does not specifically test:
- Prompt injection resilience
- Training-data provenance and licensing (NIST AI RMF / ISO 42001 territory)
- Model evaluation rigor — accuracy, fairness, robustness
- Hallucination rate measurement
- Agent-decision audit trail completeness ("can you reconstruct exactly what the agent did and why?")
- Sub-processor model retention policies for your prompts
SOC 2 Type II is necessary but insufficient for AI-agent procurement. Buyers in regulated sectors should also ask about NIST AI RMF, ISO/IEC 42001 (the AI management system standard published 2023), and any AI-specific addenda the vendor offers. For the privacy and decision-making counterparts, see AI Agents & GDPR, EU AI Act compliance, and California ADMT.
Vendor questions to ask
Once you receive a SOC 2 Type II under NDA:
- What is the report's observation window? Is it current (within the last 12 months)?
- Which Trust Services Criteria are in scope? (Security alone is the minimum; Availability + Confidentiality common; Privacy + Processing Integrity rarer.)
- Are there any qualified opinions, exceptions, or deviations? Read the auditor's notes carefully.
- Are sub-processors (LLM providers, vector DBs) listed and covered by their own SOC 2 or equivalent?
- What is the customer's responsibility under the "Complementary User Entity Controls" section? You usually inherit some duties.
- How is the agent's execution log retained and where?
- What is the breach-notification timeline if a sub-processor (e.g. the LLM provider) is breached?
- Does the vendor align with NIST AI RMF or ISO/IEC 42001? (Not yes-or-no; ask which controls.)
Which agent platforms have SOC 2 Type II in 2026
Most enterprise-tier AI agent platforms publish or share SOC 2 reports under NDA. As of mid-2026: Workato (Type II + ISO 27001), Voiceflow (Type II), Botpress (Type II), Relevance AI (Type II in progress at last check). Smaller / newer platforms (Gumloop, Lindy) ship security pages that document controls but may not yet have a Type II report — request status.
For high-stakes deployments, do not deploy on a vendor without at least Type I in progress. For regulated sectors (healthcare, finance, government), require Type II + NIST AI RMF alignment up front; if the vendor cannot provide it, walk away.