SOC 2 & AI Compliance

Achieving SOC 2 compliance for AI systems with audit trails and access controls

Overview

SOC 2 (System and Organization Controls 2) is a compliance framework developed by the American Institute of Certified Public Accountants (AICPA) that evaluates an organization's information systems against five Trust Services Criteria: security, availability, processing integrity, confidentiality, and privacy. Unlike prescriptive regulations, SOC 2 is a principles-based framework where organizations define their own controls and an independent auditor assesses their design and operating effectiveness.

For AI and machine learning organizations, SOC 2 compliance has become a de facto requirement for enterprise sales. Customers in financial services, healthcare, technology, and other regulated industries increasingly demand SOC 2 Type II reports before approving vendors who will handle their data. A SOC 2 Type II report covers a minimum observation period of six months and demonstrates not just that controls exist, but that they operated effectively throughout that period.

AI systems introduce unique considerations for SOC 2 audits. Training data pipelines, model versioning, inference endpoints, and automated decision-making systems all fall within the scope of SOC 2 assessments. Auditors evaluate how organizations protect training data confidentiality, ensure processing integrity of model outputs, maintain system availability for production AI services, and safeguard the privacy of individuals whose data appears in training datasets. Organizations that cannot demonstrate robust controls across these areas risk audit findings that delay or prevent enterprise customer acquisition.

AI-Specific Requirements

The Security trust service criterion (formerly the Common Criteria) forms the foundation of every SOC 2 engagement and is the only mandatory category. For AI systems, security controls must address access management for training data and model artifacts, change management for model deployments, risk assessment procedures for AI-specific threats, and monitoring of AI system components. Organizations must implement logical access controls that restrict who can view, modify, or delete training datasets and model weights.

Processing Integrity is particularly critical for AI systems because it requires that system processing is complete, valid, accurate, timely, and authorized. For machine learning pipelines, this means demonstrating that training data transformations produce correct results, that model inference outputs are consistent and reliable, and that data pipeline failures are detected and resolved. Organizations must maintain evidence that their AI processing produces outputs that meet defined quality thresholds and that any anomalies trigger appropriate alerts and remediation workflows.

Confidentiality and Privacy criteria require organizations to protect sensitive information throughout its lifecycle. For AI teams, this encompasses training data containing customer information, proprietary model architectures and weights, evaluation datasets, and inference logs that may contain sensitive inputs. Controls must address data classification, encryption at rest and in transit, secure disposal of data no longer needed, and restrictions on data sharing with third parties. The Privacy criterion adds additional requirements around notice, choice, consent, collection limitation, and data quality that align closely with GDPR and CCPA obligations.

How Ertas Helps

Ertas provides the foundational infrastructure controls that SOC 2 auditors evaluate. Ertas Data Suite's comprehensive audit logging records every data access, transformation, and model operation with timestamps, user identities, and action descriptions. This audit trail directly satisfies SOC 2's monitoring and logging control requirements, providing auditors with verifiable evidence that data access is tracked and anomalous activities can be identified. The logs are immutable and tamper-evident, meeting the integrity requirements that auditors expect.

The on-premise architecture of Ertas Data Suite strengthens your SOC 2 security posture by eliminating data transmission to third-party infrastructure. All training data processing occurs within your organization's controlled environment, reducing the attack surface and simplifying the scope of your SOC 2 assessment. Fewer third-party dependencies mean fewer vendor risk assessments, fewer sub-service organization considerations, and a cleaner system description in your SOC 2 report. The air-gapped deployment option further strengthens security controls for organizations handling highly confidential data.

Ertas Studio's Vault feature provides the encryption and access control capabilities that SOC 2 demands. Data is encrypted at rest using industry-standard algorithms, and role-based access controls enforce the principle of least privilege for all users interacting with training data and models. The structured workflow captures change management evidence automatically — when models are retrained, datasets are updated, or configurations are modified, the system records who made the change, what was changed, and when. This continuous evidence collection transforms SOC 2 audit preparation from a disruptive annual exercise into a natural byproduct of your AI development process.

Compliance Checklist

Comprehensive audit logging with tamper-evident recordsSupported

Role-based access controls for data and model artifactsSupported

Encryption at rest for all stored data and modelsSupported

Change management tracking for model and data updatesSupported

On-premise deployment reducing third-party riskSupported

System availability monitoring and alertingPartial

Formal risk assessment and treatment documentationCustomer Responsibility

Independent SOC 2 audit engagement and reportingCustomer Responsibility

Relevant Ertas Features

Immutable audit logging
Role-based access controls
Vault encryption at rest
On-premise air-gapped deployment
Change management evidence capture
Data lineage and provenance tracking

Ship AI that runs on your users' devices.

Free plan with 30 credits/mo, no card required. Paid plans from $25/mo USD.

or view pricing →