Ertas + OpenClaw for AI Agencies
AI agencies deploying OpenClaw for clients face the same margin-killing problem as every API-dependent tool: per-token costs that scale per client. Ertas enables agencies to run per-client fine-tuned models locally, eliminating API costs entirely while delivering better domain-specific results than generic cloud models.
The Challenge
OpenClaw is the most capable AI agent framework available — autonomous task execution through messaging platforms that clients already use, browser automation, file management, email triage, and cron-based monitoring. For AI agencies, it is a natural evolution from simple chatbot deployments to full-spectrum AI assistants.
But the economics do not work at scale with cloud APIs. Every client's OpenClaw instance routes every interaction through GPT-4o or Claude — generating per-token charges on every email triaged, every report generated, every message sent. A single active client can cost AU$150-300/month in API pass-through. Across 10-20 clients, the agency is looking at AU$2,000-4,000/month in variable costs that eat directly into fixed-retainer margins.
Worse, the costs are unpredictable. A client's marketing campaign goes viral and their support volume triples? The API bill triples too — but the retainer stays the same. An OpenClaw cron job scans an inbox every 30 minutes? That is steady token throughput 24/7, burning through credits even when nothing actionable happens.
The differentiation problem is equally pressing. If every agency deploys OpenClaw with the same GPT-4o backend, clients are effectively getting the same AI — just wrapped in different branding. There is no moat. A technically inclined client can replicate the setup themselves.
The Solution
Ertas transforms agency OpenClaw deployments from API pass-through to proprietary AI infrastructure. The model: fine-tune a per-client LoRA adapter on each client's domain data, deploy all adapters on a shared base model running locally via Ollama, and point each client's OpenClaw instance at the local endpoint.
The economics flip immediately. Instead of AU$150-300/month per client in API costs, inference is free after a one-time hardware investment. A Mac Studio or RTX 4090 server handles 15-20 concurrent client adapters comfortably. The hardware pays for itself in 4-6 weeks. Every client added after that is pure margin.
The quality improvement is the real selling point. A fine-tuned model for a real estate client has learned that client's listings, pricing terminology, and buyer communication style. A fine-tuned model for a dental practice knows the practice's appointment types, insurance panels, and patient communication tone. Generic GPT-4o approximates from a system prompt; a fine-tuned model internalises from training data. Agencies can demonstrate measurable accuracy improvements to clients — which justifies premium pricing and creates switching costs that generic API access never provides.
Key Features
Per-Client Fine-Tuning
Studio enables agencies to fine-tune a LoRA adapter for each client from a shared base model. Upload client conversation logs, product catalogues, or domain data, configure a training run, and produce an adapter that captures that client's specific needs — without managing GPU infrastructure.
Multi-Tenant Agent Serving
Cloud supports deploying a single base model with per-client LoRA adapters loaded dynamically at inference time. Each client's OpenClaw instance routes to the correct adapter automatically. Scale from 5 to 50 clients without proportional infrastructure growth.
Client Data Isolation
Vault enforces strict data boundaries between clients. Each client's training data, adapter weights, and inference logs are encrypted and access-controlled separately. No cross-contamination between client environments — satisfying the data sovereignty requirements that enterprise clients demand.
Model Marketplace for Agencies
Hub provides access to base models optimised for common agency verticals — customer support, email triage, report generation, scheduling. Agencies can benchmark models against client requirements before fine-tuning, reducing time-to-deployment for new client onboarding.
Example Workflow
A Sydney-based AI automation agency manages OpenClaw deployments for 12 small business clients across real estate, hospitality, and professional services. Each client has an OpenClaw agent handling email triage, appointment scheduling, and customer inquiry responses through WhatsApp and email. On cloud APIs, the agency's monthly API spend is AU$2,800 — with three high-volume clients (a real estate agency, a boutique hotel, and an accounting firm) accounting for AU$1,500 of that total. The agency migrates to Ertas. For each client, they export 3-6 months of conversation history, format as JSONL training data, and fine-tune a LoRA adapter in Ertas Studio. Training takes 30-60 minutes per client. All 12 adapters are deployed on a single Mac Studio M2 Ultra (AU$5,500) running Ollama, with adapter hot-swapping based on which client's OpenClaw instance is making the request. After migration, the agency's monthly AI inference cost drops from AU$2,800 to AU$14.50 (Ertas subscription) plus electricity. The hardware breaks even in under 8 weeks. More importantly, client satisfaction improves: the real estate client's inquiry classification accuracy jumps from 76% (GPT-4o) to 93% (fine-tuned model), and the hotel's booking confirmation responses match their brand voice so closely that guests cannot distinguish them from human staff. The agency now sells the fine-tuned model as a premium differentiator — justifying a 30% higher retainer than competitors who deploy generic API-based agents.
Compliance & Security
Local deployment means no client data is transmitted to third-party AI providers. Per-client LoRA adapters with Vault's encryption and access controls provide the data isolation that enterprise and regulated-industry clients require. Agencies can provide written guarantees that client data is processed exclusively on controlled infrastructure — a competitive advantage in RFP responses and enterprise procurement processes.
Related Resources
Adapter
Fine-Tuning
GGUF
Inference
LoRA
Multi-Tenant Inference
OpenClaw for Agencies: Per-Client AI Agents Without the API Bill
How to Power OpenClaw with Fine-Tuned Local Models (No API Costs)
OpenClaw + Fine-Tuned Models vs. OpenClaw + GPT-4: A Practical Comparison
How to Cut Your AI Agency Costs by 90% with Fine-Tuned Local Models
Multi-Tenant AI Deployment: One Base Model, Dozens of Client Adapters
White-Label AI: Build Custom Models for Every Client
Make.com
n8n
Ollama
OpenClaw
Ertas for Customer Support
Ertas for AI Automation Agencies
Ship AI that runs on your users' devices.
Early bird pricing starts at $14.50/mo — locked in for life. Plans for builders and agencies.