System requirements
Hardware and OS floors for running Ertas in a browser and for running an Ertas-exported GGUF on iOS, Android, desktop, and web.
There are two distinct system-requirements questions, and they have very different answers.
The first: what do I need to run Ertas? Ertas is a browser app; GPUs are provisioned in the cloud. The bar is low.
The second: what do my users need to run the model I trained? This is where most of the real constraints live. The Ship section lays out the specifics per platform; this page collects them into one lookup.
Running Ertas (browser-side)
Ertas runs entirely in a modern browser. There is no native install, no GPU required on your machine, and no Python.
| Requirement | Minimum | Recommended |
|---|---|---|
| Browser | A modern evergreen browser (recent Chrome, Edge, Safari, or Firefox) | Latest of any of the above |
| RAM | 4 GB | 8 GB |
| Network | Stable connection during run setup and live monitoring | Stable connection throughout |
| Storage | Negligible (no local install) | Negligible |
Training itself happens on Ertas-managed T4 or A10G GPUs; nothing about your hardware affects training speed. Dataset uploads are bounded by your network upload bandwidth and your plan's dataset storage cap, both of which live on the Pricing page.
Mobile browsers work but are uncomfortable. Studio's canvas is designed for keyboard and mouse. It loads on iPad and Android tablets but the build canvas is fiddly on touch. Plan to set up your project on a laptop and use mobile only for run monitoring.
Running an Ertas-exported model on-device
This is the part where the specific hardware floors matter. The exported GGUF is what runs on the user's device, and the floor depends on the model size and the platform.
iOS
| Aspect | Minimum | Recommended | Source |
|---|---|---|---|
| iOS version | 16.4 (for Metal GPU backend) | 17.0 | Ship: iOS |
| Device class | iPhone 14, iPad Pro 2022 | iPhone 14 Pro and newer | Ship: iOS |
| Device RAM | 6 GB (for 1B-class) | 8 GB and up | Ship: iOS |
| Free storage | 1 GB (for the GGUF and KV cache) | 5 GB | Ship: model delivery |
A 1B-class Q4_K_M model holds about 700 MB of native RAM plus the KV cache. A 3B-class model holds about 2.0 GB. On a 4 GB device the OS will kill the app under memory pressure; on a 6 GB device a 1B-class model is usable but a 3B-class will be tight.
For the integration mechanics, see Ship: iOS. For the delivery story (App Store size limits, on-demand resources, direct download), see Ship: model delivery and UX.
Android
| Aspect | Minimum | Recommended | Source |
|---|---|---|---|
| Android version | 7.0 (API 24) for the llamadart Flutter path | 9.0 (API 28) and up | Ship: Android |
| ABI | arm64-v8a (or x86_64 emulator) | arm64-v8a | Ship: Android |
| Device RAM | 6 GB (for 1B-class) | 8 GB and up | Ship: Android |
| Free storage | 1 GB (for the GGUF and KV cache) | 5 GB | Ship: model delivery |
Apple Intelligence requires iPhone 15 Pro (8 GB) and Pixel 8 Pro requires 12 GB for Gemini Nano. Ertas's 6 GB floor for a 1B-class model is on the lower end of the industry's positioning; surface the requirement in your store listing so users on 4 GB devices are not surprised.
Desktop
| Platform | Minimum | Recommended | Source |
|---|---|---|---|
| macOS | 11 Big Sur, Intel or Apple Silicon | macOS 13 and up, Apple Silicon | Ship: desktop |
| Windows | 10 (1809) or 11 | Windows 11 | Ship: desktop |
| Linux | Any modern distro with glibc 2.31 and up | Ubuntu 22.04 LTS and up | Ship: desktop |
| RAM | 8 GB (for 3B at Q4_K_M) | 16 GB | Ship: desktop |
| GPU (optional) | None required; CPU works | NVIDIA RTX 30 series and up, or Apple Silicon | Ship: desktop |
| Free storage | 5 GB (for a 3B model and Ollama install) | 20 GB | Ship: model delivery |
Desktop is the most forgiving target. The Ollama bundle Ertas ships works on Intel Macs, Apple Silicon Macs, Windows, and Linux out of the box. CUDA acceleration on a discrete NVIDIA GPU gives a 2-to-10x speedup over CPU; without one, modern CPUs still hit 5 to 15 tok/s on a 3B-class model.
Web (browser)
| Aspect | Minimum | Recommended | Source |
|---|---|---|---|
| Browser | Chrome 113, Edge 113 (WebGPU), or Safari 18 / iOS 17+ | Latest Chrome or Safari with WebGPU | Ship: web |
| Device RAM | 6 GB system RAM (for 1B-class in-browser) | 8 GB and up | Ship: web |
| Storage | 3 GB OPFS / IndexedDB quota for the model | 5 GB | Ship: web |
WebGPU support is the gating factor. Without it, wllama falls back to pure WebAssembly which is roughly 5x slower. Firefox enabled WebGPU support in 141 (2026); Safari shipped it in 18 / iOS 17. Older browsers still load the model but inference is uncomfortably slow.
Browser storage quotas vary. Chrome lets you use up to 80% of free disk space; Firefox is similar; Safari historically capped at 1 GB but has lifted to roughly 60% on macOS 14 and iOS 17. The 3 GB minimum reflects what a 3B-class Q4_K_M model needs plus working space; smaller models drop the requirement proportionally.
Storage planning
The model file size is the dominant storage cost, not the runtime. A summary:
| Model class | GGUF size (Q4_K_M) | Native RAM at runtime | Useful storage budget |
|---|---|---|---|
| 1B | ~0.8 GB | ~700 MB + KV cache | 1 GB minimum, 3 GB headroom |
| 3B | ~2.1 GB | ~2.0 GB + KV cache | 3 GB minimum, 5 GB headroom |
| 7B | ~4.5 GB | ~4.5 GB + KV cache | 5 GB minimum, 10 GB headroom |
| 14B | ~9.0 GB | ~9.0 GB + KV cache | 10 GB minimum, 20 GB headroom |
The KV cache scales with sequence length: about 50 to 200 MB for 1B-class models at a typical chat context, more for larger models or longer contexts. See Performance tips for the trade-offs around context length and the KV cache size.
What's next
Ship
The full integration story for each platform, with the hardware floors in context.
File sizes and formats
How the GGUF size column above was computed.
Performance tips
Tuning for the device class you ship onto.
Model delivery and UX
How the model gets onto the user's device once you know the requirements.