
从OpenAI API迁移到微调本地模型:90天手册
将最高量AI工作负载从云API迁移到自有微调模型的90天具体计划——包含评估框架、训练指南和并行切换策略。
这不是理论指南,而是90天运营计划,用于将真实AI工作负载从云API迁移到你拥有和本地运行的微调模型。
第1-30天:构建评估基础
- 构建200-500个高质量输入/输出对的训练数据集
- 构建50-100个保留示 例的评估集
- 在评估集上建立API基线指标
- 定义验收标准(通常±5%的API基线)
第31-60天:微调和验证
- 选择基础模型(Llama 3.1 8B、Qwen 2.5 7B等)
- 在Ertas Studio上微调
- 对比API基线评估
- 不达标则迭代:扩展数据、调整LoRA秩、尝试更大模型
第61-90天:并行部署和切换
- 第9周:部署模型,路由10%流量
- 第10周:25%流量
- 第11周:50%流量
- 第12周:100%流量,保留API作为后备30天
第90天你拥有什么
- 锁定的模型版本
- 确定性行为
- 零每查询成本
- 完全可移植性
- 治理完整性
Ship AI that runs on your users' devices.
Early bird pricing starts at $14.50/mo — locked in for life. Plans for builders and agencies.
Keep reading

What AI Model Ownership Actually Means (and Why It Matters More Than the API Price)
Ownership in AI isn't about having an API key. It's about possessing model weights, controlling behavior, and eliminating the dependency that comes with renting intelligence from someone else.

Model Distillation Is Not Theft — But Here's Why You Should Do It Yourself
Model distillation is a legitimate ML technique — every major lab does it. The DeepSeek incident was a contractual violation, not theft. Here's why fine-tuning open-source models on your own data is the ethical, legal, and strategically sound path.

Fine-Tuning vs. Distillation: Which One Actually Gives You an AI Moat?
Distillation copies generic capabilities from larger models. Fine-tuning on your own data creates domain-specific capabilities nobody else has. One gives you a cheaper clone — the other gives you a competitive moat.