DeepSeek V4
BenchLM Aggregate: 87
DeepSeek V4 currently leads the BenchLM aggregate intelligence index at 87 — narrowly ahead of Kimi K2.6 and substantially ahead of every other open-weight model. The V4 Pro variant (1.6T total / 49B active MoE) combined with its 1M token context window narrows the gap with frontier closed-source models more than any prior open-weight release. The DeepSeek License is permissive enough for nearly all commercial use cases. The downside is scale — V4 Pro deployment requires multi-GPU server infrastructure, putting it out of reach for single-GPU or workstation-class deployments.
Strengths
- Currently #1 open-weight model on aggregate intelligence benchmarks
- 1M token context window with DeepSeek Sparse Attention efficiency
- Unified thinking mode in a single checkpoint (no separate R1-style deployment needed)
- DeepSeek License is broadly commercial-friendly
Trade-offs
- V4 Pro requires multi-GPU server (8x A100 80GB or equivalent) — not workstation deployable
- Smaller V4 Flash variant still needs 4x GPUs at minimum