Phi-4
Quality at <10GB VRAM: Best in class
Microsoft's Phi-4 (14B dense) is approximately 8.5GB at Q4_K_M, which fits comfortably under the 10GB threshold while delivering exceptional capability per parameter. Phi-4 was engineered to punch above its weight class through careful curation of synthetic training data, and it competes with much larger general-purpose models on math, code, and reasoning benchmarks. Its MIT license makes it the strongest commercially deployable choice in this VRAM tier.
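The size figure above can be sanity-checked with a rough back-of-the-envelope calculation. The sketch below is illustrative only: the 4.85 bits-per-weight average for Q4_K_M is an assumed ballpark, not a number from this article, and the function and constant names are hypothetical. Real GGUF files vary with the exact parameter count and per-tensor quantization mix.

```python
def quantized_weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of quantized weights in decimal gigabytes."""
    total_bits = params_billion * 1e9 * bits_per_weight
    return total_bits / 8 / 1e9  # bits -> bytes -> GB


# Assumption: rough average bits per weight for Q4_K_M (not an exact figure).
ASSUMED_Q4_K_M_BITS = 4.85
# The VRAM budget used for this tier in the article.
VRAM_BUDGET_GB = 10.0

weights_gb = quantized_weight_gb(14, ASSUMED_Q4_K_M_BITS)
headroom_gb = VRAM_BUDGET_GB - weights_gb

print(f"weights ~ {weights_gb:.1f} GB, headroom ~ {headroom_gb:.1f} GB")
```

Under these assumptions the estimate lands near the ~8.5GB quoted above. Whatever remains under the budget has to cover the KV cache, activations, and runtime overhead, so usable context length depends on that remainder rather than on the weight size alone.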
Strengths
- MIT license: fully permissive for commercial use
- 14B parameters at ~8.5GB Q4_K_M leaves roughly 1.5GB of a 10GB budget for context (KV cache)
- Strong math and code reasoning for its parameter count
- Phi-4-mini (3.8B) and Phi-4-multimodal (5.6B) variants are available for tighter memory constraints
Trade-offs
- Heavy reliance on synthetic training data can introduce artifacts in informal language
- Trails larger models in broad multilingual capability