view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency Jan 30, 2025 • 233
Phi-4 (All Versions) Collection Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes • 20 items • Updated 1 day ago • 78