Distil Efficiency Benchmarks Collection Collection of models used in the blog post www.distillabs.ai/blog/the-10x-inference-tax-you-dont-have-to-pay • 9 items • Updated Mar 2 • 3
lmstudio-community/Qwen3-4B-Thinking-2507-MLX-4bit Text Generation • 0.6B • Updated Aug 6, 2025 • 65.9k • 12