Distil Efficiency Benchmarks (Collection): models used in the blog post www.distillabs.ai/blog/the-10x-inference-tax-you-dont-have-to-pay • 9 items • Updated 14 days ago • 3
lmstudio-community/Qwen3-4B-Thinking-2507-MLX-4bit Text Generation • 0.6B • Updated Aug 6, 2025 • 89.5k • 10
FluidInference/nemotron-speech-streaming-en-0.6b-coreml Automatic Speech Recognition • Updated Jan 16 • 2 • 4