inference-optimization/Ministral-3-14B-Instruct-2512-NVFP4 Text Generation • Updated 2 days ago • 166
embedl/Cosmos-Reason2-2B-W4A16-Edge2-FlashHead Image-Text-to-Text • 2B • Updated 4 days ago • 1.5k • 9
inference-optimization/Qwen3-235B-A22B-Instruct-2507-quantized.w4a16 Text Generation • 32B • Updated 3 days ago • 147
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w4a16 Text Generation • 32B • Updated 3 days ago • 168
RedHatAI/Qwen3-235B-A22B-Instruct-2507-quantized.w8a8 Text Generation • 235B • Updated 3 days ago • 88