Models

403

Full-text search

Active filters: llmcompressor

RedHatAI/Qwen3-30B-A3B-Thinking-2507-quantized.w8a8

Text Generation • 31B • Updated Mar 20 • 4

RedHatAI/Qwen3-30B-A3B-Instruct-2507-quantized.w8a8

Text Generation • 31B • Updated Mar 20 • 327

MedGemmaImpact/medgemma-1.5-4b-it-fp8

4B • Updated Jan 21 • 160 • 1

MedGemmaImpact/medgemma-1.5-4b-it-awq

2B • Updated Jan 22 • 469 • 1

MedGemmaImpact/medgemma-1.5-4b-it-nvfp4

4B • Updated Jan 23 • 348 • 1

RedHatAI/Ministral-3-14B-Instruct-2512-FP8-dynamic

Text Generation • 14B • Updated Feb 5 • 314

YifeiDevs/Huihui-Qwen3-8B-abliterated-v2-FP8

Text Generation • 8B • Updated Jan 24 • 5

Etelis/DeepSeek-V2-Lite-FP8-BLOCK-padded

Text Generation • 16B • Updated Jan 28 • 9

dtometzki/Qwen3-30B-A3B-MXFP4A16

Text Generation • 17B • Updated Jan 28 • 3

JEILDLWLRMA/Qwen3-VL-8B-Instruct-NVFP4

Image-to-Text • 6B • Updated Feb 2 • 27 • 1

inference-optimization/Ministral-3-14B-Instruct-2512-NVFP4

Text Generation • Updated 2 days ago • 166

hassanshka/Biomni-R0-32B-AWQ-INT4

Text Generation • 6B • Updated Feb 5 • 142

hassanshka/Biomni-R0-32B-FP8

Text Generation • Updated Feb 5 • 4

groxaxo/Qwen3-4B-Instruct-2507-heretic-W4A16

Text Generation • 0.9B • Updated Feb 17 • 6

groxaxo/Qwen3-4B-Instruct-2507-heretic-W8A16

Text Generation • 4B • Updated Feb 27 • 7 • 1

turnio/medgemma-27b-text-it-FP8-Dynamic

Text Generation • 27B • Updated Feb 18 • 278

saital/GUI-Owl-1.5-8B-Think-FP8-Dynamic

Image-Text-to-Text • 9B • Updated Feb 21 • 14 • 1

embedl/Cosmos-Reason2-2B-NVFP4A16

Image-Text-to-Text • 2B • Updated 4 days ago • 391 • 1

embedl/Cosmos-Reason2-2B-W4A16-Edge2

Image-Text-to-Text • 2B • Updated 4 days ago • 688 • 12

GizzmoShifu/glm-4-9b-chat-hf-nvfp4

Text Generation • 6B • Updated Feb 27 • 5

embedl/Cosmos-Reason2-2B-W4A16-Edge2-FlashHead

Image-Text-to-Text • 2B • Updated 4 days ago • 1.5k • 9

GizzmoShifu/glm-4-9b-chat-hf-fp8

Text Generation • 9B • Updated Mar 4 • 3

pshashid/llama3.1B_8B_SQL_Finetuned_model

Text Generation • 5B • Updated Mar 2 • 4

dataslab/DLM-2.0-14B-FP8

Text Generation • 15B • Updated 24 days ago • 19 • 3

dataslab/DLM-2.0-14B-GPTQ

Text Generation • 3B • Updated 24 days ago • 25 • 1

dataslab/DLM-2.1-14B-FP8

Text Generation • 15B • Updated 24 days ago • 33 • 1

dataslab/DLM-2.1-14B-GPTQ

Text Generation • 3B • Updated 24 days ago • 18 • 1

inference-optimization/Qwen3-235B-A22B-Instruct-2507-quantized.w4a16

Text Generation • 32B • Updated 3 days ago • 147

inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w4a16

Text Generation • 32B • Updated 3 days ago • 168

RedHatAI/Qwen3-235B-A22B-Instruct-2507-quantized.w8a8

Text Generation • 235B • Updated 3 days ago • 88