Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

152

Base only

Active filters: Quantization

VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-256-woft

9B • Updated Feb 25, 2025 • 5 • 4

VPTQ-community/Qwen2.5-14B-Instruct-v8-k256-256-woft

2B • Updated Mar 20, 2025 • 1

VPTQ-community/Qwen2.5-14B-Instruct-v16-k65536-65536-woft

3B • Updated Mar 20, 2025 • 7

VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-256-woft

3B • Updated Mar 20, 2025 • 2

VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-0-woft

3B • Updated Mar 20, 2025 • 3

VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-65536-woft

4B • Updated Mar 20, 2025 • 5

VPTQ-community/Qwen2.5-7B-Instruct-v16-k65536-65536-woft

2B • Updated Mar 20, 2025 • 10 • 1

VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-65536-woft

2B • Updated Mar 20, 2025 • 5

VPTQ-community/Qwen2.5-7B-Instruct-v8-k256-256-woft

2B • Updated Mar 20, 2025 • 12

VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-0-woft

2B • Updated Mar 20, 2025 • 39

VPTQ-community/Meta-Llama-3.1-8B-Instruct-v12-k65536-4096-woft

2B • Updated Jan 13, 2025 • 11 • 5

VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-65536-woft

12B • Updated Feb 25, 2025 • 7 • 1

sunilrufus/Mistral_Instruct_Nemo_12B_quantized_w8a8

Text Generation • 12B • Updated Jan 2, 2025 • 124 • 1

VPTQ-community/Qwen2.5-32B-Instruct-v16-k65536-65536-woft

4B • Updated Feb 25, 2025 • 2 • 1

VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-0-woft

4B • Updated Feb 25, 2025 • 7

VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-65536-woft

6B • Updated Feb 25, 2025 • 3 • 1

VPTQ-community/Qwen2.5-32B-Instruct-v8-k256-256-woft

4B • Updated Feb 25, 2025 • 1

VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-256-woft

5B • Updated Feb 25, 2025 • 18 • 2

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-65536-woft

31B • Updated Feb 26, 2025 • 10 • 3

VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-0-woft

9B • Updated Feb 25, 2025 • 7

VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-16384-woft

9B • Updated Feb 25, 2025 • 10 • 2

VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-256-woft

13B • Updated Feb 25, 2025 • 2

VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-65536-woft

10B • Updated Feb 25, 2025 • 4 • 1

VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-65536-woft

17B • Updated Feb 26, 2025 • 5 • 2

VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-1024-woft

8B • Updated Feb 25, 2025 • 9

VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-4096-woft

8B • Updated Feb 26, 2025 • 2

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k65536-256-woft

42B • Updated Feb 26, 2025 • 4 • 1

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k65536-65536-woft

55B • Updated Feb 26, 2025 • 3

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-256-woft

9B • Updated Feb 25, 2025 • 4

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-65536-woft

8B • Updated Feb 25, 2025 • 5 • 1