Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

2,659

Base only

Active filters: fp8

chutesai/DeepSeek-R1T-Chimera-NextN

Text Generation • 12B • Updated Sep 26, 2025 • 8

chutesai/DeepSeek-TNG-R1T2-Chimera-NextN

Text Generation • 12B • Updated Sep 26, 2025 • 9

mouse/DeepSeek-R1-0528-FP4-NVFP4KV

394B • Updated Sep 28, 2025 • 3

k-l-lambda/DeepSeek-V3.1-Terminus-FP4

397B • Updated Sep 28, 2025 • 140 • 1

xxrjun/DeepSeek-R1-0528-FP4

394B • Updated Sep 27, 2025 • 2

Glazkov/qwen2.5-vl-table-extraction-FP8-Dynamic

Image-to-Text • 4B • Updated Sep 28, 2025 • 5

RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic

Text Generation • 236B • Updated Oct 3, 2025 • 234 • 4

QuantTrio/Qwen3-VL-235B-A22B-Instruct-FP8

Text Generation • Updated Oct 8, 2025 • 20

QuantTrio/Qwen3-VL-235B-A22B-Thinking-FP8

Text Generation • 236B • Updated Oct 8, 2025 • 10

eniffA/Affine-Model-01

Text Generation • 685B • Updated Sep 28, 2025 • 6

RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-block

Text Generation • 236B • Updated Oct 27, 2025 • 9 • 3

deepseek-ai/DeepSeek-V3.2-Exp-Base

Text Generation • 685B • Updated Oct 9, 2025 • 277 • 67

JoshPP/DeepSeek-V3-16layers

153B • Updated Sep 29, 2025 • 2

xaura/affine-6k6k6

1T • Updated Sep 30, 2025 • 3

RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-FP8-dynamic

Text Generation • 9B • Updated Apr 28 • 884 • 3

Qwen/Qwen3-VL-235B-A22B-Thinking-FP8

Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 69k • 29

Qwen/Qwen3-VL-30B-A3B-Instruct-FP8

Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 758k • 113

Qwen/Qwen3-VL-30B-A3B-Thinking-FP8

Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 8.74k • 57

kavanmevada/eng-word-model-ds32

Text Generation • 2B • Updated Oct 7, 2025 • 5

GaleneAI/Magistral-Small-2509-FP8-Dynamic

Updated Oct 8, 2025 • 13 • 2

RedHatAI/Llama-3.1-8B-Instruct-FP8-block

Text Generation • 8B • Updated Oct 29, 2025 • 9

wangkanai/flux-dev-fp8

Text-to-Image • Updated Oct 28, 2025 • 4

yejingfu/prune-deepseek-v3.1-e32

99B • Updated Oct 11, 2025 • 2

Qwen/Qwen3-VL-4B-Instruct-FP8

Image-Text-to-Text • 5B • Updated Oct 15, 2025 • 385k • 63

Qwen/Qwen3-VL-4B-Thinking-FP8

Image-Text-to-Text • 5B • Updated Nov 26, 2025 • 2.55k • 30

Qwen/Qwen3-VL-8B-Thinking-FP8

Image-Text-to-Text • 9B • Updated Nov 26, 2025 • 37.8k • 33

ai-team-20/deepseek-v3-1

Text Generation • Updated Oct 12, 2025 • 8

ai-team-20/deepseek-v3-2

Text Generation • Updated Oct 12, 2025 • 15

nm-testing/Llama-3.1-70B-Instruct-FP8-block

Text Generation • Updated Oct 14, 2025

RedHatAI/Qwen3-14B-FP8-block

Text Generation • 15B • Updated Oct 24, 2025 • 18