Models

2,639

Active filters: fp8

CalamitousFelicitousness/Qwen2.5-32B-Instruct-fp8-dynamic

Text Generation • 33B • Updated Sep 18, 2024 • 154k • 2

CalamitousFelicitousness/Qwen2.5-72B-Instruct-fp8-dynamic

Text Generation • 73B • Updated Sep 18, 2024 • 47 • 4

John6666/blue-pencil-flux1-v021-fp8-flux

Text-to-Image • Updated Sep 19, 2024 • 1

ProbablyFaiz/Meta-Llama-3.1-70B-Instruct-FP8

71B • Updated Sep 22, 2024 • 1

John6666/real-flux-10b-schnell-fp8-flux

Text-to-Image • Updated Sep 23, 2024 • 12

ajinkya-tejankar/Mistral-7B-v0.1-FP8-KV

7B • Updated Sep 24, 2024 • 7

nm-testing/Meta-Llama-3.1-8B-Instruct-FP8-hf

Text Generation • 8B • Updated Sep 24, 2024 • 11

amd/dbrx-instruct-FP8-KV

132B • Updated May 19 • 11

yejingfu/SAO10K-L3-70B-Euryale-v2.1-FP8

71B • Updated Sep 26, 2024 • 1

RedHatAI/Llama-3.2-1B-Instruct-FP8-dynamic

Text Generation • 1B • Updated Oct 9, 2024 • 1.81M • 4

RedHatAI/Llama-3.2-3B-Instruct-FP8-dynamic

Text Generation • 4B • Updated Oct 9, 2024 • 770 • 3

RedHatAI/Llama-3.2-90B-Vision-Instruct-FP8-dynamic

Text Generation • 89B • Updated Oct 2, 2024 • 168 • 11

amd/Llama-3.2-11B-Vision-Instruct-FP8-KV

11B • Updated Jul 23, 2025 • 383 • 1

amd/Llama-3.2-3B-Instruct-FP8-KV

3B • Updated Dec 19, 2024 • 631

amd/Llama-3.2-3B-FP8-KV

3B • Updated Dec 19, 2024 • 36

amd/Llama-3.2-1B-Instruct-FP8-KV

1B • Updated Dec 19, 2024 • 13k • 1

amd/Llama-3.2-1B-FP8-KV

1B • Updated Dec 19, 2024 • 16

SicariusSicariiStuff/Dusk_Rainbow_FP8

Text Generation • 8B • Updated Jul 25, 2025 • 2

amd/Llama-3.2-90B-Vision-Instruct-FP8-KV

89B • Updated Jun 10, 2025 • 9

soprasteria/Mixtral-8x7B-Instruct-v0.1-FP8

47B • Updated Sep 27, 2024 • 1

RedHatAI/Phi-3.5-mini-instruct-FP8-KV

Text Generation • 4B • Updated Oct 1, 2024 • 8 • 2

CalamitousFelicitousness/SorcererLM-8x22b-FP8-Dynamic

141B • Updated Oct 4, 2024 • 4

John6666/stoiqo-afrodite-fluxxl-f1dalpha-fp8-flux

Text-to-Image • Updated Dec 23, 2024 • 6 • 5

fxmarty/quark-legacy-fp8

1.03M • Updated Oct 10, 2024 • 6

amd/jais-13b-chat-FP8

13B • Updated Dec 19, 2024 • 3

RedHatAI/pixtral-12b-FP8-dynamic

Text Generation • 13B • Updated Feb 7, 2025 • 426 • 10

predibase/Qwen2.5-14B-FP8

15B • Updated Oct 10, 2024 • 3

CalamitousFelicitousness/banana-2-b-72b-FP8-Dynamic

73B • Updated Oct 11, 2024 • 2

taozi555/Llama-Guard-3-8B-FP8

8B • Updated Oct 12, 2024 • 3 • 1

ajinkya-tejankar/Mistral-7B-Instruct-v0.2-FP8-UltraChat-2000-KV

7B • Updated Oct 15, 2024 • 1