Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

2,654

Base only

Active filters: fp8

datasysdev/qwen3-30b-a3b-pruned

Text Generation • 31B • Updated Sep 12, 2025 • 9

datasysdev/qwen3-30b-a3b-new-pruned-arch

22B • Updated Sep 13, 2025 • 5 • 1

jobs-git/Kimi-K2-Base

Text Generation • 1T • Updated Sep 13, 2025 • 10

jobs-git/Kimi-K2-Instruct-0905

Text Generation • 1T • Updated Sep 13, 2025 • 4

TheClusterDev/Qwen3-Next-80B-A3B-Instruct-FP8-Dynamic

Text Generation • 80B • Updated Sep 23, 2025 • 35 • 4

KitsuVp/NeoLLM

0.1B • Updated 3 days ago • 1.03k • 2

chutesai/DeepSeek-V3.1-NextN

12B • Updated Sep 17, 2025 • 7

dasLOL/Affine-12412414412124123

Text Generation • 1T • Updated Sep 18, 2025 • 15

danikhan632/TinyDeepseek-V3

99.6M • Updated Sep 18, 2025 • 1

guojinwu/DeepSeek-V3.1-NEXTN

12B • Updated Sep 18, 2025 • 4 • 1

RedHatAI/Apertus-8B-Instruct-2509-FP8-dynamic

Text Generation • 8B • Updated Apr 28 • 1.94k • 4

RedHatAI/Apertus-70B-Instruct-2509-FP8-dynamic

Text Generation • 71B • Updated Sep 30, 2025 • 574 • 1

meituan-longcat/LongCat-Flash-Thinking-FP8

Text Generation • 562B • Updated Sep 24, 2025 • 51 • 8

RedHatAI/Apertus-70B-Instruct-2509-quantized.w4a16

Text Generation • 11B • Updated Sep 23, 2025 • 415 • 1

Qwen/Qwen3-Next-80B-A3B-Instruct-FP8

Text Generation • 81B • Updated Sep 22, 2025 • 287k • 91

Qwen/Qwen3-Next-80B-A3B-Thinking-FP8

Text Generation • 81B • Updated Sep 22, 2025 • 23.3k • 54

deepseek-ai/DeepSeek-V3.1-Terminus

Text Generation • 685B • Updated Sep 29, 2025 • 10.7k • • 366

unsloth/DeepSeek-V3.1-Terminus

Text Generation • 685B • Updated Sep 22, 2025 • 33 • 4

chutesai/DeepSeek-V3.1-Terminus-NextN

12B • Updated Sep 22, 2025 • 7 • 2

DevQuasar-2/deepseek-ai.DeepSeek-V3.1-Terminus-BF16

Text Generation • 684B • Updated Sep 23, 2025 • 36

testmymodel112/Affine-new-model-152

Text Generation • 235B • Updated Sep 23, 2025 • 3

MattisR/Voxtral-Small-24B-2507-FP8-dynamic

Automatic Speech Recognition • 24B • Updated Sep 24, 2025 • 35 • 3

Sunbird/Sunflower-14B-FP8

Text Generation • 15B • Updated Oct 9, 2025

ZixiQi/DeepSeek-V3-4layers-MTP-FP8

7B • Updated Sep 24, 2025 • 41.4k

Sunbird/Sunflower-32B-FP8

Text Generation • 33B • Updated Oct 9, 2025

RedHatAI/Voxtral-Small-24B-2507-FP8-dynamic

Automatic Speech Recognition • 24B • Updated Feb 5 • 3.82k • 2

chutesai/DeepSeek-R1T-Chimera-NextN

Text Generation • 12B • Updated Sep 26, 2025 • 8

chutesai/DeepSeek-TNG-R1T2-Chimera-NextN

Text Generation • 12B • Updated Sep 26, 2025 • 9

mouse/DeepSeek-R1-0528-FP4-NVFP4KV

394B • Updated Sep 28, 2025 • 3

k-l-lambda/DeepSeek-V3.1-Terminus-FP4

397B • Updated Sep 28, 2025 • 138 • 1