Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

299

Base only

Active filters: draft-model

lightseekorg/kimi-k2.5-eagle3

3B • Updated Mar 16 • 90.8k • 15

z-lab/Alpamayo-R1-10B-DFlash

Robotics • 0.5B • Updated 13 days ago • 649 • 3

Thr45h/MEDUSA-Llama-3.1-8B-Instruct

Text Generation • 3B • Updated Mar 17 • 8

darkmaniac7/TokForge-AccelerationPack-Draft

Text Generation • Updated 4 days ago • 358

darkmaniac7/TokForge-AccelerationPack-Qwen35-Draft

Text Generation • Updated 4 days ago • 297

darkmaniac7/Qwen3-0.6B-kl-baseline-20k-MNN

Text Generation • Updated 4 days ago • 122

darkmaniac7/Qwen3-0.6B-lk-alpha-20k-MNN

Text Generation • Updated 4 days ago • 131

darkmaniac7/Qwen3-0.6B-lk-alpha-40k-MNN

Text Generation • Updated Mar 25 • 3

darkmaniac7/Qwen3-0.6B-lk-alpha-14b-paired-MNN

Text Generation • Updated Mar 25 • 4

darkmaniac7/Qwen3.5-0.8B-lk-alpha-ep4-MNN

Text Generation • Updated Apr 9 • 6

thoughtworks/Qwen2.5-7B-Instruct-Eagle3

Text Generation • 0.4B • Updated Mar 28 • 222 • 1

thoughtworks/Llama-3.2-3B-Instruct-Eagle3

Text Generation • 0.2B • Updated Mar 28 • 157 • 1

thoughtworks/DeepSeek-R1-Distill-Qwen-7B-Eagle3

Text Generation • 0.4B • Updated Mar 28 • 96

thoughtworks/DeepSeek-R1-Distill-Qwen-14B-Eagle3

Text Generation • 0.6B • Updated Mar 28 • 46

thoughtworks/Llama-3.1-8B-Instruct-Eagle3

Text Generation • 0.4B • Updated Mar 28 • 15

thoughtworks/Qwen2.5-14B-Instruct-Eagle3

Text Generation • 0.6B • Updated Mar 28 • 73

thoughtworks/Qwen3-8B-Eagle3

Text Generation • 0.4B • Updated Mar 28 • 48 • 2

thoughtworks/Qwen3-14B-Eagle3

Text Generation • 0.6B • Updated Mar 28 • 38 • 1

thoughtworks/Qwen3-32B-Eagle3

Text Generation • 0.8B • Updated Mar 28 • 11

z-lab/Alpamayo-1.5-10B-DFlash

Robotics • 0.5B • Updated 13 days ago • 1.53k • 4

mradermacher/Harmonic-2B-GGUF

2B • Updated Apr 6 • 13

mradermacher/Harmonic-2B-i1-GGUF

2B • Updated May 24 • 24

thoughtworks/Gemma-4-31B-Eagle3

Text Generation • 0.6B • Updated Apr 7 • 116 • 6

chankhavu/Nemotron-Cascade2-30B-A3B-Eagle3-Long-Context

Text Generation • 0.2B • Updated Apr 22 • 49 • 2

chankhavu/c2.eagle3-test-v2

Text Generation • 0.2B • Updated Apr 9 • 8

jaygala223/llada-distilled-16L-checkpoint-500

Text Generation • 5B • Updated Apr 9 • 9

thoughtworks/MiniMax-M2.5-Eagle3

Text Generation • 0.2B • Updated Apr 12 • 1.3k • 7

jaygala223/llada-distilled-24L-v2

5B • Updated Apr 10 • 5

jaygala223/llada-distilled-24L-v3

5B • Updated Apr 12 • 4

sulabhkatiyar/eagle3-sarvam-30b

Text Generation • 2B • Updated May 29 • 12