Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

646

Base only

Active filters: pruning

NexVeridian/Qwen3-Coder-REAP-25B-A3B-8bit

Text Generation • 25B • Updated Jan 25 • 115 • 1

DarqueDante/gemma3-270m-pruned-base-Q4_0-GGUF

Text Generation • 0.3B • Updated Dec 7, 2025 • 18

CrystalRaindropsFall/llava-heads-30pct

Image-to-Text • 6B • Updated Dec 9, 2025 • 2

CrystalRaindropsFall/llava-glu-70pct

Image-to-Text • 4B • Updated Dec 9, 2025 • 2

CrystalRaindropsFall/llava-heads-70pct

Image-to-Text • 6B • Updated Dec 9, 2025 • 2

CrystalRaindropsFall/llava-glu-30pct

Image-to-Text • 6B • Updated Dec 9, 2025 • 2

CrystalRaindropsFall/llava-l1-30pct

Image-to-Text • 7B • Updated Dec 9, 2025 • 3

CrystalRaindropsFall/llava-l1-70pct

Image-to-Text • 7B • Updated Dec 9, 2025 • 3 • 1

CrystalRaindropsFall/llava-glu30-heads30

Image-to-Text • 5B • Updated Dec 9, 2025 • 3

CrystalRaindropsFall/llava-glu70-heads70

Image-to-Text • 2B • Updated Dec 9, 2025 • 2

cerebras/DeepSeek-V3.2-REAP-345B-A37B

Text Generation • 345B • Updated Dec 9, 2025 • 49 • 34

cerebras/DeepSeek-V3.2-REAP-508B-A37B

Text Generation • 508B • Updated Dec 9, 2025 • 13 • 15

TobDeBer/Qwen3-30B-Hirma

Text Generation • Updated Dec 10, 2025 • 3

epfl-ml-ytf/apertus-8b-pruned-latin-94237

8B • Updated Dec 18, 2025 • 2

AfriNLP/AfriNLLB-12enc-8dec-middle-548m-ft

Translation • 0.5B • Updated Mar 6 • 11

naveedashfaq/llama-3-8b-pruned-30-percent

6B • Updated Dec 16, 2025 • 8

naveedashfaq/llama-3-8b-pruned-30-percent-taylor

6B • Updated Dec 16, 2025 • 16 • 1

epfl-ml-ytf/apertus-8b-pruned-eng-66663

8B • Updated Dec 18, 2025 • 2

AfriNLP/AfriNLLB-8enc-8dec-middle-498m-ft

Translation • 0.5B • Updated Mar 6 • 10

avtc/GLM-4.6-REAP-268B-A32B-GPTQMODEL-W4A16-V2

Text Generation • 271B • Updated Dec 20, 2025 • 8

epfl-ml-ytf/apertus-8b-pruned-english-ds-63159

7B • Updated Dec 18, 2025 • 6

dnaymont15/Qwen3-Coder-REAP-25B-A3B-Q3_K_S-GGUF

Text Generation • 25B • Updated Dec 18, 2025 • 11

dnaymont15/Qwen3-Coder-REAP-25B-A3B-Q4_K_M-GGUF

Text Generation • 25B • Updated Dec 18, 2025 • 7

dnaymont15/Qwen3-Coder-REAP-25B-A3B-Q3_K_L-GGUF

Text Generation • 25B • Updated Dec 18, 2025 • 8

muchad/deberta-hybrid-7030-30k

0.1B • Updated Dec 20, 2025 • 43

muchad/deberta-single-30k

0.1B • Updated Dec 20, 2025 • 2

muchad/deberta-single-20k

0.1B • Updated Dec 20, 2025 • 2

Echoes123-3/qwen2.5-0.5b-coding-pruned

0.5B • Updated Dec 20, 2025 • 3

AfriNLP/AfriNLLB-12enc-8dec-iterative-548m-ft

Translation • 0.5B • Updated Mar 6 • 12 • 1

StarTech792/Qwen3-Coder-REAP-25B-A3B-Q4_K_M-GGUF

Text Generation • 25B • Updated Dec 25, 2025 • 14