Models: 3,559

Active filters: instruct

mlx-community/LFM2-8B-A1B-8bit-MLX

Text Generation • 8B • Updated Oct 8, 2025 • 40 • 7

mlx-community/LFM2-8B-A1B-6bit-MLX

Text Generation • 8B • Updated Oct 8, 2025 • 26 • 3

mradermacher/Qwen3-4B-Thinking-2507-Esper3.1-GGUF

4B • Updated Oct 8, 2025 • 104 • 3

mradermacher/Qwen3-4B-Thinking-2507-Esper3.1-i1-GGUF

4B • Updated Dec 9, 2025 • 133 • 2

mlx-community/LFM2-8B-A1B-3bit-MLX

Text Generation • 1B • Updated Oct 8, 2025 • 83 • 2

yaraabdalaziz/SmolLM2Instruct-360-FT-MealRecommendation-v1

Text Generation • 0.4B • Updated Oct 9, 2025 • 3

yaraabdalaziz/SmolLM2Instruct-FT-MealRecommendation-v3

Text Generation • 0.1B • Updated Oct 9, 2025 • 3 • 1

EmanuelOverride/wise-llama-Q4_K_M-GGUF

8B • Updated Oct 11, 2025 • 1

Babsie/DeepHermes3_24B_textonly

24B • Updated Oct 27, 2025 • 3 • 1

philkuz/llama-3.3-70b-instruct-fp8

Text Generation • 71B • Updated Oct 14, 2025 • 17 • 1

mradermacher/DeepHermes-Egregore-8B-131K-GGUF

Reinforcement Learning • 8B • Updated Oct 16, 2025 • 95 • 1

mradermacher/DeepHermes-Egregore-8B-131K-i1-GGUF

Reinforcement Learning • 8B • Updated Dec 10, 2025 • 190 • 1

tbilisi-ai-lab/kona2-12B-Instruct

Text Generation • 12B • Updated May 16 • 51 • 4

SaptivaAI/KAL-24B-mx-v1

Text Generation • 22B • Updated Oct 18, 2025 • 8 • 10

Gokul-A-100/llama-3.1-8B-Instruct-atty-finetuned

Text Generation • Updated Oct 18, 2025

mradermacher/KAL-24B-mx-v1-GGUF

Text Generation • 22B • Updated Oct 19, 2025 • 7

mradermacher/kona2-12B-Instruct-GGUF

12B • Updated about 1 month ago • 71

mradermacher/KAL-24B-mx-v1-i1-GGUF

Text Generation • 22B • Updated Dec 6, 2025 • 39

mradermacher/kona2-12B-Instruct-i1-GGUF

12B • Updated about 1 month ago • 231

samunder12/Llama-3.2-3B-small_Shiro_roleplay-gguf

Text Generation • 3B • Updated Oct 20, 2025 • 534 • 4

sweatSmile/SmolLM-360M-CustomerSupport-Instruct

0.4B • Updated Oct 19, 2025 • 6 • 1

ModelCloud/GLM-4.6-GPTQMODEL-W4A16-v1

Text Generation • 357B • Updated Oct 28, 2025 • 3

ModelCloud/GLM-4.6-GPTQMODEL-W4A16-v2

Text Generation • 357B • Updated Oct 28, 2025 • 12 • 1

arogister/Qwen3-8B-ShiningValiant3-mlx-4Bit

Text Generation • 1B • Updated Oct 20, 2025 • 9

arogister/Qwen3-8B-Esper3-mlx-8Bit

Text Generation • 8B • Updated Oct 20, 2025 • 2

ValiantLabs/gpt-oss-20b-Esper3.1

Text Generation • 21B • Updated Apr 22 • 6 • 3

Godheritage/Qwen2.5-14B-Instruct-BesiegeField-Gemini2.5ProColdStart

Text Generation • 15B • Updated Oct 21, 2025 • 3

Godheritage/Qwen2.5-14B-Instruct-BesiegeField-CatapultRL

Reinforcement Learning • 15B • Updated Oct 21, 2025 • 2

BesiegeField/Qwen2.5-14B-Instruct-BesiegeField-CarRL

Reinforcement Learning • 15B • Updated Oct 22, 2025 • 1

nightmedia/Qwen3-4B-Thinking-2507-Esper3.1-qx86-hi-mlx

Text Generation • 1B • Updated Dec 25, 2025 • 9