Models: 15,867

Active filters: mlx

smdesai/paligemma-3b-mix-448-6bit

Image-Text-to-Text • 0.7B • Updated Jan 1, 2025 • 3

smdesai/paligemma2-3b-pt-448-4bit

Image-Text-to-Text • 0.6B • Updated Jan 2, 2025 • 6

smdesai/paligemma2-3b-pt-448-6bit

Image-Text-to-Text • 0.8B • Updated Jan 2, 2025 • 6

ggbetz/Llama-3.1-Argunaut-1-8B-SFT-Q4-mlx

Text Generation • 1B • Updated Jan 2, 2025 • 6

cmcmaster/Llama-3.2-3B-Q4-mlx

Text Generation • 0.5B • Updated Jan 2, 2025 • 4

mlx-community/Llama-3.2-1B-Instruct-MLXTuned

Text Generation • 1B • Updated Jan 2, 2025 • 737 • 5

alexander583/DeepSeek-V2-Lite-Chat-Q4-mlx

2B • Updated Jan 3, 2025 • 29

WaveCut/PRIME-RL_Eurus-2-7B-PRIME-Q8-mlx

2B • Updated Jan 3, 2025 • 10 • 1

mlx-community/Llama-3.2-1B-Instruct-mlx-FinGreyLit-finetuned

1B • Updated Jan 4, 2025 • 21 • 1

ubaitur5/SmallThinker-3B-Preview-Q4-mlx

Text Generation • Updated Jan 4, 2025 • 2

mlx-community/DeepSeek-V3-3bit

105B • Updated Jan 5, 2025 • 128 • 3

mlx-community/smallthinker-3b-preview-q8

Text Generation • Updated Jan 5, 2025 • 7

mlx-community/smallthinker-3b-preview-q4

Text Generation • Updated Jan 5, 2025 • 6

mlx-community/DeepSeek-V3-3bit-bf16

105B • Updated Jan 5, 2025 • 90 • 2

mlx-community/Dolphin3.0-Llama3.1-8B-4bit

1B • Updated Jan 5, 2025 • 201

mlx-community/Dolphin3.0-Llama3.1-8B-8bit

2B • Updated Jan 5, 2025 • 91

mlx-community/Dolphin3.0-Llama3.1-8B-bf16

8B • Updated Jan 5, 2025 • 38

mlx-community/Mistral-Nemo-Instruct-2407-3bit

Updated Jan 9, 2025 • 59 • 1

mlx-community/HuatuoGPT-o1-72B-4bit

Text Generation • 11B • Updated Jan 6, 2025 • 35 • 1

mlx-community/HuatuoGPT-o1-7B-4bit

Text Generation • 1B • Updated Jan 6, 2025 • 11

ivanfioravanti/Phi-3.5-mini-instruct-italian-wine

Text Generation • 4B • Updated Jan 6, 2025 • 10

CuckmeisterFuller/Dolphin3.0-Qwen2.5-3b-Q4-mlx

0.5B • Updated Jan 6, 2025 • 44

mlx-community/Dolphin3.0-Llama3.1-8B-6bit

2B • Updated Jan 6, 2025 • 42

mlx-community/Qwen2.5-Coder-32B-Instruct-abliterated-4bit

Text Generation • Updated Jan 6, 2025 • 185 • 1

mlx-community/Tiger-Gemma-9B-v3-Q4-mlx

Updated Jan 6, 2025 • 67 • 1

mlx-community/Qwen2.5-Coder-32B-Instruct-abliterated-3bit

Text Generation • Updated Jan 6, 2025 • 67 • 1

prdnr/UwU-7B-Instruct-Q4-mlx

Text Generation • 1B • Updated Jan 6, 2025 • 6 • 1

pcuenq/gemma-2-2b-it-4bit

Text Generation • 0.4B • Updated Jan 7, 2025 • 12

pcuenq/gemma-2-2b-it-4bit-test

Text Generation • 0.4B • Updated Jan 7, 2025 • 8

mlx-community/SmallThinker-3B-Preview-4bit

Text Generation • Updated Jan 7, 2025 • 12 • 1