Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

16,292

Base only

Active filters: mlx

mlx-community/JOSIE-TinyLlama-1.1B-32k-base-4bit

Text Generation • 0.2B • Updated May 24, 2024 • 33 • 1

mlx-community/JOSIE-TinyLlama-1.1B-32k-base-8bit

Text Generation • 0.3B • Updated May 24, 2024 • 17 • 1

mlx-community/Phi-3-small-8k-instruct-aq4_64

Text Generation • Updated May 24, 2024 • 23

mlx-community/openchat-3.6-8b-20240522-8bit

Text Generation • Updated May 24, 2024 • 32 • 2

mlx-community/openchat-3.6-8b-20240522-4bit

Text Generation • Updated May 24, 2024 • 37 • 1

mlx-community/openchat-3.6-8b-20240522-2bit

Text Generation • Updated May 24, 2024 • 23

lmbelo/OpenELM-270M-Function-Calling

0.3B • Updated May 25, 2024 • 5

cstr/Llama3-DiscoLeo-Instruct-8B-v0.1-mlx

Text Generation • 1B • Updated May 25, 2024 • 8

dfurman/Mistral-7B-Instruct-v0.3-mlx-4bit

1B • Updated May 26, 2024 • 58

mayflowergmbh/Llama3-German-8B-4bit

Text Generation • 2B • Updated May 26, 2024 • 12

mayflowergmbh/Llama3-German-8B-32k-4bit

Text Generation • 2B • Updated May 26, 2024 • 24 • 1

mayflowergmbh/Llama3-DiscoLeo-Instruct-8B-v0.1-4bit

Text Generation • 2B • Updated May 26, 2024 • 2

mayflowergmbh/Llama3-DiscoLeo-Instruct-8B-32k-v0.1-4bit

Text Generation • 2B • Updated May 26, 2024 • 2

mlx-community/OpenELM-270M-Instruct-4bit

Updated May 26, 2024 • 13 • 1

lmbelo/Phi-3-mini-4k-instruct

Text Generation • 4B • Updated May 28, 2024 • 7

cstr/llama3-8b-spaetzle-v33-mlx-4bit

1B • Updated May 28, 2024 • 2

mlx-community/Codestral-22B-v0.1-4bit

3B • Updated May 29, 2024 • 1.06k • 13

mlx-community/Codestral-22B-v0.1-8bit

6B • Updated May 29, 2024 • 312 • 8

lmbelo/Phi-3-mini-4k-Function-Calling

Text Generation • 4B • Updated May 30, 2024 • 13

mlx-community/AutoCoder-33B-4bit

Updated May 30, 2024 • 55 • 2

xiaotianxt/llama-3-chinese-8b-instruct-v3-4bit-mlx

1B • Updated May 31, 2024 • 50

mlx-community/Phi-3-small-8k-instruct-AQ4_32

Text Generation • Updated May 31, 2024 • 31 • 2

xiaotianxt/llama-3-treehole-8b-instruct

8B • Updated Jun 1, 2024 • 1

xiaotianxt/llama-3-treehole-8b-v3

1B • Updated Jun 2, 2024 • 8

mlx-community/dolphin-2.9.2-Phi-3-Medium-4bit

Updated Jun 2, 2024 • 21 • 1

mlx-community/dolphin-2.9.2-Phi-3-Medium-8bit

Updated Jun 2, 2024 • 20 • 1

mlx-community/dolphin-2.9.2-Phi-3-Medium-2bit

Updated Jun 2, 2024 • 18

ipihq/Phi-3-medium-128k-instruct

Text Generation • 14B • Updated Jun 3, 2024 • 7

ipihq/Phi-3-medium-128k-instruct_q

Text Generation • 2B • Updated Jun 3, 2024 • 6

meriamcherif/Llama-2-7b-chat-hf-Quantized

Text Generation • 1B • Updated Jun 4, 2024 • 14