Models: 4,744

Active filters: meta

jwiggerthale/Llama-3.2-3B-Q4_0-GGUF

Text Generation • 3B • Updated Dec 2, 2024 • 10

jwiggerthale/Llama-3.2-3B-Q8_0-GGUF

Text Generation • 3B • Updated Dec 2, 2024 • 7

jwiggerthale/Llama-3.2-3B-Instruct-Q2_K-GGUF

Text Generation • 3B • Updated Dec 2, 2024 • 9

fbaldassarri/meta-llama_Llama-3.2-3B-Instruct-auto_awq-int4-gs128-sym

Text Generation • 4B • Updated Dec 2, 2024 • 6

jwiggerthale/Llama-3.2-3B-Instruct-Q4_0-GGUF

Text Generation • 3B • Updated Dec 2, 2024 • 4

fbaldassarri/meta-llama_Llama-3.2-1B-Instruct-auto_awq-int4-gs128-asym

Text Generation • 1B • Updated Dec 2, 2024 • 33

fbaldassarri/meta-llama_Llama-3.2-1B-Instruct-auto_awq-int4-gs128-sym

Text Generation • 1B • Updated Dec 2, 2024 • 7

fbaldassarri/meta-llama_Llama-3.2-1B-auto_round-int4-gs128-asym

Text Generation • 0.4B • Updated Dec 2, 2024 • 5

fbaldassarri/meta-llama_Llama-3.2-1B-auto_round-int4-gs128-sym

Text Generation • 0.4B • Updated Dec 2, 2024 • 6

BronioInt/Lake-1

Text Generation • 3B • Updated Feb 1, 2025

fbaldassarri/meta-llama_Llama-3.2-1B-auto_gptq-int4-gs128-asym

Text Generation • 1B • Updated Dec 2, 2024 • 4

fbaldassarri/meta-llama_Llama-3.2-1B-auto_gptq-int4-gs128-sym

Text Generation • 1B • Updated Dec 2, 2024 • 6

spicychickennoodles/Llama-3.2-1B-Alpaca

Text Generation • 1B • Updated Dec 2, 2024 • 13

fbaldassarri/meta-llama_Llama-3.2-1B-auto_awq-int4-gs128-asym

Text Generation • 1B • Updated Dec 2, 2024 • 4

fbaldassarri/meta-llama_Llama-3.2-1B-auto_awq-int4-gs128-sym

Text Generation • 1B • Updated Dec 2, 2024 • 9

mradermacher/ContaLLM-Beauty-8B-Instruct-GGUF

8B • Updated Dec 3, 2024 • 57 • 1

mradermacher/ContaLLM-Beauty-8B-Instruct-i1-GGUF

8B • Updated Dec 3, 2024 • 125 • 1

tensorblock/Meta-Llama-3.1-70B-Instruct-bf16-CORRECTED-GGUF

Text Generation • 71B • Updated Jan 27 • 66

smaluk/maluk-whatsapp

Text Generation • 0.5B • Updated Dec 4, 2024 • 32

ContaAI/ContaLLM-Beauty-8B-Instruct

Text Generation • 8B • Updated Dec 19, 2024 • 7 • 1

ContaAI/ContaLLM-Beauty-8B-Instruct-8bit

Text Generation • 8B • Updated Dec 19, 2024 • 3 • 1

ContaAI/ContaLLM-Beauty-8B-Instruct-4bit

Text Generation • 8B • Updated Dec 19, 2024 • 4 • 1

unsloth/Llama-3.2-11B-Vision-unsloth-bnb-4bit

Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 66 • 5

unsloth/Llama-3.2-90B-Vision-Instruct-unsloth-bnb-4bit

Image-Text-to-Text • 91B • Updated Dec 4, 2024 • 18 • 2

mradermacher/GOAT-70B-Storytelling-GGUF

69B • Updated Dec 4, 2024 • 37

fbaldassarri/meta-llama_Llama-3.2-3B-auto_round-int4-gs128-asym

Text Generation • 0.8B • Updated Dec 4, 2024 • 10

fbaldassarri/meta-llama_Llama-3.2-3B-auto_round-int4-gs128-sym

Text Generation • 0.8B • Updated Dec 4, 2024 • 7

fbaldassarri/meta-llama_Llama-3.2-3B-auto_gptq-int4-gs128-asym

Text Generation • 3B • Updated Dec 4, 2024 • 6

fbaldassarri/meta-llama_Llama-3.2-3B-auto_gptq-int4-gs128-sym

Text Generation • 3B • Updated Dec 4, 2024 • 6

fbaldassarri/meta-llama_Llama-3.2-3B-auto_awq-int4-gs128-asym

Text Generation • 4B • Updated Dec 4, 2024 • 8