Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

2,741

Base only

Active filters: 2-bit

MaziyarPanahi/gemma-3-1b-thinking-v2-GGUF

Text Generation • 1.0B • Updated Mar 24, 2025 • 131 • 1

MaziyarPanahi/Oriental-1.5b-GGUF

Text Generation • 2B • Updated Mar 24, 2025 • 27

MaziyarPanahi/Dirty-Shirley-Writer-v01-Uncensored-GGUF

Text Generation • 9B • Updated Mar 24, 2025 • 667 • 1

MaziyarPanahi/Dirty-Shirley-Writer-v0-abliterated-GGUF

Text Generation • 9B • Updated Mar 24, 2025 • 187 • 1

MaziyarPanahi/Qwen2.5-7B-sft-ultrachat-SPIN-gpt4o-GGUF

Text Generation • 8B • Updated Mar 24, 2025 • 37 • 1

MaziyarPanahi/MT-Merge9-gemma-2-9B-GGUF

Text Generation • 9B • Updated Mar 24, 2025 • 13 • 1

MaziyarPanahi/DeepSeek-V3-0324-GGUF

Text Generation • 671B • Updated Mar 25, 2025 • 75k • 23

ipetrukha/Llama-3.2-1B-Instruct-2bit

Text Generation • 0.1B • Updated Mar 27, 2025 • 12

AlignQuant/Llama-2-7b-chat-hf-GPTQ-2bit

Text Generation • 7B • Updated Mar 27, 2025 • 2

AlignQuant/Meta-Llama-3-8B-Instruct-GPTQ-2bit

Text Generation • 8B • Updated Mar 27, 2025 • 1

AlignQuant/Llama-2-13b-chat-hf-GPTQ-2bit

Text Generation • 13B • Updated Mar 28, 2025 • 6

oskarraszkiewicz/gemma-3-1b-it-abliterated-mlx-2bit

Text Generation • 0.1B • Updated Apr 3, 2025 • 21

YuHaaa/QwQ-32B-mlx-2Bit

Text Generation • 3B • Updated Apr 8, 2025 • 5

justinmeans/DeepCoder-14B-Preview-mlx-2Bit

Text Generation • 1B • Updated Apr 8, 2025 • 4

MikeRoz/Meta-Llama-3.1-405B-Instruct-2.0bpw-h4-exl3

Text Generation • 53B • Updated Apr 9, 2025 • 1

sleepdeprived3/Baptist-Christian-Bible-Expert-v2.0-24B_EXL2_2bpw_H8

Text Generation • Updated Apr 15, 2025 • 2

phires/Llama-3.2-1B-Instruct-GGUF-rk3588-1.1.2

Text Generation • Updated Apr 20, 2025 • 6

MaziyarPanahi/GLM-4-32B-0414-GGUF

Text Generation • 33B • Updated Apr 22, 2025 • 201 • 1

MaziyarPanahi/cogito-v1-preview-llama-3B-GGUF

Text Generation • 4B • Updated Apr 21, 2025 • 194 • 1

MaziyarPanahi/cogito-v1-preview-llama-8B-GGUF

Text Generation • 8B • Updated Apr 21, 2025 • 106 • 1

MaziyarPanahi/cogito-v1-preview-llama-70B-GGUF

Text Generation • 71B • Updated Apr 22, 2025 • 41 • 1

Erland/softpick-340M-4096-model-GPTQ-2bit

Text Generation • 0.4B • Updated May 21, 2025 • 1

Erland/vanilla-340M-4096-model-GPTQ-2bit

Text Generation • 0.4B • Updated Apr 24, 2025 • 2

kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-32g-2048-gptq

73B • Updated May 7, 2025 • 5

kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-64g-2048-gptq

73B • Updated May 7, 2025 • 3

kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-128g-2048-gptq

73B • Updated May 7, 2025 • 2

kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-32g-4096-gptq

73B • Updated May 7, 2025 • 3

kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-64g-4096-gptq

73B • Updated May 7, 2025 • 4

kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-128g-4096-gptq

73B • Updated May 7, 2025 • 5

steampunque/Llama-4-Scout-17B-16E-Instruct-MP-GGUF

Text Generation • 108B • Updated Feb 18 • 216 • 1