Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

24,135

Base only

Active filters: llama-cpp

Ransss/Quantum-Citrus-9B-Q8_0-GGUF

9B • Updated May 21, 2024 • 3

Ransss/Silver-Sun-11B-Q8_0-GGUF

11B • Updated May 21, 2024

Ransss/Silver-Sun-v2-11B-Q8_0-GGUF

11B • Updated May 21, 2024

Tj-Grewal/hippomistral-Q4_K_M-GGUF

7B • Updated May 21, 2024 • 5

wwe180/Phi-3-medium-128k-27B-lingyang-v0.1-Q4_K_M-GGUF

28B • Updated May 21, 2024 • 8

mudler/Phi-3-medium-4k-instruct-Q4_K_M-GGUF

Text Generation • 14B • Updated May 21, 2024 • 3

wwe180/Phi-3-medium-128k-10B-lingyang-v0.1-Q6_K-GGUF

11B • Updated May 21, 2024 • 4

int2eh/Phi-3-medium-128k-instruct-Q5_K_M-GGUF

Text Generation • 14B • Updated May 21, 2024 • 4

e2jhiubyiiyvw/Phi-3-medium-128k-instruct-Q5_K_M-GGUF

Text Generation • 14B • Updated May 22, 2024 • 2

aayushg159/Phi-3-medium-4k-instruct-Q4_K_M-GGUF

Text Generation • 14B • Updated May 22, 2024 • 33

farpluto/Phi-3-medium-4k-instruct-Q4_K_S-GGUF

Text Generation • 14B • Updated May 22, 2024 • 1

ivankris/gemma-2b-Q4_K_M-GGUF

3B • Updated May 22, 2024 • 12

AlirezaF138/Llama-3-Persian-8B-LoRA-Q6_K-GGUF

8B • Updated May 22, 2024 • 13 • 5

AlirezaF138/persian_llama_7B_merged-Q6_K-GGUF

Text Generation • 7B • Updated May 22, 2024 • 47 • 1

jeiku/Aura_3B-Q4_K_M-GGUF

3B • Updated May 22, 2024 • 6

Ken0751/Meta-Llama-3-8B-Q4_K_M-GGUF

Text Generation • 8B • Updated May 22, 2024 • 1

Salekeen/Phi-3-mini-128k-instruct-Q4_K_M-GGUF

Text Generation • 4B • Updated May 22, 2024 • 4

ybelkada/tiny-random-llama-Q4_K_M-GGUF

1.03M • Updated May 22, 2024 • 34

AlirezaF138/AVA-Qwen1.5-7B-Chat-Q6_K-GGUF

8B • Updated May 22, 2024 • 4

Bendy121165/gpt2-Q4_K_M-GGUF

0.2B • Updated May 22, 2024 • 7

reach-vb/TinyLlama-1.1B-Chat-v0.5-Q2_K-GGUF

1B • Updated May 22, 2024 • 2

linh5nb/Llama-2-7b-chat-luat-hon-nhan-1-Q4_K_M-GGUF

7B • Updated May 22, 2024

GeorgeBredis/Phi-3-mini-128k-instruct-Q4_K_M-GGUF

Text Generation • 4B • Updated May 22, 2024 • 8

moshecohentheking/Hebrew-Mistral-7B-Q4_K_M-GGUF

8B • Updated May 22, 2024 • 1

FilippoToso/Mistral-RAG-Q8_0-GGUF

7B • Updated May 22, 2024

SixOpen/Phi-3-mini-4k-instruct-IQ4_NL-imat.gguf

Text Generation • 4B • Updated May 22, 2024 • 13

aayushg159/Phi-3-mini-128k-instruct-Q4_K_M-GGUF

Text Generation • 4B • Updated May 22, 2024 • 19

JeffreyLind/Meta-Llama-3-8B-Q4_K_M-GGUF

Text Generation • 8B • Updated May 22, 2024 • 1

VlSav/saiga_llama3_8b_v7-Q6_K-GGUF

8B • Updated Jul 9, 2024 • 36 • 1

NNet/saiga_llama3_8b-Q6_K-GGUF

8B • Updated May 22, 2024 • 2