Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

24,103

Base only

Active filters: llama-cpp

martintomov/falcon-11B-Q8_0-GGUF

11B • Updated Jul 9, 2024 • 2

fernandovmacedo/portuguese-Phi3-Tom-Cat-128k-instruct-Q4_K_M-GGUF

Text Generation • 4B • Updated Jul 9, 2024 • 12

porterrigby/Meta-Llama-3-8B-Instruct-Q4_K_M-GGUF

Text Generation • 8B • Updated Jul 9, 2024 • 6

fernandovmacedo/portuguese-Phi3-Tom-Cat-128k-instruct-IQ4_NL-GGUF

Text Generation • 4B • Updated Jul 9, 2024 • 15

v000000/SwallowMaid-8B-L3-SPPO-abliterated-Q8_0-GGUF

8B • Updated Jul 11, 2024 • 13 • 2

bunnycore/FunLLama3-8B-Q5_K_M-GGUF

8B • Updated Jul 9, 2024 • 1

Phraios/google-gemma-2-27b-it-ortho-Q5_K_M-GGUF

27B • Updated Jul 9, 2024 • 23

jeiku/Mistral-7B-Instruct-v0.3-Q4_K_M-GGUF

7B • Updated Jul 9, 2024 • 14

NikolayKozloff/Gemma-2-9b-indic-Q8_0-GGUF

9B • Updated Jul 9, 2024 • 3 • 1

Akul/gpt2-xl-Q6_K-GGUF

2B • Updated Jul 9, 2024 • 12

yoeldcd/OpenELM-270M-oasstguanaco-2e-ORPO-Q8_0-GGUF

0.3B • Updated Jul 10, 2024

ugurcelebi/llama3-8b-tr-finetuned-Q8_0-GGUF

8B • Updated Jul 10, 2024

MegaTom/TinyLlama-1.1B-Chat-v1.0-Q4_K_M-GGUF

1B • Updated Jul 10, 2024 • 34

leonn71/gte-Qwen2-1.5B-instruct-Q6_K-GGUF

Sentence Similarity • 2B • Updated Jul 10, 2024 • 4

andy0124/llama-3-Korean-Bllossom-8B-Q5_K_S-GGUF

8B • Updated Jul 10, 2024 • 7

huggioface/Meta-Llama-3-8B-Q5_K_M-GGUF

Text Generation • 8B • Updated Jul 10, 2024 • 1

Minuano/Qwen2-7B-Instruct-Q8_0-GGUF

Text Generation • 8B • Updated Jul 10, 2024 • 1

utterlygreat/omost-dolphin-2.9-llama3-8b-IQ4_XS-GGUF

8B • Updated Jul 10, 2024 • 33

leonn71/Mistral-7B-v0.3-Q4_K_M-GGUF

7B • Updated Jul 10, 2024 • 3

NikolayKozloff/ArliAI-Llama-3-8B-Formax-v1.0-Q4_0-GGUF

8B • Updated Jul 10, 2024 • 2 • 1

NikolayKozloff/ArliAI-Llama-3-8B-Formax-v1.0-Q5_0-GGUF

8B • Updated Jul 10, 2024 • 1 • 1

NikolayKozloff/ArliAI-Llama-3-8B-Formax-v1.0-IQ4_XS-GGUF

8B • Updated Jul 10, 2024 • 3 • 1

NikolayKozloff/ArliAI-Llama-3-8B-Formax-v1.0-IQ4_NL-GGUF

8B • Updated Jul 10, 2024 • 2 • 1

adityadhakal/Meta-Llama-3-8B-Q2_K-GGUF

Text Generation • 8B • Updated Jul 10, 2024 • 5

utterlygreat/omost-dolphin-2.9-llama3-8b-Q5_K_S-GGUF

8B • Updated Jul 10, 2024 • 14

utterlygreat/omost-dolphin-2.9-llama3-8b-Q8_0-GGUF

8B • Updated Jul 10, 2024 • 1

kscommhit/Llama3-ChatQA-1.5-8B-Q8_0-GGUF

Text Generation • 8B • Updated Jul 10, 2024 • 7

utterlygreat/omost-dolphin-2.9-llama3-8b-Q6_K-GGUF

8B • Updated Jul 10, 2024 • 16

NikolayKozloff/NuminaMath-7B-TIR-Q8_0-GGUF

Text Generation • 7B • Updated Jul 10, 2024 • 2 • 1

NikolayKozloff/NuminaMath-7B-TIR-Q5_0-GGUF

Text Generation • 7B • Updated Jul 10, 2024 • 6 • 1