Models: 1,899

Active filters: nvidia

mradermacher/Llama3-ChatQA-1.5-8B-GGUF

8B • Updated May 5, 2024 • 138

mradermacher/Llama3-ChatQA-1.5-8B-i1-GGUF

8B • Updated May 5, 2024 • 104 • 1

mradermacher/Llama3-ChatQA-1.5-70B-GGUF

71B • Updated May 5, 2024 • 133 • 1

mradermacher/Llama3-ChatQA-1.5-70B-i1-GGUF

71B • Updated May 5, 2024 • 141 • 10

Sreenington/Llama-3-8B-ChatQA-AWQ

Text Generation • 8B • Updated May 5, 2024 • 4 • 2

beratcmn/Llama3-ChatQA-1.5-8B-lora

Text Generation • Updated May 3, 2024 • 4

beratcmn/Llama3-ChatQA-1.5-8B-256K

Text Generation • 8B • Updated May 3, 2024 • 5 • 6

LoneStriker/Llama3-ChatQA-1.5-70B-GGUF

Text Generation • 71B • Updated May 3, 2024 • 4 • 2

MoMonir/Llama3-ChatQA-1.5-8B-GGUF

Text Generation • 8B • Updated May 3, 2024 • 15

blockblockblock/Llama3-ChatQA-1.5-8B-bpw4.2-exl2

Text Generation • Updated May 3, 2024 • 1

LoneStriker/Llama3-ChatQA-1.5-70B-2.4bpw-h6-exl2

Text Generation • Updated May 3, 2024 • 2

LoneStriker/Llama3-ChatQA-1.5-70B-3.5bpw-h6-exl2

Text Generation • Updated May 3, 2024 • 2

LoneStriker/Llama3-ChatQA-1.5-70B-4.0bpw-h6-exl2

Text Generation • Updated May 3, 2024 • 2

LoneStriker/Llama3-ChatQA-1.5-70B-4.65bpw-h6-exl2

Text Generation • Updated May 3, 2024 • 3

LoneStriker/Llama3-ChatQA-1.5-70B-5.0bpw-h6-exl2

Text Generation • Updated May 3, 2024 • 1

LoneStriker/Llama3-ChatQA-1.5-70B-6.0bpw-h6-exl2

Text Generation • Updated May 4, 2024 • 1

bartowski/Llama3-ChatQA-1.5-70B-GGUF

Text Generation • 71B • Updated May 4, 2024 • 154 • 9

alexcovo/Llama3-ChatQA-1.5-8B-256K-Q4_K_M-GGUF

Text Generation • 8B • Updated May 4, 2024 • 13

lmstudio-community/Llama3-ChatQA-1.5-70B-GGUF

Text Generation • 71B • Updated May 4, 2024 • 121 • 6

lmstudio-community/Llama3-ChatQA-1.5-8B-GGUF

Text Generation • 8B • Updated May 4, 2024 • 154 • 6

QuantFactory/NVIDIA-Llama3-ChatQA-1.5-8B-GGUF

Text Generation • 8B • Updated May 6, 2024 • 313 • 2

LiteLLMs/Llama3-ChatQA-1.5-8B-GGUF

Text Generation • 8B • Updated May 28, 2024 • 3

ivanzou/fdsfsdf24e34e

Updated Jul 29, 2024

rollend/Llama3-ChatQA-1.5-8B-Q4_K_M-GGUF

Text Generation • 8B • Updated May 31, 2024 • 4

narainp/Llama3-ChatQA-1.5-8B-Q4_K_M-GGUF

Text Generation • 8B • Updated May 31, 2024 • 14

nvidia/Llama3-70B-SteerLM-RM

Updated Jun 19, 2024 • 18 • 43

nvidia/Llama3-70B-PPO-Chat

Updated Jun 14, 2024 • 7

nvidia/mamba2-8b-3t-4k

Text Generation • Updated Jun 13, 2024 • 23

nvidia/mamba2-hybrid-8b-3t-128k

Text Generation • Updated Jun 13, 2024 • 46

nvidia/mamba2-hybrid-8b-3t-32k

Text Generation • Updated Jun 13, 2024 • 6