maitreyaz/Llama-3-8B-AWQ-4bit

Tags: Text Generation · text-generation-inference · 4-bit precision

Llama 3 8B AWQ 4-bit Quantized

This is a 4-bit AWQ (Activation-aware Weight Quantization) quantized version of Meta's Llama 3 8B.

Downloads last month: 5

Safetensors

Model size: 8B params

Tensor type: I32 · F16
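
The I32 and F16 tensor types are consistent with how 4-bit checkpoints are commonly stored: eight 4-bit weights packed into one 32-bit integer, with half-precision values such as scales kept alongside. The sketch below illustrates only the packing idea. The function names `pack_int4` and `unpack_int4` are hypothetical, and the lowest-nibble-first order is an assumption made for illustration; it is not necessarily the layout used in this repository's files.

```python
def pack_int4(values):
    """Pack eight 4-bit values (0..15) into one 32-bit word, lowest nibble first.

    Illustrative only: the nibble order here is an assumption, not the
    layout of any particular quantization library.
    """
    if len(values) != 8:
        raise ValueError("expected exactly 8 values")
    word = 0
    for i, v in enumerate(values):
        if not 0 <= v <= 15:
            raise ValueError("value out of 4-bit range")
        word |= v << (4 * i)  # place value i in bits [4*i, 4*i + 3]
    return word


def unpack_int4(word):
    """Recover the eight 4-bit values from a packed 32-bit word."""
    return [(word >> (4 * i)) & 0xF for i in range(8)]


weights = [1, 2, 3, 4, 5, 6, 7, 8]
packed = pack_int4(weights)
print(hex(packed))           # 0x87654321
print(unpack_int4(packed))   # [1, 2, 3, 4, 5, 6, 7, 8]
```

Packing this way stores eight weights in the space of one 32-bit integer, which is why a 4-bit model can report integer tensors rather than a dedicated 4-bit type.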