JPishikawa
/

Llama-3.3-Swallow-70B-Instruct-v0.4-W4A16

Text Generation

text-generation-inference

compressed-tensors

Model card Files Files and versions

This model is the quantized version of tokyotech-llm/Llama-3.3-Swallow-70B-Instruct-v0.4 by LLM Compressor.
This model adheres to the same licensing terms as the original model.

Downloads last month: 260

Safetensors

Model size

11B params

Tensor type

I64

·

I32

·

BF16

·

Model tree for JPishikawa/Llama-3.3-Swallow-70B-Instruct-v0.4-W4A16

Base model

tokyotech-llm/Llama-3.3-Swallow-70B-Instruct-v0.4

Quantized

(11)

this model