RedHatAI
/

Qwen3-4B-Thinking-2507-quantized.w8a8

Text Generation

text-generation-inference

8-bit precision

compressed-tensors

Model card Files Files and versions

Qwen3-4B-Thinking-2507-quantized.w8a8

5.21 GB

Ctrl+K

Ctrl+K

1 contributor

History: 5 commits

ChibuUkachi's picture

update quantization message

a6d3fbb verified 1 day ago

.gitattributes

1.57 kB
Upload folder using huggingface_hub about 2 months ago
README.md

7.55 kB
update quantization message 1 day ago
added_tokens.json

707 Bytes
Upload folder using huggingface_hub about 2 months ago
chat_template.jinja

4.05 kB
Upload folder using huggingface_hub about 2 months ago
config.json

2.7 kB
Upload folder using huggingface_hub about 2 months ago
generation_config.json

214 Bytes
Upload folder using huggingface_hub about 2 months ago
merges.txt

1.67 MB
Upload folder using huggingface_hub about 2 months ago
model-00001-of-00002.safetensors

4.41 GB
xet

Upload folder using huggingface_hub about 2 months ago
model-00002-of-00002.safetensors

778 MB
xet

Upload folder using huggingface_hub about 2 months ago
model.safetensors.index.json

54.9 kB
Upload folder using huggingface_hub about 2 months ago
recipe.yaml

1.27 kB
Upload folder using huggingface_hub about 2 months ago
special_tokens_map.json

613 Bytes
Upload folder using huggingface_hub about 2 months ago
tokenizer.json

11.4 MB
xet

Upload folder using huggingface_hub about 2 months ago
tokenizer_config.json

5.4 kB
Upload folder using huggingface_hub about 2 months ago
vocab.json

2.78 MB
Upload folder using huggingface_hub about 2 months ago