inference-optimization
/

granite-4.0-h-tiny-quantized.w8a8

granitemoehybrid

8-bit precision

compressed-tensors

Model card Files Files and versions

granite-4.0-h-tiny-quantized.w8a8

1 contributor

History: 2 commits

krishnateja95's picture

Upload folder using huggingface_hub

5cae4ef verified about 2 months ago

.gitattributes
1.52 kB

initial commit about 2 months ago
README.md
31 Bytes

initial commit about 2 months ago
chat_template.jinja
6.42 kB

Upload folder using huggingface_hub about 2 months ago
config.json
5.27 kB

Upload folder using huggingface_hub about 2 months ago
generation_config.json
147 Bytes

Upload folder using huggingface_hub about 2 months ago
merges.txt
917 kB

Upload folder using huggingface_hub about 2 months ago
model-00001-of-00002.safetensors
3.56 GB
xet

Upload folder using huggingface_hub about 2 months ago
model-00002-of-00002.safetensors
3.56 GB
xet

Upload folder using huggingface_hub about 2 months ago
model.safetensors.index.json
72.4 kB

Upload folder using huggingface_hub about 2 months ago
recipe.yaml
660 Bytes

Upload folder using huggingface_hub about 2 months ago
special_tokens_map.json
579 Bytes

Upload folder using huggingface_hub about 2 months ago
tokenizer.json
7.15 MB

Upload folder using huggingface_hub about 2 months ago
tokenizer_config.json
17.7 kB

Upload folder using huggingface_hub about 2 months ago
vocab.json
1.61 MB

Upload folder using huggingface_hub about 2 months ago