inference-optimization
/

granite-4.0-h-tiny-FP8-block

Text Generation

granitemoehybrid

compressed-tensors

Model card Files Files and versions

granite-4.0-h-tiny-FP8-block

7.59 GB

1 contributor

History: 5 commits

krishnateja95's picture

Update README.md

ec4f380 verified about 2 months ago

.gitattributes

1.52 kB

initial commit about 2 months ago
README.md

6.97 kB

Update README.md about 2 months ago
chat_template.jinja

6.42 kB

Upload folder using huggingface_hub about 2 months ago
config.json

8.54 kB

Update config.json about 2 months ago
generation_config.json

147 Bytes

Upload folder using huggingface_hub about 2 months ago
merges.txt

917 kB

Upload folder using huggingface_hub about 2 months ago
model-00001-of-00002.safetensors

3.79 GB
xet

Upload folder using huggingface_hub about 2 months ago
model-00002-of-00002.safetensors

3.79 GB
xet

Upload folder using huggingface_hub about 2 months ago
model.safetensors.index.json

65.5 kB

Upload folder using huggingface_hub about 2 months ago
recipe.yaml

220 Bytes

Upload folder using huggingface_hub about 2 months ago
special_tokens_map.json

579 Bytes

Upload folder using huggingface_hub about 2 months ago
tokenizer.json

7.15 MB

Upload folder using huggingface_hub about 2 months ago
tokenizer_config.json

17.7 kB

Upload folder using huggingface_hub about 2 months ago
vocab.json

1.61 MB

Upload folder using huggingface_hub about 2 months ago