Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Firworks
/
Nemotron-Cascade-8B-nvfp4
like
0
Safetensors
HuggingFaceH4/ultrachat_200k
qwen3
8-bit precision
compressed-tensors
License:
nvidia-open-model-license
Model card
Files
Files and versions
xet
Community
main
Nemotron-Cascade-8B-nvfp4
6.41 GB
1 contributor
History:
3 commits
Firworks
Update README.md
69a4a0b
verified
25 days ago
.gitattributes
1.57 kB
Add NVFP4 quantized checkpoint
25 days ago
README.md
1.22 kB
Update README.md
25 days ago
added_tokens.json
707 Bytes
Add NVFP4 quantized checkpoint
25 days ago
chat_template.jinja
4.62 kB
Add NVFP4 quantized checkpoint
25 days ago
config.json
2.74 kB
Add NVFP4 quantized checkpoint
25 days ago
generation_config.json
139 Bytes
Add NVFP4 quantized checkpoint
25 days ago
merges.txt
1.67 MB
Add NVFP4 quantized checkpoint
25 days ago
model-00001-of-00002.safetensors
4.99 GB
xet
Add NVFP4 quantized checkpoint
25 days ago
model-00002-of-00002.safetensors
1.41 GB
xet
Add NVFP4 quantized checkpoint
25 days ago
model.safetensors.index.json
104 kB
Add NVFP4 quantized checkpoint
25 days ago
recipe.yaml
252 Bytes
Add NVFP4 quantized checkpoint
25 days ago
special_tokens_map.json
613 Bytes
Add NVFP4 quantized checkpoint
25 days ago
tokenizer.json
11.4 MB
xet
Add NVFP4 quantized checkpoint
25 days ago
tokenizer_config.json
5.4 kB
Add NVFP4 quantized checkpoint
25 days ago
vocab.json
2.78 MB
Add NVFP4 quantized checkpoint
25 days ago