Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
amd
/
gpt-oss-20b-WFP8-AFP8-KVFP8
like
0
Follow
AMD
2.31k
Safetensors
gpt_oss
quark
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
6
main
gpt-oss-20b-WFP8-AFP8-KVFP8
22.1 GB
1 contributor
History:
5 commits
XuebinWang
update readme with disclaimer (
#4
)
2431671
verified
30 days ago
.gitattributes
1.57 kB
KV cache quantization in FP8 (#1)
4 months ago
LICENSE
11.4 kB
update README (results etc) and upload LICENSE and USAGE_POLICY (#2)
about 2 months ago
README.md
6.97 kB
update readme with disclaimer (#4)
30 days ago
USAGE_POLICY
200 Bytes
update README (results etc) and upload LICENSE and USAGE_POLICY (#2)
about 2 months ago
chat_template.jinja
16.7 kB
KV cache quantization in FP8 (#1)
4 months ago
config.json
9.37 kB
KV cache quantization in FP8 (#1)
4 months ago
generation_config.json
172 Bytes
KV cache quantization in FP8 (#1)
4 months ago
model-00001-of-00005.safetensors
4.99 GB
xet
KV cache quantization in FP8 (#1)
4 months ago
model-00002-of-00005.safetensors
5 GB
xet
KV cache quantization in FP8 (#1)
4 months ago
model-00003-of-00005.safetensors
4.99 GB
xet
KV cache quantization in FP8 (#1)
4 months ago
model-00004-of-00005.safetensors
4.99 GB
xet
KV cache quantization in FP8 (#1)
4 months ago
model-00005-of-00005.safetensors
2.11 GB
xet
KV cache quantization in FP8 (#1)
4 months ago
model.safetensors.index.json
624 kB
KV cache quantization in FP8 (#1)
4 months ago
special_tokens_map.json
323 Bytes
KV cache quantization in FP8 (#1)
4 months ago
tokenizer.json
27.9 MB
xet
KV cache quantization in FP8 (#1)
4 months ago
tokenizer_config.json
4.22 kB
KV cache quantization in FP8 (#1)
4 months ago