Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
amd
/
gpt-oss-20b-WFP8-AFP8-KVFP8
like
0
Follow
AMD
2.5k
Safetensors
gpt_oss
quark
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
6
KV cache quantization in FP8
#1
by
XuebinWang
- opened
Nov 4, 2025
base:
refs/heads/main
←
from:
refs/pr/1
Discussion
Files changed
+7593
-0
XuebinWang
AMD org
Nov 4, 2025
No description provided.
initial commit to firstly support kv cache quantization in FP8
e370f343
XuebinWang
changed pull request status to
open
Nov 4, 2025
XuebinWang
changed pull request status to
merged
Nov 4, 2025
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment