inference-optimization/Qwen3-0.6B-debug-multiply-W4A16-G128
Safetensors
qwen3
compressed-tensors
Qwen3-0.6B-debug-multiply-W4A16-G128/recipe.yaml
kylesayrs
Copy from nm-testing/Qwen3-0.6B-debug-multiply-W4A16-G128
3eca842
verified
3 months ago
130 Bytes
default_stage:
  default_modifiers:
    QuantizationModifier:
      targets: [Linear]
      ignore: [lm_head]
      scheme: W4A16
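
For readers unfamiliar with the scheme name: W4A16 conventionally means 4-bit weights with 16-bit activations, and the G128 suffix in the repository name suggests one scale per group of 128 weights. The recipe does not spell out the rounding rule, so the sketch below assumes symmetric round-to-nearest quantization. The function names are hypothetical and not part of any library; this illustrates the arithmetic only, not how the modifier is implemented.

```python
# Illustrative sketch only: symmetric, group-wise 4-bit weight quantization.
# Assumptions (not stated in the recipe): symmetric range, round-to-nearest,
# one floating-point scale per group of 128 weights.

def quantize_group(weights, num_bits=4):
    """Quantize one group of weights to signed integers with a shared scale."""
    qmax = 2 ** (num_bits - 1) - 1      # 7 for 4 bits
    qmin = -(2 ** (num_bits - 1))       # -8 for 4 bits
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    q = [min(qmax, max(qmin, round(w / scale))) for w in weights]
    return q, scale


def quantize_row(row, group_size=128, num_bits=4):
    """Split one weight row into consecutive groups and quantize each."""
    return [
        quantize_group(row[start:start + group_size], num_bits)
        for start in range(0, len(row), group_size)
    ]


def dequantize_row(groups):
    """Recover approximate weights by multiplying each integer by its scale."""
    return [q * scale for qs, scale in groups for q in qs]


if __name__ == "__main__":
    row = [i / 256 - 0.5 for i in range(256)]   # toy weight row, 2 groups
    groups = quantize_row(row)
    restored = dequantize_row(groups)
    worst = max(abs(a - b) for a, b in zip(row, restored))
    print(f"groups: {len(groups)}, worst absolute error: {worst:.4f}")
```

Under these assumptions, each weight's reconstruction error is bounded by half of its group's scale, which is why smaller groups generally track the original weights more closely at the cost of storing more scales.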