Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
RedHatAI
/
quantization
like
6
Follow
Red Hat AI
2.23k
kernel
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
3
d26f884
quantization
/
cutlass_extensions
/
epilogue
Ctrl+K
Ctrl+K
2 contributors
History:
3 commits
danieldk
HF Staff
Sync on vLLM 20240402
d26f884
about 1 year ago
broadcast_load_epilogue_array_c3x.hpp
Safe
16.9 kB
Sync on vLLM 20240402
about 1 year ago
broadcast_load_epilogue_c2x.hpp
Safe
15.5 kB
Add cutlass_w8a8
over 1 year ago
broadcast_load_epilogue_c3x.hpp
Safe
16.6 kB
Add cutlass_w8a8
over 1 year ago
scaled_mm_epilogues_c2x.hpp
Safe
13.4 kB
Sync with vLLM
over 1 year ago
scaled_mm_epilogues_c3x.hpp
Safe
13.5 kB
Sync with vLLM
over 1 year ago