RedHatAI/moe
kernel
License: apache-2.0
Ref: refs/pr/1
Path: moe/torch-ext/moe
484 kB, 4 contributors, History: 2 commits
Latest commit: b41d28a by danieldk (HF Staff), 12 months ago: Vendor `w8a8_block_fp8_matmul` and `per_token_group_quant_fp8`
Every entry below was last changed 12 months ago by the commit "Vendor `w8a8_block_fp8_matmul` and `per_token_group_quant_fp8`".

configs (directory)
utils (directory)
__init__.py          2.4 kB
fp8.py               2.45 kB
fp8_utils.py         12.2 kB
fused_marlin_moe.py  12.9 kB
fused_moe.py         48.3 kB
platforms.py         1.77 kB
scalar_type.py       11.8 kB