leideng/QCFuse / srt /layers
466 GB
2,808 files
Updated 13 days ago
Name
Size
attention
deep_gemm_wrapper
moe
quantization
activation.py13.4 kB
xet
amx_utils.py2.87 kB
xet
communicator.py24.5 kB
xet
dp_attention.py16.8 kB
xet
elementwise.py18.8 kB
xet
flashinfer_comm_fusion.py6.73 kB
xet
layernorm.py12.3 kB
xet
linear.py56.1 kB
xet
logits_processor.py35.6 kB
xet
model_parallel.py6.08 kB
xet
modelopt_utils.py335 Bytes
xet
multimodal.py5.11 kB
xet
parameter.py18.4 kB
xet
pooler.py3.81 kB
xet
radix_attention.py5.2 kB
xet
rocm_linear_utils.py1.37 kB
xet
rotary_embedding.py101 kB
xet
sampler.py20.6 kB
xet
sparse_pooler.py3.42 kB
xet
torchao_utils.py4.06 kB
xet
utils.py1.91 kB
xet
vocab_parallel_embedding.py22.7 kB
xet
Total size
466 GB
Files
2,808
Last updated
Jun 16
Pre-warmed CDN
US EU US EU

Contributors