CUDA kernels for HIGGS quantization, packaged for the Hugging Face Kernel Hub.
Extracted from galqiwi/higgs-kernels.
higgs_dequantize_2_256 - codebook lookup: uint8 indices -> 2D fp16/bf16 vectors
higgs_quantize_2_256_f16 - nearest codebook entry search (fp16)
higgs_quantize_2_256_bf16 - nearest codebook entry search (bf16)
from kernels import get_kernel
higgs = get_kernel("galqiwi/higgs-kernels")
out = higgs.higgs_dequantize_2_256_kernel(x_uint8, grid_256x2)
indices = higgs.higgs_quantize_2_256_kernel(x_fp16_Nx2, grid_256x2, grid_norms_256)
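To make the semantics of the two operations concrete, here is a minimal reference sketch in plain Python (no GPU or dependencies needed). It is not the kernels' implementation: the function names `quantize_2d` and `dequantize_2d` are hypothetical, the toy codebook has 4 entries instead of 256, and the use of squared L2 norms for the precomputed norms is an assumption, since the exact convention behind `grid_norms_256` is not stated here.

```python
# Reference semantics only; hypothetical helper names, not the kernel API.

def quantize_2d(points, grid):
    """Return, for each 2D point, the index of its nearest codebook entry."""
    # Assumption: the precomputed norms are squared L2 norms of the entries.
    grid_norms = [gx * gx + gy * gy for gx, gy in grid]
    indices = []
    for px, py in points:
        # argmin ||p - g||^2 equals argmax (p . g - 0.5 * ||g||^2),
        # because ||p||^2 does not depend on the codebook entry g.
        scores = [
            px * gx + py * gy - 0.5 * norm
            for (gx, gy), norm in zip(grid, grid_norms)
        ]
        indices.append(max(range(len(grid)), key=scores.__getitem__))
    return indices


def dequantize_2d(indices, grid):
    """Codebook lookup: map each index back to its 2D codebook vector."""
    return [grid[i] for i in indices]


# Toy 4-entry codebook (the real one is 256x2).
grid = [(-1.0, -1.0), (-1.0, 1.0), (1.0, -1.0), (1.0, 1.0)]
points = [(0.9, 1.2), (-0.3, -2.0), (0.8, -0.1)]

indices = quantize_2d(points, grid)      # one small integer per 2D point
restored = dequantize_2d(indices, grid)  # nearest codebook vectors
```

Precomputing the norms once lets the search use a single dot product per codebook entry, which is presumably why the quantize kernel takes the norms as a separate argument.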
Pre-trained codebook included in grids.safetensors (256x2, key "2_256").
import torch
from kernels import get_kernel
higgs = get_kernel("galqiwi/higgs-kernels")
grid = higgs.load_optimal_grid_2_256(device="cuda", dtype=torch.float16)
License: Apache-2.0