[work in progress] Upload optimized language model w/ WebGPU-compatible GQA 2bb3f5c verified Xenova HF Staff committed on Apr 3