GSQ - a ISTA-DASLab Collection

ISTA-DASLab 's Collections

GSQ

updated May 25

GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling, https://huggingface.co/papers/2604.18556

GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling

Paper • 2604.18556 • Published Apr 20 • 9
ISTA-DASLab/Kimi-K2.6-2Bit-GSQ

Image-Text-to-Text • 84B • Updated 16 days ago • 30
ISTA-DASLab/Kimi-K2.5-2Bit-GSQ

Image-Text-to-Text • 84B • Updated 16 days ago • 27
ISTA-DASLab/Llama-3.1-70B-Instruct-2Bit-GSQ

Text Generation • 7B • Updated May 18 • 39
ISTA-DASLab/Llama-3.1-70B-Instruct-3Bit-GSQ

Text Generation • 9B • Updated May 18 • 35
ISTA-DASLab/Qwen3.6-35B-A3B-2Bit-GSQ

Image-Text-to-Text • 5B • Updated 16 days ago • 376
ISTA-DASLab/Qwen3-4B-GGUF-GSQ

Text Generation • 4B • Updated May 18 • 28
ISTA-DASLab/Qwen3-8B-GGUF-GSQ

Text Generation • 8B • Updated May 18 • 56 • 3
ISTA-DASLab/Qwen3.5-4B-GGUF-GSQ

Text Generation • 4B • Updated May 26 • 143 • 1