Intel/Step-3.5-Flash-int4-mixed-AutoRound Text Generation • 28B • Updated about 22 hours ago • 4
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs Paper • 2309.05516 • Published Sep 11, 2023 • 11