Cerebras REAP Collection Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 19 items • Updated 8 days ago • 66
Qwen3 DWQ Quants Collection High-quality 4-bit quants of the Qwen3 model family. • 8 items • Updated Jul 11 • 7
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 550