EXL2 Tool Calling Model Quants Collection A collection of tool calling models quanted for EXL2 by yours truly • 7 items • Updated Jul 18, 2024 • 1
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated 11 days ago • 129