view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 4 days ago • 43
Distil Efficiency Benchmarks Collection Collection of models used in the blog post www.distillabs.ai/blog/the-10x-inference-tax-you-dont-have-to-pay • 9 items • Updated 11 days ago • 3
Quantized Qwen3.5 Collection Verified models. Compatible with Transformers v5.3 and vLLM v0.16.1rc1 (nightly). Under evaluation. • 9 items • Updated 1 day ago • 9
huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated Image-Text-to-Text • 36B • Updated 12 days ago • 30.4k • 217
Qwen3-MoE Collection Compressed Qwen3 MoE models with a reduced number of experts. See additional models at https://huggingface.co/bknyaz. • 9 items • Updated about 1 month ago • 3
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated 17 days ago • 130