unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF Text Generation • 121B • Updated 3 days ago • 33.6k • 53
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 3 days ago • 113
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI • 23 days ago • 485
SLA2: Sparse-Linear Attention with Learnable Routing and QAT Paper • 2602.12675 • Published 30 days ago • 54