Diffusion Policy: Visuomotor Policy Learning via Action Diffusion Paper • 2303.04137 • Published Mar 7, 2023 • 6
view article Article The Optimal Architecture for Small Language Models codelion • Dec 26, 2025 • 121
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy +4 medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf • Sep 18, 2024 • 5
view article Article I trained a Language Model to schedule events with GRPO! anakin87 • Apr 29, 2025 • 95
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated Mar 2 • 728