view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 3 days ago • 37
view article Article From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output Feb 7 • 22
view article Article The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU Jan 2 • 16
🧮functiongemma ft mobile-actions Collection A collection of functiongemma-270m-it models fine-tuned on mobile actions dataset for Spanish, French and Italian • 3 items • Updated Jan 5 • 3
INTELLECT-3 Collection INTELLECT-3: A 100B+ MoE trained with large-scale RL • 5 items • Updated 23 days ago • 12
SYNTH Collection Fully generalist synthetic dataset and SOTA small reasoners • 3 items • Updated Nov 10, 2025 • 12
view article Article Extract Text and Knowledge from Images with Open Vision Language Models Oct 23, 2025 • 5
view article Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 Sep 4, 2025 • 273
view article Article Exploring Environments Hub: Your Language Model needs better (open) environments to learn Sep 4, 2025 • 30
view article Article Some Safety and Security tests using LlamaGuard 4 12B and PromptGuard2 Aug 28, 2025 • 1
Mergenetic: a Simple Evolutionary Model Merging Library Paper • 2505.11427 • Published May 16, 2025 • 14