view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 9 days ago • 63
view article Article Did GPT 5.2 make a breakthrough discovery in theoretical physics? 27 days ago • 61
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 27 days ago • 487
Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL Paper • 2602.03773 • Published Feb 3 • 12
view article Article Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments Jan 20 • 11
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face Feb 11, 2025 • 107
Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR Paper • 2602.05261 • Published Feb 5 • 51
view article Article Preference Tuning LLMs with Direct Preference Optimization Methods +3 Jan 18, 2024 • 79
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 307