Data-Efficient RLVR via Off-Policy Influence Guidance Paper • 2510.26491 • Published Oct 30, 2025 • 11
The Smol Training Playbook • Featured • 2.93k • The secrets to building world-class LLMs
Qwen Image Edit • Featured • 804 • Edit and enhance images based on descriptive instructions
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 205