CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Paper • 2602.24286 • Published 4 days ago • 56
view article Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 27 days ago • 83
Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception Paper • 2602.11858 • Published 19 days ago • 58
WorldCompass: Reinforcement Learning for Long-Horizon World Models Paper • 2602.09022 • Published 22 days ago • 20
What Drives Success in Physical Planning with Joint-Embedding Predictive World Models? Paper • 2512.24497 • Published Dec 30, 2025 • 7
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models Paper • 2510.05034 • Published Oct 6, 2025 • 51
view article Article Introducing Daggr: Chain apps programmatically, inspect visually +3 Jan 29 • 103
view article Article Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments Jan 20 • 11
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published Jan 14 • 33
Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench Paper • 2512.02942 • Published Dec 2, 2025 • 5
Self-Improving VLM Judges Without Human Annotations Paper • 2512.05145 • Published Dec 2, 2025 • 20
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models Paper • 2512.07783 • Published Dec 8, 2025 • 39
view article Article Smol2Operator: Post-Training GUI Agents for Computer Use +3 Sep 23, 2025 • 137