RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published 3 days ago • 16
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 8 days ago • 310
Appear2Meaning: A Cross-Cultural Benchmark for Structured Cultural Metadata Inference from Images Paper • 2604.07338 • Published 8 days ago • 5
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 14 days ago • 470
Devy1/Qwen2.5-Coder-CONTROL-checkpoints_multi_language_2k-1.5B-Base-3 2B • Updated 15 days ago • 18 • 1
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 263