LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards Paper • 2605.31584 • Published 23 days ago • 42
STREAM: A Data-Centric Framework for Mining High-Value Task-Oriented Dialogues from Streaming Media Paper • 2605.25162 • Published 28 days ago • 4
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 25 days ago • 429