FutureSim: Replaying World Events to Evaluate Adaptive Agents Paper • 2605.15188 • Published 6 days ago • 6
Running Agents 1 FutureSim Agent Trajectories 🚀 1 Trajectories of frontier agents on the FutureSim benchmark.
Running Agents 1 FutureSim Agent Trajectories 🚀 1 Trajectories of frontier agents on the FutureSim benchmark.
Training AI Co-Scientists Using Rubric Rewards Paper • 2512.23707 • Published Dec 29, 2025 • 21
Scaling Open-Ended Reasoning to Predict the Future Paper • 2512.25070 • Published Dec 31, 2025 • 20