MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation Paper • 2606.02470 • Published 3 days ago • 14
Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering Paper • 2601.10402 • Published Jan 15 • 37