ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents Paper • 2604.23781 • Published 15 days ago • 33
MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences Paper • 2603.27813 • Published Mar 29 • 23
Agent2World: Learning to Generate Symbolic World Models via Adaptive Multi-Agent Feedback Paper • 2512.22336 • Published Dec 26, 2025 • 2
Tool-Genesis: A Task-Driven Tool Creation Benchmark for Self-Evolving Language Agent Paper • 2603.05578 • Published Mar 5
GlyphBanana: Advancing Precise Text Rendering Through Agentic Workflows Paper • 2603.12155 • Published Mar 12
MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences Paper • 2603.27813 • Published Mar 29 • 23
MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences Paper • 2603.27813 • Published Mar 29 • 23