MobileForge: Annotation-Free Adaptation for Mobile GUI Agents with Hierarchical Feedback-Guided Policy Optimization Paper • 2606.19930 • Published 10 days ago • 42
LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling Paper • 2606.18023 • Published 12 days ago • 207
MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation Paper • 2606.02470 • Published 27 days ago • 16
MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation Paper • 2606.02470 • Published 27 days ago • 16
IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools Paper • 2605.20682 • Published May 20 • 85
OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories Paper • 2605.04036 • Published May 5 • 72
MCP-Flow: Facilitating LLM Agents to Master Real-World, Diverse and Scaling MCP Tools Paper • 2510.24284 • Published Nov 1, 2025 • 1
MCP-Flow: Facilitating LLM Agents to Master Real-World, Diverse and Scaling MCP Tools Paper • 2510.24284 • Published Nov 1, 2025 • 1
FedMABench: Benchmarking Mobile Agents on Decentralized Heterogeneous User Data Paper • 2503.05143 • Published Mar 7, 2025 • 1
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects Paper • 2504.19838 • Published Apr 28, 2025 • 23
InfoMosaic-Bench: Evaluating Multi-Source Information Seeking in Tool-Augmented Agents Paper • 2510.02271 • Published Oct 2, 2025 • 8
MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments Paper • 2602.06075 • Published Feb 3 • 14
FedMABench: Benchmarking Mobile Agents on Decentralized Heterogeneous User Data Paper • 2503.05143 • Published Mar 7, 2025 • 1
MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments Paper • 2602.06075 • Published Feb 3 • 14
MAI-UI Technical Report: Real-World Centric Foundation GUI Agents Paper • 2512.22047 • Published Dec 26, 2025 • 31