MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation Paper • 2606.02470 • Published 3 days ago • 14
IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools Paper • 2605.20682 • Published 15 days ago • 83
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published 21 days ago • 145
SlimSpec: Low-Rank Draft LM-Head for Accelerated Speculative Decoding Paper • 2605.10453 • Published 24 days ago • 9
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published Apr 22 • 243
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published Mar 27 • 365
Consistency Amplifies: How Behavioral Variance Shapes Agent Accuracy Paper • 2603.25764 • Published Mar 26 • 5