World2Minecraft: Occupancy-Driven Simulated Scenes Construction Paper • 2604.27578 • Published 8 days ago • 4 • 2
SketchVLM: Vision language models can annotate images to explain thoughts and guide users Paper • 2604.22875 • Published 15 days ago • 34 • 2
Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items Paper • 2604.19748 • Published 17 days ago • 249 • 7
Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items Paper • 2604.19748 • Published 17 days ago • 249 • 7
Evaluation-driven Scaling for Scientific Discovery Paper • 2604.19341 • Published 17 days ago • 3 • 2
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents Paper • 2604.11784 • Published 25 days ago • 143 • 8
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents Paper • 2604.11784 • Published 25 days ago • 143 • 8
VenusBench-Mobile: A Challenging and User-Centric Benchmark for Mobile GUI Agents with Capability Diagnostics Paper • 2604.06182 • Published Feb 6 • 4 • 2
Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants Paper • 2604.00842 • Published Apr 1 • 14 • 2
Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design Paper • 2603.28376 • Published Mar 30 • 23 • 2
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience Paper • 2603.24533 • Published Mar 25 • 47 • 4
RealMaster: Lifting Rendered Scenes into Photorealistic Video Paper • 2603.23462 • Published Mar 24 • 33 • 7
RealMaster: Lifting Rendered Scenes into Photorealistic Video Paper • 2603.23462 • Published Mar 24 • 33 • 7
CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR Paper • 2603.10101 • Published Mar 10 • 5 • 2
Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration? Paper • 2603.03202 • Published Mar 3 • 17 • 2