5 221 19

QRQ

RichardQRQ

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Qwen-AgentWorld: Language World Models for General Agents

upvoted a paper 8 days ago

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

liked a dataset 9 days ago

xlangai/CUA-Gym

View all activity

Organizations

upvoted a paper 1 day ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 2 days ago • 101

upvoted a paper 8 days ago

GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?

Paper • 2606.17861 • Published 9 days ago • 55

liked a dataset 9 days ago

xlangai/CUA-Gym

Viewer • Updated 22 days ago • 10.9k • 1.33k • 21

upvoted a paper 9 days ago

Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation

Paper • 2606.17030 • Published 10 days ago • 30

upvoted a paper 10 days ago

Agents' Last Exam

Paper • 2606.05405 • Published 22 days ago • 363

upvoted a paper 12 days ago

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

Paper • 2606.09426 • Published 17 days ago • 102

upvoted a paper 14 days ago

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

Paper • 2606.11926 • Published 15 days ago • 118

upvoted a paper 16 days ago

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Paper • 2606.07297 • Published 20 days ago • 119

liked a dataset 20 days ago

agents-last-exam/agents-last-exam

Viewer • Updated 13 days ago • 153 • 8.08k • 190

upvoted 4 papers about 1 month ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published May 19 • 108

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published May 14 • 147

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published May 19 • 190

MMSkills: Towards Multimodal Skills for General Visual Agents

Paper • 2605.13527 • Published May 14 • 121

upvoted 7 papers about 2 months ago

ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue

Paper • 2605.01371 • Published May 2 • 6

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published May 1 • 49

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 286

SketchVLM: Vision language models can annotate images to explain thoughts and guide users

Paper • 2604.22875 • Published Apr 23 • 38

ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning

Paper • 2604.24300 • Published Apr 27 • 68

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 119

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published Apr 24 • 231

QRQ

AI & ML interests

Recent Activity

Organizations

RichardQRQ's activity