3

xuzishan

AI & ML interests

None yet

Recent Activity

updated a dataset 9 days ago

xuzishan/ib-attribution-160

published a dataset 14 days ago

xuzishan/ib-attribution-160

updated a model 23 days ago

xuzishan/drmas-checklist-envscaler-share-b200-step20

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

Anticipate and Learn: Unleashing Idle-Time Compute in Proactive Agents

Paper • 2605.25971 • Published May 25 • 16

upvoted 2 papers 2 months ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published May 11 • 29

Dynamic Skill Lifecycle Management for Agentic Reinforcement Learning

Paper • 2605.10923 • Published May 11 • 13

upvoted 3 papers 5 months ago

CVE-Factory: Scaling Expert-Level Agentic Tasks for Code Security Vulnerability

Paper • 2602.03012 • Published Feb 3 • 3

V_0: A Generalist Value Model for Any Policy at State Zero

Paper • 2602.03584 • Published Feb 3 • 22

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published Feb 3 • 32

upvoted 4 papers 6 months ago

ToolACE-MCP: Generalizing History-Aware Routing from MCP Tools to the Agent Web

Paper • 2601.08276 • Published Jan 13 • 7

MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences

Paper • 2601.06789 • Published Jan 11 • 82

RealMem: Benchmarking LLMs in Real-World Memory-Driven Interaction

Paper • 2601.06966 • Published Jan 11 • 9

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published Jan 11 • 215

upvoted a paper 7 months ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 160

upvoted 2 papers 9 months ago

InteractComp: Evaluating Search Agents With Ambiguous Queries

Paper • 2510.24668 • Published Oct 28, 2025 • 100

ReCode: Unify Plan and Action for Universal Granularity Control

Paper • 2510.23564 • Published Oct 27, 2025 • 123