Xixi Wu

xxwu

https://wxxshirley.github.io/

WxxShirley

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Dynamic Skill Lifecycle Management for Agentic Reinforcement Learning

updated a collection about 2 months ago

Organizations

None yet

upvoted 2 papers about 2 months ago

Dynamic Skill Lifecycle Management for Agentic Reinforcement Learning

Paper • 2605.10923 • Published May 11 • 13

Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe

Paper • 2603.21972 • Published Mar 23 • 5

upvoted a paper 3 months ago

Lingshu-Cell: A generative cellular world model for transcriptome modeling toward virtual cells

Paper • 2603.25240 • Published Mar 26 • 78

upvoted a collection 3 months ago

Agent-STAR

Resources for paper "Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe" • 11 items • Updated May 12 • 2

upvoted a paper 5 months ago

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 201

upvoted 3 papers 8 months ago

ParallelMuse: Agentic Parallel Thinking for Deep Information Seeking

Paper • 2510.24698 • Published Oct 28, 2025 • 21

Repurposing Synthetic Data for Fine-grained Search Agent Supervision

Paper • 2510.24694 • Published Oct 28, 2025 • 25

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28, 2025 • 104

upvoted 6 papers 9 months ago

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

Paper • 2509.13313 • Published Sep 16, 2025 • 80

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Paper • 2509.13309 • Published Sep 16, 2025 • 68

Towards General Agentic Intelligence via Environment Scaling

Paper • 2509.13311 • Published Sep 16, 2025 • 73

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 118

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Paper • 2509.13305 • Published Sep 16, 2025 • 92

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

Paper • 2509.13312 • Published Sep 16, 2025 • 107

upvoted a paper 11 months ago

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

Paper • 2508.05748 • Published Aug 7, 2025 • 143

upvoted a paper 12 months ago

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3, 2025 • 127