Yibo Wang

yiboowang

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago

Learning from the Self-future: On-policy Self-distillation for dLLMs

Paper • 2606.18195 • Published 15 days ago • 76

upvoted a paper 19 days ago

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 20 days ago • 142

upvoted a paper about 1 month ago

HRBench: Benchmarking and Understanding Thinking-Mode Switch Strategies in Hybrid-Reasoning LLMs

Paper • 2605.28398 • Published May 27 • 15

upvoted a paper about 2 months ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published May 11 • 29

upvoted a paper 2 months ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published Apr 30 • 92

upvoted a paper 3 months ago

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

Paper • 2603.28407 • Published Mar 30 • 72

upvoted 2 papers 4 months ago

MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

Paper • 2603.03756 • Published Mar 4 • 90

Enhancing Spatial Understanding in Image Generation via Reward Modeling

Paper • 2602.24233 • Published Feb 27 • 60

upvoted 2 papers 5 months ago

VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning

Paper • 2601.22069 • Published Jan 29 • 7

Language-based Trial and Error Falls Behind in the Era of Experience

Paper • 2601.21754 • Published Jan 29 • 16

upvoted 3 papers 6 months ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published Jan 13 • 150

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published Jan 14 • 92

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published Jan 14 • 128

upvoted 2 papers 7 months ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 160

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 96

upvoted a paper about 1 year ago

AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization

Paper • 2504.21659 • Published Apr 30, 2025 • 14

upvoted 2 papers over 1 year ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 39

WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents

Paper • 2410.07484 • Published Oct 9, 2024 • 51
