2 10

Zhongyu Yang

yzzyu

https://01yzzyu.github.io/

AI & ML interests

None yet

Recent Activity

published a dataset 11 days ago

yzzyu/UFO

updated a dataset 12 days ago

yzzyu/UFO

authored a paper about 1 month ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

View all activity

Organizations

published a dataset 11 days ago

yzzyu/UFO

Viewer • Updated 11 days ago • 3.37k • 1.2k

updated a dataset 12 days ago

yzzyu/UFO

Viewer • Updated 11 days ago • 3.37k • 1.2k

authored a paper about 1 month ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Paper • 2605.20342 • Published May 19 • 34

upvoted a paper about 1 month ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published May 11 • 29

authored 2 papers about 2 months ago

MultiHaystack: Benchmarking Multimodal Retrieval and Reasoning over 40K Images, Videos, and Documents

Paper • 2603.05697 • Published Mar 5

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published Apr 30 • 92

upvoted a paper about 2 months ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published Apr 30 • 92

upvoted a paper 4 months ago

From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors

Paper • 2602.21778 • Published Feb 25 • 15

authored a paper 5 months ago

XR: Cross-Modal Agents for Composed Image Retrieval

Paper • 2601.14245 • Published Jan 20 • 8

upvoted a paper 5 months ago

XR: Cross-Modal Agents for Composed Image Retrieval

Paper • 2601.14245 • Published Jan 20 • 8

submitted a paper to Daily Papers 5 months ago

XR: Cross-Modal Agents for Composed Image Retrieval

Paper • 2601.14245 • Published Jan 20 • 8

authored a paper 5 months ago

InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration

Paper • 2512.02981 • Published Dec 2, 2025 • 2

upvoted a paper 5 months ago

InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration

Paper • 2512.02981 • Published Dec 2, 2025 • 2

authored a paper 7 months ago

Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models

Paper • 2512.01949 • Published Dec 1, 2025 • 9

upvoted a paper 7 months ago

Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models

Paper • 2512.01949 • Published Dec 1, 2025 • 9

commented a paper 7 months ago

Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models

Paper • 2512.01949 • Published Dec 1, 2025 • 9 •

upvoted 2 papers 7 months ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published Nov 25, 2025 • 188

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 96

upvoted a paper 11 months ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19, 2025 • 137

authored a paper about 1 year ago

WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation

Paper • 2503.19065 • Published Mar 24, 2025 • 11

Zhongyu Yang

AI & ML interests

Recent Activity

Organizations

yzzyu's activity