7 34 27

Zuhao Yang

mwxely

https://mwxely.github.io/

AI & ML interests

Large Multimodal Models

Recent Activity

updated a dataset 21 days ago

ParaVT/ParaVT-Source

updated a dataset 21 days ago

ParaVT/ParaVT-Parquet

updated a model 21 days ago

ParaVT/ParaVT-8B

View all activity

Organizations

updated 2 datasets 21 days ago

ParaVT/ParaVT-Source

Updated 21 days ago • 1.64k • 2

ParaVT/ParaVT-Parquet

Viewer • Updated 21 days ago • 101k • 333 • 3

updated a model 21 days ago

ParaVT/ParaVT-8B

Video-Text-to-Text • 9B • Updated 21 days ago • 468 • 4

liked a Space 28 days ago

ParaVT

🎬

Parallel Video Tool Calling with Multi-Agent RL

updated a Space 28 days ago

ParaVT

🎬

Parallel Video Tool Calling with Multi-Agent RL

published a Space 28 days ago

ParaVT

🎬

Parallel Video Tool Calling with Multi-Agent RL

upvoted a paper 28 days ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Paper • 2605.20342 • Published May 19 • 34

submitted a paper to Daily Papers 28 days ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Paper • 2605.20342 • Published May 19 • 34

authored 2 papers about 1 month ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published Mar 16 • 187

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Paper • 2605.20342 • Published May 19 • 34

upvoted a paper about 1 month ago

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published May 19 • 137

liked a model about 1 month ago

ParaVT/ParaVT-8B

Video-Text-to-Text • 9B • Updated 21 days ago • 468 • 4

liked 2 datasets about 1 month ago

ParaVT/ParaVT-Source

Updated 21 days ago • 1.64k • 2

ParaVT/ParaVT-Parquet

Viewer • Updated 21 days ago • 101k • 333 • 3

published 2 datasets about 1 month ago

ParaVT/ParaVT-Source

Updated 21 days ago • 1.64k • 2

ParaVT/ParaVT-Parquet

Viewer • Updated 21 days ago • 101k • 333 • 3

published a model about 1 month ago

ParaVT/ParaVT-8B

Video-Text-to-Text • 9B • Updated 21 days ago • 468 • 4

authored 2 papers about 1 month ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published May 1 • 49

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published May 11 • 29

upvoted a paper about 1 month ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published May 11 • 29

Zuhao Yang

AI & ML interests

Recent Activity

Organizations

mwxely's activity

ParaVT

ParaVT

ParaVT