3 9 3

Yunze Man

yunzeman

YunzeMan

AI & ML interests

None yet

Recent Activity

liked a model 23 days ago

nvidia/LocateAnything-3B

authored a paper 28 days ago

Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding

authored a paper 28 days ago

RandAR: Decoder-only Autoregressive Visual Generation in Random Orders

View all activity

Organizations

liked a model 23 days ago

nvidia/LocateAnything-3B

Image-Text-to-Text • 4B • Updated 13 days ago • 408k • 2.36k

authored 10 papers 28 days ago

AgMMU: A Comprehensive Agricultural Multimodal Understanding and Reasoning Benchmark

Paper • 2504.10568 • Published Apr 14, 2025

Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought

Paper • 2505.23766 • Published May 29, 2025

PPTArena: A Benchmark for Agentic PowerPoint Editing

Paper • 2512.03042 • Published Dec 2, 2025 • 1

LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight

Paper • 2511.20648 • Published Nov 25, 2025 • 1

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published Jan 14 • 56

OSGym: Scalable Distributed Data Engine for Generalizable Computer Agents

Paper • 2511.11672 • Published Nov 11, 2025

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published about 1 month ago • 144

upvoted a paper 29 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published about 1 month ago • 144

liked a model 4 months ago

moonshotai/Kimi-K2.5

Image-Text-to-Text • 1.1T • Updated Apr 30 • 1.67M • • 2.83k

upvoted a paper 5 months ago

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published Jan 14 • 56

liked a model 12 months ago

BAAI/RoboBrain2.0-7B

Robotics • 8B • Updated Aug 7, 2025 • 227 • 124

updated a dataset about 1 year ago

AgMMU/AgMMU_v1

Viewer • Updated Jul 29, 2025 • 50.2k • 467 • 2

upvoted a paper about 1 year ago

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Paper • 2504.15271 • Published Apr 21, 2025 • 69

New activity in AgMMU/AgMMU_v1 about 1 year ago

Create README.md

#3 opened about 1 year ago by

ziqipang

updated a Space over 1 year ago

README

📊

published a Space over 1 year ago

README

📊

Yunze Man

AI & ML interests

Recent Activity

Organizations

yunzeman's activity

Create README.md

README

README