Pu Fanyi

pufanyi

https://pufanyi.github.io

AI & ML interests

CV

Recent Activity

liked a model about 12 hours ago

sensenova/SenseNova-U1-8B-MoT

upvoted a collection 11 days ago

liked a model 16 days ago

google/tipsv2-b14

authored a paper about 2 months ago

Demystifying Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 371

authored a paper 6 months ago

Scaling Spatial Intelligence with Multimodal Foundation Models

Paper • 2511.13719 • Published Nov 17, 2025 • 49

authored a paper over 1 year ago

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23, 2025 • 24

authored a paper almost 2 years ago

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Paper • 2407.12772 • Published Jul 17, 2024 • 35

authored a paper over 2 years ago

OtterHD: A High-Resolution Multi-modality Model

Paper • 2311.04219 • Published Nov 7, 2023 • 34

authored a paper almost 3 years ago

MIMIC-IT: Multi-Modal In-Context Instruction Tuning

Paper • 2306.05425 • Published Jun 8, 2023 • 12