Shuhai, Peng

psh24

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 3 months ago

3ViewSense: Spatial and Mental Perspective Reasoning from Orthographic Views in Vision-Language Models

Paper • 2603.07751 • Published Mar 8 • 12

upvoted a paper 10 months ago

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Paper • 2509.08519 • Published Sep 10, 2025 • 130