
Wensong Song

WensongSong

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago
Qwen/Qwen-Image-Edit-2511
liked a Space 3 months ago
akhaliq/sora-2
upvoted a paper 5 months ago
Visual Representation Alignment for Multimodal Large Language Models

Organizations

Zhejiang University

upvoted 6 papers 5 months ago

Visual Representation Alignment for Multimodal Large Language Models

Paper • 2509.07979 • Published Sep 9, 2025 • 84

Reconstruction Alignment Improves Unified Multimodal Models

Paper • 2509.07295 • Published Sep 8, 2025 • 40

F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions

Paper • 2509.06951 • Published Sep 8, 2025 • 32

Staying in the Sweet Spot: Responsive Reasoning Evolution via Capability-Adaptive Hint Scaffolding

Paper • 2509.06923 • Published Sep 8, 2025 • 22

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Paper • 2509.07969 • Published Sep 9, 2025 • 59

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 102