Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yifan Zhang's picture
3 4 7

Yifan Zhang

CyanTransformer
crashidian's profile picture
·
  • BlueSocksFF

AI & ML interests

None yet

Organizations

Purdue Research Center for Open Digital Innovation's profile picture

upvoted 2 papers 7 months ago

Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better

Paper • 2506.09040 • Published Jun 10 • 34

Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps

Paper • 2505.18675 • Published May 24 • 26
upvoted 2 papers 8 months ago

Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs

Paper • 2504.17432 • Published Apr 24 • 40

Decoupled Global-Local Alignment for Improving Compositional Understanding

Paper • 2504.16801 • Published Apr 23 • 14
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs