Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Fanheng Kong's picture
20 8 44

Fanheng Kong

friedrichor
zhikangzhang's profile picture 21world's profile picture hahuy2004's profile picture
·
https://friedrichor.github.io/
  • friedrichor

AI & ML interests

Multimodal LLM, LLM, Vibe Coding

Organizations

NEU-Datamining's profile picture

upvoted a collection 4 months ago

GVE

Collection
Towards General Video Embeddings: Models and Benchmarks • 4 items • Updated Nov 3, 2025 • 20
upvoted 2 papers 5 months ago

Open Multimodal Retrieval-Augmented Factual Image Generation

Paper • 2510.22521 • Published Oct 26, 2025 • 31

STICKERCONV: Generating Multimodal Empathetic Responses from Scratch

Paper • 2402.01679 • Published Jan 20, 2024 • 2
upvoted a collection 9 months ago

MMaDA Series

Collection
4 items • Updated Nov 14, 2025 • 8
upvoted a collection 10 months ago

UNITE

Collection
Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval • 7 items • Updated May 29, 2025 • 3
upvoted a paper 10 months ago

Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

Paper • 2505.19650 • Published May 26, 2025 • 5
upvoted a collection 12 months ago

MegaPairs

Collection
8 items • Updated Feb 4 • 13
upvoted a collection about 1 year ago

X2I Dataset

Collection
Datasets used in OmniGen. • 5 items • Updated Jul 5, 2025 • 19
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs