Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Fanheng Kong's picture
20 8 44

Fanheng Kong

friedrichor
hahuy2004's profile picture zhikangzhang's profile picture rongjian's profile picture
·
https://friedrichor.github.io/
  • friedrichor

AI & ML interests

Multimodal LLM, LLM, Vibe Coding

Organizations

NEU-Datamining's profile picture

upvoted a collection 8 months ago

GVE

Collection
Towards General Video Embeddings: Models and Benchmarks • 4 items • Updated Nov 3, 2025 • 20
upvoted a paper 8 months ago

Open Multimodal Retrieval-Augmented Factual Image Generation

Paper • 2510.22521 • Published Oct 26, 2025 • 31
upvoted a paper 9 months ago

STICKERCONV: Generating Multimodal Empathetic Responses from Scratch

Paper • 2402.01679 • Published Jan 20, 2024 • 2
upvoted 2 collections about 1 year ago

MMaDA Series

Collection
4 items • Updated Nov 14, 2025 • 8

UNITE

Collection
Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval • 7 items • Updated May 29, 2025 • 3
upvoted a paper about 1 year ago

Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

Paper • 2505.19650 • Published May 26, 2025 • 5
upvoted 2 collections over 1 year ago

MegaPairs

Collection
8 items • Updated Feb 4 • 15

X2I Dataset

Collection
Datasets used in OmniGen. • 5 items • Updated Jul 5, 2025 • 19
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs