Fanheng Kong

friedrichor

https://friedrichor.github.io/

AI & ML interests

Multimodal LLM, LLM, Vibe Coding

upvoted a collection 8 months ago

GVE

Towards General Video Embeddings: Models and Benchmarks • 4 items • Updated Nov 3, 2025 • 20

upvoted a paper 8 months ago

Open Multimodal Retrieval-Augmented Factual Image Generation

Paper • 2510.22521 • Published Oct 26, 2025 • 31

upvoted a paper 9 months ago

STICKERCONV: Generating Multimodal Empathetic Responses from Scratch

Paper • 2402.01679 • Published Jan 20, 2024 • 2

upvoted 2 collections about 1 year ago

MMaDA Series

4 items • Updated Nov 14, 2025 • 8

UNITE

Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval • 7 items • Updated May 29, 2025 • 3

upvoted a paper about 1 year ago

Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

Paper • 2505.19650 • Published May 26, 2025 • 5

upvoted 2 collections over 1 year ago

MegaPairs

8 items • Updated Feb 4 • 15

X2I Dataset

Datasets used in OmniGen. • 5 items • Updated Jul 5, 2025 • 19