zhangyunfeng

yunfeng · yunfengsay

AI & ML interests

None yet

Recent Activity

liked a model 8 months ago

moondream/moondream3-preview

liked a dataset 9 months ago

zed-industries/zeta

Organizations

None yet

upvoted a collection 2 days ago

jina-embeddings-v5-omni

Multimodal (text + image + video + audio) embedding models aligned with jina-embeddings-v5-text-*. Two sizes, four task variants each. • 27 items • Updated 2 days ago • 30

upvoted an article over 1 year ago

Article

seemore: Implement a Vision Language Model from Scratch

AviSoori1x • Jun 23, 2024 • 109

upvoted a collection over 1 year ago

Multimodal RAG

9 items • Updated Mar 2 • 31

upvoted 2 collections about 2 years ago

MGM

Official model collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" • 13 items • Updated May 3, 2024 • 47

From screenshots to HTML

WebSight is a dataset of 823,000 HTML/CSS codes representing synthetically generated English websites, each accompanied by a corresponding screenshot. • 4 items • Updated Apr 15, 2024 • 22

upvoted a paper over 2 years ago

Lumiere: A Space-Time Diffusion Model for Video Generation

Paper • 2401.12945 • Published Jan 23, 2024 • 86