Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
zhangyunfeng's picture
5 28

zhangyunfeng

yunfeng
·
  • yunfengsay

AI & ML interests

None yet

Organizations

None yet

upvoted an article about 1 year ago
view article
Article

seemore: Implement a Vision Language Model from Scratch

Jun 23, 2024
•
104
upvoted a collection about 1 year ago

Multimodal RAG

Collection
10 items • Updated Sep 5, 2024 • 30
upvoted a collection over 1 year ago

MGM

Collection
Official model collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" • 13 items • Updated May 3, 2024 • 47
upvoted a collection almost 2 years ago

From screenshots to HTML

Collection
WebSight is a dataset of 823,000 HTML/CSS codes representing synthetically generated English websites, each accompanied by a corresponding screenshot. • 4 items • Updated Apr 15, 2024 • 22
upvoted a paper almost 2 years ago

Lumiere: A Space-Time Diffusion Model for Video Generation

Paper • 2401.12945 • Published Jan 23, 2024 • 86
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs