Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Bomin Wei's picture
5 6

Bomin Wei

Deiweiwei
sanaka87's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 21 hours ago
VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents
upvoted a paper 2 months ago
Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens
liked a dataset 2 months ago
Wakals/CoVT-Dataset
View all activity

Organizations

None yet

upvoted a paper about 21 hours ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published 4 days ago • 29
upvoted a paper 2 months ago

Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens

Paper • 2511.19418 • Published Nov 24, 2025 • 29
upvoted a collection 2 months ago

CoVT: Chain-of-Visual-Thought

Collection
Enrich VLMs’ vision-centric reasoning capabilities via Chain-of-Visual-Thought! • 7 items • Updated Nov 25, 2025 • 6
upvoted a collection 3 months ago

Qwen3-VL

Collection
37 items • Updated 27 days ago • 605
upvoted a paper 5 months ago

Reconstruction Alignment Improves Unified Multimodal Models

Paper • 2509.07295 • Published Sep 8, 2025 • 40
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs