Bomin Wei's picture

5 6

Bomin Wei

Deiweiwei

·

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 5 months ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published Jan 23 • 40

upvoted a paper 7 months ago

Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens

Paper • 2511.19418 • Published Nov 24, 2025 • 29

upvoted a collection 7 months ago

CoVT: Chain-of-Visual-Thought

Enrich VLMs’ vision-centric reasoning capabilities via Chain-of-Visual-Thought! • 7 items • Updated Nov 25, 2025 • 6

upvoted a collection 8 months ago

Qwen3-VL

37 items • Updated Dec 31, 2025 • 747

upvoted a paper 10 months ago

Reconstruction Alignment Improves Unified Multimodal Models

Paper • 2509.07295 • Published Sep 8, 2025 • 40