Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 10 days ago • 88
DREAM: Where Visual Understanding Meets Text-to-Image Generation Paper • 2603.02667 • Published 10 days ago • 4
UniG2U-Bench: Do Unified Models Advance Multimodal Understanding? Paper • 2603.03241 • Published 10 days ago • 81
Running on Zero Featured 171 ReconViaGen 🖥 171 High-fidelity 3D Geometry Generation from multi-view images