Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning Paper • 2601.14750 • Published 25 days ago • 17
TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning Paper • 2508.08098 • Published Aug 11, 2025
U-MARVEL: Unveiling Key Factors for Universal Multimodal Retrieval via Embedding Learning with MLLMs Paper • 2507.14902 • Published Jul 20, 2025 • 1
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset Paper • 2507.21033 • Published Jul 28, 2025 • 23