Images in Sentences: Scaling Interleaved Instructions for Unified Visual Generation Paper • 2605.12305 • Published May 12 • 2
RefineAnything: Multimodal Region-Specific Refinement for Perfect Local Details Paper • 2604.06870 • Published Apr 8 • 44
Running on Zero MCP 3.44k Z Image Turbo 🖼 3.44k Generate vivid images from your text prompts instantly
docling-project/SmolDocling-256M-preview Image-Text-to-Text • 0.3B • Updated Sep 17, 2025 • 27.2k • 1.61k