MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published Nov 14 • 164
Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models Paper • 2410.02416 • Published Oct 3, 2024 • 34
ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement Paper • 2504.01934 • Published Apr 2 • 22
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation Paper • 2403.08857 • Published Mar 13, 2024 • 3
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding Paper • 2405.08748 • Published May 14, 2024 • 23
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining Paper • 2303.02489 • Published Mar 4, 2023