Imagination Helps Visual Reasoning, But Not Yet in Latent Space • Paper • 2602.22766 • Published 2 days ago • 34
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer • Paper • 2412.13871 • Published Dec 18, 2024 • 18