Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images Paper • 2512.17306 • Published Dec 19, 2025 • 3
Ovis: Structural Embedding Alignment for Multimodal Large Language Model Paper • 2405.20797 • Published May 31, 2024 • 32