GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published 8 days ago • 97
UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards Paper • 2604.14967 • Published 21 days ago • 15