ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning Paper • 2512.05111 • Published Dec 4, 2025 • 49 • 2
MM-IFEngine: Towards Multimodal Instruction Following Paper • 2504.07957 • Published Apr 10, 2025 • 35 • 2