GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper β’ 2507.01006 β’ Published Jul 1, 2025 β’ 250