Lang2Act: Fine-Grained Visual Reasoning through Self-Emergent Linguistic Toolchains Paper • 2602.13235 • Published 22 days ago • 2
Lang2Act: Fine-Grained Visual Reasoning through Self-Emergent Linguistic Toolchains Paper • 2602.13235 • Published 22 days ago • 2
M2IO-R1: An Efficient RL-Enhanced Reasoning Framework for Multimodal Retrieval Augmented Multimodal Generation Paper • 2508.06328 • Published Aug 8, 2025 • 1