Wiki-R1: Incentivizing Multimodal Reasoning for Knowledge-based VQA via Data and Sampling Curriculum
Paper • 2603.05256 • Published
This collection accompanies the ICLR 2026 paper Wiki-R1. Project page: https://artanic30.github.io/project_pages/WikiR1/