Artanic30's Collections

Wiki-R1

updated 2 days ago

This collection accompanies the ICLR 2026 paper Wiki-R1. Project page: https://artanic30.github.io/project_pages/WikiR1/

  • Wiki-R1: Incentivizing Multimodal Reasoning for Knowledge-based VQA via Data and Sampling Curriculum

    Paper • 2603.05256 • Published Mar 5

  • Artanic30/Wiki_R1_Train

    Viewer • Updated 6 days ago • 40k • 35

  • Artanic30/Wiki-R1-3B

    4B • Updated 1 day ago • 160

  • Artanic30/Wiki-R1-7B

    8B • Updated 1 day ago • 219