XueZhang-bjtu's Collections

M-Thinker-Data

updated Oct 14, 2025

Data for the paper "Think Natively: Unlocking Multilingual Reasoning with Consistency-Enhanced Reinforcement Learning" (https://arxiv.org/pdf/2510.07300)

  • XueZhang-bjtu/M-Thinker-SFT-data

    Viewer • Updated Oct 13, 2025 • 20.1k • 2

  • XueZhang-bjtu/Light-R1-SFTData-question-translated-76K

    Viewer • Updated Oct 14, 2025 • 151k

  • XueZhang-bjtu/M-Thinker-1.5B-RL-Iter1-data

    Viewer • Updated Oct 14, 2025 • 15.1k • 2

  • XueZhang-bjtu/M-Thinker-1.5B-RL-Iter2-data

    Viewer • Updated Oct 14, 2025 • 15.1k • 2

  • XueZhang-bjtu/M-Thinker-7B-RL-Iter1-data

    Viewer • Updated Oct 14, 2025 • 15.1k • 2

  • XueZhang-bjtu/M-Thinker-7B-RL-Iter2-data

    Viewer • Updated Oct 14, 2025 • 15.1k • 2