Data of Paper "Think Natively: Unlocking Multilingual Reasoning with Consistency-Enhanced Reinforcement Learning" (https://arxiv.org/pdf/2510.07300)
Xue Zhang
XueZhang-bjtu
AI & ML interests
None yet
Organizations
None yet
M-Thinker-Data
Data of Paper "Think Natively: Unlocking Multilingual Reasoning with Consistency-Enhanced Reinforcement Learning" (https://arxiv.org/pdf/2510.07300)
M-Thinker
Models of Paper "Think Natively: Unlocking Multilingual Reasoning with Consistency-Enhanced Reinforcement Learning" (https://arxiv.org/pdf/2510.07300)
models 7
XueZhang-bjtu/M-Thinker-7B-Iter2
Text Generation • 8B • Updated • 7
XueZhang-bjtu/Native-RL
Updated
XueZhang-bjtu/7B-cold-start-SFT
Text Generation • 8B • Updated
XueZhang-bjtu/1.5B-cold-start-SFT
Text Generation • 2B • Updated • 120 •
XueZhang-bjtu/M-Thinker-1.5B-Iter2
Text Generation • 2B • Updated • 79 •
XueZhang-bjtu/M-Thinker-7B-Iter1
Text Generation • 8B • Updated • 4
XueZhang-bjtu/M-Thinker-1.5B-Iter1
Text Generation • 2B • Updated • 73
datasets 6
XueZhang-bjtu/Light-R1-SFTData-question-translated-76K
Viewer • Updated • 151k • 19
XueZhang-bjtu/M-Thinker-7B-RL-Iter2-data
Viewer • Updated • 15.1k • 3
XueZhang-bjtu/M-Thinker-7B-RL-Iter1-data
Viewer • Updated • 15.1k • 6
XueZhang-bjtu/M-Thinker-1.5B-RL-Iter2-data
Viewer • Updated • 15.1k • 5
XueZhang-bjtu/M-Thinker-1.5B-RL-Iter1-data
Viewer • Updated • 15.1k • 2
XueZhang-bjtu/M-Thinker-SFT-data
Viewer • Updated • 20.1k • 17