M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models https://arxiv.org/abs/2504.10449
Junxiong Wang PRO
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Organizations
models 51
JunxiongWang/M1-3B
Text Generation • 3B • Updated • 9 • 2
JunxiongWang/M1-3B-SFT
Text Generation • 3B • Updated • 8 • 1
JunxiongWang/MambaInLlama1B_SFT_MATH
1B • Updated • 2
JunxiongWang/MambaInLlama3B_SFT_MATH
3B • Updated • 2
JunxiongWang/MambaInLlama3B_DPO2
3B • Updated • 5
JunxiongWang/MambaInLlama3B_DPO1
3B • Updated • 2
JunxiongWang/MambaInLlama3B_Distill_MATH
3B • Updated • 3
JunxiongWang/MambaInLlama3B_v3
3B • Updated • 1
JunxiongWang/MambaInLlama1B_Distill_MATH
1B • Updated • 3
JunxiongWang/mamba_0_5_distill
Updated • 2
datasets 20
JunxiongWang/QwenFineMATH
Viewer • Updated • 6.71M • 53
JunxiongWang/R1_GR_SFT
Viewer • Updated • 44k • 16
JunxiongWang/R1_SFT
Updated • 94
JunxiongWang/R1_Sythetic_SFT
Viewer • Updated • 1M • 214
JunxiongWang/MATH_SFT
Viewer • Updated • 19.1M • 89
JunxiongWang/R1_OpenThoughts_SFT
Viewer • Updated • 862k • 96
JunxiongWang/R1_am_SFT
Viewer • Updated • 1.4M • 93
JunxiongWang/qwen1b_it_math
Viewer • Updated • 19.1M • 85
JunxiongWang/test_math
Viewer • Updated • 89.1k • 109
JunxiongWang/FineMathV4
Viewer • Updated • 6.7M • 184