xd2010/DeepSeek-V2-Lite-aux-free-sft-math7k-2epoch-frozen-router Text Generation • 16B • Updated 3 days ago • 256
xd2010/Qwen1.5-MOE-sft-math7k-sft-2epochs-frozen-router Text Generation • 14B • Updated 3 days ago • 175
xd2010/OLMoE-1B-7B-0125-sft-math7k-2epochs-frozen-router Text Generation • 7B • Updated 3 days ago • 177
xd2010/DeepSeek-V2-Lite-aux-free-sft-math7k-2ndepoch-1e-4-gamma-1condenser Text Generation • 16B • Updated 3 days ago • 508
xd2010/DeepSeek-V2-Lite-aux-free-sft-math7k-2ndepoch-1e-4-gamma-4condenser Text Generation • 16B • Updated 3 days ago • 490
xd2010/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-part2-test Text Generation • 14B • Updated 9 days ago • 229
xd2010/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-part2 Text Generation • 14B • Updated 9 days ago • 250
xd2010/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-1epo Text Generation • 14B • Updated 10 days ago • 240
xd2010/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-1epoch Text Generation • 14B • Updated 10 days ago • 271
xd2010/gpt-oss-20b-math7k-1epoch-lr4e-5-1e-4-gamma-part2 Text Generation • 4.76M • Updated Nov 12, 2025