xd2010/DeepSeek-V2-Lite-aux-free-sft-math7k-2epoch-frozen-router • Text Generation • 16B • Updated 3 days ago • 258
xd2010/Qwen1.5-MOE-sft-math7k-sft-2epochs-frozen-router • Text Generation • 14B • Updated 3 days ago • 176
xd2010/OLMoE-1B-7B-0125-sft-math7k-2epochs-frozen-router • Text Generation • 7B • Updated 3 days ago • 178
xd2010/DeepSeek-V2-Lite-aux-free-sft-math7k-2ndepoch-1e-4-gamma-1condenser • Text Generation • 16B • Updated 4 days ago • 511
xd2010/DeepSeek-V2-Lite-aux-free-sft-math7k-2ndepoch-1e-4-gamma-4condenser • Text Generation • 16B • Updated 4 days ago • 492
xd2010/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-part2-test • Text Generation • 14B • Updated 10 days ago • 231
xd2010/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-part2 • Text Generation • 14B • Updated 10 days ago • 252
xd2010/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-1epo • Text Generation • 14B • Updated 11 days ago • 242
xd2010/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-1epoch • Text Generation • 14B • Updated 11 days ago • 273
xd2010/gpt-oss-20b-math7k-1epoch-lr4e-5-1e-4-gamma-part2 • Text Generation • 4.76M • Updated Nov 12, 2025 • 1
xd2010/gpt-oss-20b-math7k-1epoch-lr4e-5-1e-4-gamma • Text Generation • 4.76M • Updated Nov 10, 2025 • 1