divelab/combined_gsm8k_math_dataset_dapo_math_17k_Qwen3-4B_ntokens2048_sft Viewer • Updated 10 days ago • 186k • 43
jacob-helwig/SDAR-1.7B-Chat_kd1epoch_Qwen3-4B-Instruct-2507-GRPO-MATH-1024 2B • Updated 22 days ago • 17
jacob-helwig/Qwen2.5-0.5B-Instruct_countdown2345_grpo_balanced_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600 Updated Sep 13, 2025
jacob-helwig/Qwen2.5-1.5B-Instruct_countdown2345_grpo_balanced_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600 Updated Sep 13, 2025
jacob-helwig/Qwen2.5-7B-Instruct_countdown2345_grpo_balanced_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600 Updated Sep 13, 2025
jacob-helwig/dive7_Qwen2.5-3B-Instruct_countdown2345_grpo_balanced_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600 Text Generation • 242k • Updated Sep 13, 2025
jacob-helwig/dive7_Qwen2.5-1.5B-Instruct_countdown2345_grpo_balanced_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600 Updated Sep 12, 2025
jacob-helwig/Qwen2.5-3B-Instruct_countdown2345_grpo_balanced_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600 Updated Sep 12, 2025
ShockCast Collection Temporally-adaptive datasets from "A Two-Phase Deep Learning Framework for Adaptive Time-Stepping in High-Speed Flow Modeling" • 2 items • Updated Jun 7, 2025