JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_response_length_1024 Viewer • Updated Dec 17, 2025 • 5k • 9
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_response_length_2048 Viewer • Updated Dec 17, 2025 • 5k • 9
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_response_length_4096 Viewer • Updated Dec 17, 2025 • 5k • 8
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_0_3_response_length_1024 Viewer • Updated Dec 17, 2025 • 2.5k • 8
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_0_3_response_length_2048 Viewer • Updated Dec 17, 2025 • 2.5k • 8
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_0_3_response_length_4096 Viewer • Updated Dec 17, 2025 • 2.5k • 9
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_0_3_response_length_8192 Viewer • Updated Dec 17, 2025 • 2.5k • 8
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_0_3 Viewer • Updated Dec 17, 2025 • 2.5k • 11
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7_2_random Viewer • Updated Dec 17, 2025 • 2.5k • 11
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7_4_random Viewer • Updated Dec 17, 2025 • 2.5k • 11
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7_32 Viewer • Updated Dec 17, 2025 • 2.5k • 10
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7_8 Viewer • Updated Dec 16, 2025 • 2.5k • 11
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7_16 Viewer • Updated Dec 16, 2025 • 2.5k • 11
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7_16_random Viewer • Updated Dec 16, 2025 • 2.5k • 9
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7 Viewer • Updated Dec 16, 2025 • 2.5k • 13
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7_10 Viewer • Updated Dec 16, 2025 • 10 • 13
JingweiNi/train_prm800k_qwen3_1.7b_thinking_5000_extracted_0 Viewer • Updated Dec 15, 2025 • 625 • 13