JingweiNi/train_prm800k_qwen3_1.7b_thinking__extracted_0-3_14 Viewer • Updated Dec 20, 2025 • 182 • 1
JingweiNi/train_prm800k_qwen3_1.7b_thinking__extracted_0-3_15 Viewer • Updated Dec 20, 2025 • 182 • 1
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_response_length_1024 Viewer • Updated Dec 17, 2025 • 5k • 1
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_response_length_2048 Viewer • Updated Dec 17, 2025 • 5k • 3
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_response_length_4096 Viewer • Updated Dec 17, 2025 • 5k • 1
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_0_3_response_length_1024 Viewer • Updated Dec 17, 2025 • 2.5k • 1
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_0_3_response_length_2048 Viewer • Updated Dec 17, 2025 • 2.5k • 2
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_0_3_response_length_4096 Viewer • Updated Dec 17, 2025 • 2.5k • 1
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_0_3_response_length_8192 Viewer • Updated Dec 17, 2025 • 2.5k • 5
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_0_3 Viewer • Updated Dec 17, 2025 • 2.5k • 2
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7_2_random Viewer • Updated Dec 17, 2025 • 2.5k • 1
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7_4_random Viewer • Updated Dec 17, 2025 • 2.5k • 2
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7_32 Viewer • Updated Dec 17, 2025 • 2.5k • 1
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7_8 Viewer • Updated Dec 16, 2025 • 2.5k • 1
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7_16 Viewer • Updated Dec 16, 2025 • 2.5k • 1
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7_16_random Viewer • Updated Dec 16, 2025 • 2.5k • 1
JingweiNi/train_prm800k_gpt-oss-120b_annotated_qwen3_1.7b_thinking_5000_shards_4_7 Viewer • Updated Dec 16, 2025 • 2.5k • 3