ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_stepwise_dpo_chunk_17 Viewer • Updated Jun 30, 2025 • 800 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_stepwise_dpo_chunk_3 Viewer • Updated Jun 30, 2025 • 800 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_stepwise_dpo_chunk_15 Viewer • Updated Jun 30, 2025 • 800 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_stepwise_dpo_chunk_6 Viewer • Updated Jun 30, 2025 • 800 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_stepwise_dpo_chunk_20 Viewer • Updated Jun 30, 2025 • 18 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_dpo_val_chunk_11 Viewer • Updated Jun 30, 2025 • 21 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_dpo_val_chunk_2 Viewer • Updated Jun 30, 2025 • 29 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_dpo_val_chunk_1 Viewer • Updated Jun 30, 2025 • 23 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_dpo_train_chunk_11 Viewer • Updated Jun 30, 2025 • 570 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwq-32b_dpo_val_chunk_31 Viewer • Updated Jun 30, 2025 • 18 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwq-32b_dpo_train_chunk_31 Viewer • Updated Jun 30, 2025 • 37 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwq-32b_dpo_val_chunk_37 Viewer • Updated Jun 30, 2025 • 19 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwq-32b_dpo_val_chunk_3 Viewer • Updated Jun 30, 2025 • 35 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwq-32b_dpo_val_chunk_33 Viewer • Updated Jun 30, 2025 • 22 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwq-32b_dpo_val_chunk_35 Viewer • Updated Jun 30, 2025 • 46 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwq-32b_dpo_val_chunk_36 Viewer • Updated Jun 30, 2025 • 44 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwq-32b_dpo_val_chunk_34 Viewer • Updated Jun 30, 2025 • 23 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_dpo_val_chunk_31 Viewer • Updated Jun 30, 2025 • 15 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwq-32b_dpo_val_chunk_32 Viewer • Updated Jun 30, 2025 • 19 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_dpo_train_chunk_31 Viewer • Updated Jun 30, 2025 • 37 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwq-32b_dpo_train_chunk_3 Viewer • Updated Jun 30, 2025 • 533 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_dpo_val_chunk_33 Viewer • Updated Jun 30, 2025 • 22 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_dpo_val_chunk_36 Viewer • Updated Jun 30, 2025 • 40 • 4
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_dpo_val_chunk_35 Viewer • Updated Jun 30, 2025 • 49 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_dpo_val_chunk_37 Viewer • Updated Jun 30, 2025 • 10 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_dpo_val_chunk_34 Viewer • Updated Jun 30, 2025 • 22 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwen3-32b_dpo_val_chunk_32 Viewer • Updated Jun 30, 2025 • 17 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwq-32b_dpo_val_chunk_6 Viewer • Updated Jun 30, 2025 • 35 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwq-32b_dpo_val_chunk_25 Viewer • Updated Jun 30, 2025 • 19 • 3
ZixuanKe/cfa_extracted_exercise_sup_sample_from_policy_v1.1_genrm_qwq-32b_dpo_val_chunk_2 Viewer • Updated Jun 30, 2025 • 26 • 3