TAUR-dev/D-EVAL__standard_eval_v3__FinEval_RL_rlonly-eval_rl
• 12.5k • 6
TAUR-dev/D-ExpTracker__BASELINE_r1_distillation__v1
• 98 • 8
TAUR-dev/D-ExpTracker__0921__zayne1_alltask1_grpo_resume__v1
• 5 • 6
TAUR-dev/D-ExpTracker__0921__zayne1_alltask2_grpo_resume__v1
• 5 • 6
TAUR-dev/D-SFT_C-BASELINE_r1_distillation-sft-data
• 4k • 7
TAUR-dev/SFT_C__BASELINE_R1_distillation_sft_data
• 4k • 7
TAUR-dev/r1_outputs_for_countdown_with_3arguments
• 4k • 7
TAUR-dev/D-reflection_accuracy_commonsenseQA_longmult_3dig_10ex
• 151 • 13
TAUR-dev/D-ExpTracker__testing_new_setup__v1
• 18 • 5
TAUR-dev/D-ExpTracker__RC_VarFix_bolt_baseline_all_tasks__v1
• 3.3k • 6
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_bolt_baseline_all_tasks-eval_rl
• 3.29k • 5
TAUR-dev/D-ExpTracker__test_scratch__v1
• 68 • 25
TAUR-dev/skillfactory-pvv2-sft-llama_reflections5_formats-C_full
• 50 • 5
TAUR-dev/D-verification_bf_commonsenseQA_longmult_3dig_10ex
• 76 • 5
TAUR-dev/trace_analysis_2
• 288 • 6
Viewer
• Updated
• 288 • 5
TAUR-dev/D-reflection_accuracy_commonsenseQA_longmult_3dig_100ex
• 653 • 6
TAUR-dev/zs_D-reflection_accuracy_commonsenseQA_longmult_3dig_10ex
• 20 • 5
TAUR-dev/D-ExpTracker__0921__0epoch_alltask2_grpo__v1
• 6 • 6
TAUR-dev/D-ExpTracker__bolt_gpt4o_baseline__v1
• 6 • 6
TAUR-dev/D-test_reflection_bf_RC_VarFix_pv_v2_all_tasks_countdown_3arg_4_16_last
• 400 • 6
TAUR-dev/D-test_reflection_bf_RC_VarFix_pv_v2_all_tasks_countdown_3arg_16_4
• 400 • 6
TAUR-dev/D-test_reflection_bf_RC_VarFix_pv_v2_all_tasks_countdown_5arg_16_4
• 400 • 5
TAUR-dev/D-ExpTracker__0921__pv2_CT3and4arg_grpo__v1
• 5 • 5
TAUR-dev/D-test_reflection_bf_RC_VarFix_low_qual_all_tasks_countdown_3arg_16_4
• 400 • 6
TAUR-dev/D-test_reflection_bf_RC_VarFix_pv_v2_all_tasks_letter_countdown_4o_16_4
• 400 • 6
TAUR-dev/D-test_reflection_bf_RC_VarFix_low_qual_all_tasks_countdown_4arg_16_4
• 400 • 6
TAUR-dev/D-test_reflection_bf_RC_VarFix_pv_v2_all_tasks_countdown_4arg_16_4
• 400 • 6
TAUR-dev/D-test_reflection_bf_RC_VarFix_pv_v2_all_tasks_letter_countdown_5o_16_4
• 400 • 6
TAUR-dev/D-test_reflection_bf_RC_VarFix_low_qual_all_tasks_letter_countdown_4o_16_4
• 400 • 6