TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_SFT_ourstruct_ss-eval_sft
• 3.29k • 7
TAUR-dev/D-ExpTracker__RC_VarFix_SFT_qwen__v1
• 3.3k • 7
TAUR-dev/sft_letter_countdown_4o_1000_end1400
• 2.8k • 4
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_SFT_qwen-eval_0
• 3.29k • 6
TAUR-dev/sft_letter_countdown_4o_300_end700
• 2.8k • 5
TAUR-dev/sft_acronym_4o_300_end600
• 2.1k • 6
TAUR-dev/sft_letter_countdown_4o_1400_end1700
• 2.1k • 5
TAUR-dev/sft_letter_countdown_4o_700_end1000
• 2.1k • 4
TAUR-dev/sft_letter_countdown_4o_0_end300
• 2.1k • 6
TAUR-dev/sft_acronym_4o_0_end300
• 2.1k • 5
TAUR-dev/sft_acronym_5o_2500_end2700
• 1.4k • 5
TAUR-dev/D-ExpTracker__RC_VarFix_sample_only_corrects_all_tasks__v1
• 3.3k • 7
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_sample_only_corrects_all_tasks-eval_rl
• 3.29k • 7
TAUR-dev/D-ExpTracker__RC_VarFix_all_samples_all_tasks__v1
• 3.3k • 7
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_all_samples_all_tasks-eval_rl
• 3.29k • 7
TAUR-dev/D-ExpTracker__RC_VarFix_bon_corrects_all_tasks__v1
• 3.3k • 6
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_bon_corrects_all_tasks-eval_rl
• 3.29k • 5
TAUR-dev/D-ExpTracker__RC_VarFix_no_reflects_all_tasks__v1
• 3.3k • 6
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_no_reflects_all_tasks-eval_rl
• 3.29k • 6
TAUR-dev/D-ExpTracker__RC_VarFix_random_3args_all_tasks__v1
• 2 • 6
TAUR-dev/D-ExpTracker__RC_VarFix_origonlyprompts_all_tasks__v1
• 3.3k • 7
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_origonlyprompts_all_tasks-eval_rl
• 3.29k • 7
TAUR-dev/D-ExpTracker__RC_VarFix_rlonly_all_tasks__v1
• 3.3k • 7
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_rlonly_all_tasks-eval_rl
• 3.29k • 7
TAUR-dev/D-ExpTracker__RC_VarFix_rlonly_GDY_4argsonly__v1
• 254 • 6
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_rlonly_GDY_4argsonly-eval_rl
• 250 • 6
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_pv_v2_all_tasks-eval_rl
• 3.29k • 6
TAUR-dev/D-ExpTracker__RC_VarFix_pv_v2_GDY_4argsonly__v1
• 254 • 5
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_pv_v2_GDY_4argsonly-eval_rl
• 250 • 6
TAUR-dev/D-ExpTracker__RC_VarFix_rl_only_5argsonly__v1
• 254 • 6