TAUR-dev/sft_acronym_5o_2700_end3000
Viewer
• Updated • 2.1k • 1
TAUR-dev/sft_letter_countdown_4o_2300_end2500
Viewer
• Updated • 1.4k • 1
TAUR-dev/D-ExpTracker__RC_VarFix_SFT_ourstruct_ss__v1
Viewer
• Updated • 3.3k • 3
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_SFT_ourstruct_ss-eval_sft
Viewer
• Updated • 3.29k • 1
TAUR-dev/D-ExpTracker__RC_VarFix_SFT_qwen__v1
Viewer
• Updated • 3.3k • 1
TAUR-dev/sft_letter_countdown_4o_1000_end1400
Viewer
• Updated • 2.8k • 1
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_SFT_qwen-eval_0
Viewer
• Updated • 3.29k • 1
TAUR-dev/sft_letter_countdown_4o_300_end700
Viewer
• Updated • 2.8k • 1
TAUR-dev/sft_acronym_4o_300_end600
Viewer
• Updated • 2.1k TAUR-dev/sft_letter_countdown_4o_1400_end1700
Viewer
• Updated • 2.1k • 1
TAUR-dev/sft_letter_countdown_4o_700_end1000
Viewer
• Updated • 2.1k • 1
TAUR-dev/sft_letter_countdown_4o_0_end300
Viewer
• Updated • 2.1k • 1
TAUR-dev/sft_acronym_4o_0_end300
Viewer
• Updated • 2.1k • 1
TAUR-dev/sft_acronym_5o_2500_end2700
Viewer
• Updated • 1.4k • 2
TAUR-dev/D-ExpTracker__RC_VarFix_sample_only_corrects_all_tasks__v1
Viewer
• Updated • 3.3k • 3
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_sample_only_corrects_all_tasks-eval_rl
Viewer
• Updated • 3.29k • 1
TAUR-dev/D-ExpTracker__RC_VarFix_all_samples_all_tasks__v1
Viewer
• Updated • 3.3k • 1
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_all_samples_all_tasks-eval_rl
Viewer
• Updated • 3.29k • 1
TAUR-dev/D-ExpTracker__RC_VarFix_bon_corrects_all_tasks__v1
Viewer
• Updated • 3.3k • 1
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_bon_corrects_all_tasks-eval_rl
Viewer
• Updated • 3.29k • 1
TAUR-dev/D-ExpTracker__RC_VarFix_no_reflects_all_tasks__v1
Viewer
• Updated • 3.3k • 1
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_no_reflects_all_tasks-eval_rl
Viewer
• Updated • 3.29k • 1
TAUR-dev/D-ExpTracker__RC_VarFix_random_3args_all_tasks__v1
Viewer
• Updated • 2 • 1
TAUR-dev/D-ExpTracker__RC_VarFix_origonlyprompts_all_tasks__v1
Viewer
• Updated • 3.3k • 1
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_origonlyprompts_all_tasks-eval_rl
Viewer
• Updated • 3.29k • 1
TAUR-dev/D-ExpTracker__RC_VarFix_rlonly_all_tasks__v1
Viewer
• Updated • 3.3k • 3
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_rlonly_all_tasks-eval_rl
Viewer
• Updated • 3.29k • 1
TAUR-dev/D-ExpTracker__RC_VarFix_rlonly_GDY_4argsonly__v1
Viewer
• Updated • 254 • 3
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_rlonly_GDY_4argsonly-eval_rl
Viewer
• Updated • 250 • 1
TAUR-dev/D-EVAL__standard_eval_v3__RC_VarFix_pv_v2_all_tasks-eval_rl
Viewer
• Updated • 3.29k