TAUR-dev/BF16kEval_FinEval_RL_rlonly-eval_rl_countdown_4arg
Viewer
• Updated • 4k • 3
TAUR-dev/BF16kEval_FinEval_RL_R1_distill-fixed_countdown_3arg
Viewer
• Updated • 4k • 2
TAUR-dev/BF16kEval_FinEval_RL_sf_ours_pvv2-eval_rl_countdown_3arg
Viewer
• Updated • 4k • 3
TAUR-dev/BF16kEval_FinEval_RL_rlonly-eval_rl_countdown_3arg
Viewer
• Updated • 4k • 1
TAUR-dev/D-EVAL__standard_eval_v3__Fin16kEval_SFT_r1_translated-countdown_5arg-eval_sft
Viewer
• Updated • 1k • 1
TAUR-dev/D-ExpTracker__Fin16kEval_SFT_r1_translated-countdown_5arg__v1
Viewer
• Updated • 2 • 5
TAUR-dev/D-ExpTracker__Fin16kEval_SFT_r1_translated-countdown_4arg__v1
Viewer
• Updated • 1k • 3
TAUR-dev/D-EVAL__standard_eval_v3__Fin16kEval_SFT_r1_translated-countdown_4arg-eval_sft
Viewer
• Updated • 1k • 5
TAUR-dev/D-ExpTracker__Fin16kEval_SFT_r1_translated-countdown_3arg__v1
Viewer
• Updated • 1.01k • 4
TAUR-dev/D-EVAL__standard_eval_v3__Fin16kEval_SFT_r1_translated-countdown_3arg-eval_sft
Viewer
• Updated • 1k • 3
TAUR-dev/D-ExpTracker__r1_translated_BASELINE__v1
Viewer
• Updated • 4.01k • 1
TAUR-dev/D-SFT_C-r1_translated_BASELINE-sft-data
Viewer
• Updated • 4k • 1
TAUR-dev/SFT_C-cd3args_r1_translated_baseline
Viewer
• Updated • 4k • 1
TAUR-dev/r1_outputs_cd3arg_translator_gpt5_real_verbatim
Viewer
• Updated • 4k • 28
TAUR-dev/r1_outputs_cd3arg_translator_gpt5_verbatim
Viewer
• Updated • 10 • 17
TAUR-dev/r1_outputs_cd3arg_translator_gpt5-mini_verbatim
Viewer
• Updated • 10 • 1
TAUR-dev/r1_outputs_cd3arg_translator_gpt5_structured
Viewer
• Updated • 10 • 22
TAUR-dev/r1_outputs_cd3arg_translator_gpt5-mini_structured
Viewer
• Updated • 10 • 3
TAUR-dev/r1_outputs_cd3arg_translator_gpt5
Viewer
• Updated • 10 • 19
TAUR-dev/r1_outputs_cd3arg_translator_gpt5-mini
Viewer
• Updated • 10 • 1
TAUR-dev/r1_outputs_cd3arg_translator_gpt5mini_structured
Viewer
• Updated • 20 • 1
TAUR-dev/r1_outputs_cd3arg_translator_gpt5mini_verbatim
Viewer
• Updated • 20 • 3
TAUR-dev/r1_outputs_cd3arg_translator_gpt5mini
Viewer
• Updated • 20 • 1
TAUR-dev/r1_outputs_cd3arg_translator_qwen
Viewer
• Updated • 4k • 3
TAUR-dev/r1_outputs_cd3arg_translator_qwen2.5-1.5B-I
Viewer
• Updated • 4k • 1
TAUR-dev/D-ExpTracker__rl_rlonly_AT_fixed__v1
Viewer
• Updated • 17 • 8
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval__3args_basemodel-eval_0
Viewer
• Updated • 11.5k • 1
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval__3args_ours_sft-eval_sft
Viewer
• Updated • 11.5k • 1
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval__3args_r1-eval_0
Viewer
• Updated • 11.5k • 3
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval__3args_bolt-eval_rl
Viewer
• Updated • 11.5k • 1