TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_BoLT-SFT-letter_countdown_4o-eval_sft
• 300 • 7
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_STaR-SFT-acronym_4o__v1
• 201 • 8
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_STaR-SFT-acronym_4o-eval_sft
• 197 • 7
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_BoLT-SFT-letter_countdown_5o__v1
• 304 • 9
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_STaR-SFT-acronym_5o__v1
• 148 • 10
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_BoLT-SFT-letter_countdown_5o-eval_sft
• 300 • 6
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_STaR-SFT-acronym_5o-eval_sft
• 144 • 7
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_STaR-SFT-longmult_5dig__v1
• 1k • 8
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_STaR-SFT-longmult_5dig-eval_sft
• 1k • 7
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_Random-RL-countdown_4arg__v1
• 1k • 8
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_Random-RL-countdown_4arg-eval_rl
• 1k • 7
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_BoLT-SFT-acronym_4o__v1
• 201 • 8
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_BoLT-SFT-acronym_4o-eval_sft
• 197 • 7
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_BoLT-SFT-acronym_5o__v1
• 148 • 10
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_BoLT-SFT-acronym_5o-eval_sft
• 144 • 7
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_STaR-SFT-longmult_4dig__v1
• 1k • 10
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_STaR-SFT-longmult_4dig-eval_sft
• 1k • 7
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_BoLT-SFT-longmult_5dig__v1
• 1k • 8
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_BoLT-SFT-longmult_5dig-eval_sft
• 1k • 7
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_STaR-SFT-longmult_3dig__v1
• 1k • 8
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_STaR-SFT-longmult_3dig-eval_sft
• 1k • 7
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_BoLT-SFT-longmult_4dig__v1
• 1k • 10
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_BoLT-SFT-longmult_4dig-eval_sft
• 1k • 7
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_STaR-SFT-longmult_2dig__v1
• 1k • 8
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_STaR-SFT-longmult_2dig-eval_sft
• 1k • 7
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_BoLT-SFT-longmult_3dig__v1
• 1k • 10
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_BoLT-SFT-longmult_3dig-eval_sft
• 1k • 8
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_STaR-SFT-gsm8k__v1
• 1.32k • 8
TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_STaR-SFT-gsm8k-eval_sft
• 1.32k • 7
TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_BoLT-SFT-longmult_2dig__v1
• 1k • 10