TAUR-dev/zaynes_dataset__gsm8k__best_of_n__scored__gpt4_1_annotated_v2
• Updated • 6.47k • 5
TAUR-dev/zaynes_dataset__commonsenseQA__best_of_n__scored__gpt4_1_annotated_v2
• Updated • 8.74k • 5
TAUR-dev/D-DATA-canonical_dataset_splits-v2-8_13_25
• Updated • 31.4k • 4
TAUR-dev/D-DATA-canonical_dataset_splits-v1-7_13_25__copy8_13_25
• Updated • 70.6k • 3
TAUR-dev/D-EVAL__standard_eval_v3__skills_in_rl-3-cd3arg_1e6_sft_cd3arg_rl-rl_eval-eval_rl
• Updated • 2.45k • 4
TAUR-dev/D-EVAL__standard_eval_v3__skills_in_rl-2-cd3arg_1e5_sft_all_rl-rl_eval-eval_rl
• Updated • 2.45k • 4
TAUR-dev/D-EVAL__standard_eval_v3__skills_in_rl-1-cd3arg_1e5_sft_cd3arg_rl-rl_eval-eval_rl
• Updated • 2.45k • 3
TAUR-dev/D-EVAL__standard_eval_v3__checking_evals-eval_0
• Updated • 2.45k • 5
TAUR-dev/D-EVAL__standard_eval_v3__skills_in_rl-3-cd3arg_1e6_sft-rl_eval-eval_rl
• Updated • 2.45k • 4
TAUR-dev/D-EVAL__standard_eval_v3__skills_in_rl_1e6_1epch_all_tasks_sft_zayne-eval_sft
• Updated • 2.45k • 4
TAUR-dev/D-SFT_C-skills_in_rl_1e6_1epch_all_tasks_sft_zayne-sft-data
• Updated • 38.4k • 4
TAUR-dev/D-EVAL__standard_eval_v3__skills_in_rl_1e5_1epch_all_tasks_sft_zayne-eval_sft
• Updated • 2.45k • 4
TAUR-dev/D-SFT_C-skills_in_rl_1e5_1epch_all_tasks_sft_zayne-sft-data
• Updated • 38.4k • 5
TAUR-dev/D-EVAL__standard_eval_v3__skills_in_rl_1e6_1epch_cd3arg_only_sft_zayne-eval_sft
• Updated • 2.45k • 4
TAUR-dev/D-SFT_C-skills_in_rl_1e6_1epch_cd3arg_only_sft_zayne-sft-data
• Updated • 7.62k • 4
TAUR-dev/D-EVAL__standard_eval_v3__skills_in_rl_1e5_1epch_cd3arg_only_sft_zayne-eval_sft
• Updated • 2.45k • 4
TAUR-dev/D-SFT_C-skills_in_rl_1e5_1epch_cd3arg_only_sft_zayne-sft-data
• Updated • 7.62k • 4
TAUR-dev/D-SFT_C-skills_in_rl_1e6_1epch_cd3arg_only_sft-sft-data
• Updated • 7.62k • 4
TAUR-dev/D-SFT_C-skills_in_rl_1e5_1epch_cd3arg_only_sft-sft-data
• Updated • 7.62k • 3
TAUR-dev/D-SFT-dataset__countdown_3arg__tagged_bon
• Updated • 7.62k • 4
TAUR-dev/D-SFT-dataset__commonsenseQA__tagged_bon
• Updated • 14.8k • 4
TAUR-dev/D-SFT-dataset__longmult_3dig__tagged_bon
• Updated • 5.71k • 4
TAUR-dev/D-SFT-dataset__gsm8k__tagged_bon
• Updated • 10.3k • 4
TAUR-dev/zaynes_dataset__commonsenseQA__best_of_n__scored__gpt4_1_annotated
• Updated • 8.74k • 5
TAUR-dev/zaynes_dataset__gsm8k__best_of_n__scored__gpt4_1_annotated
• Updated • 6.47k • 6
TAUR-dev/dataset__commonsenseQA__best_of_n__scored__gpt4_1_annotated
• Updated • 8.74k • 5
TAUR-dev/zaynes_dataset__longmult_3dig__best_of_n__scored__gpt4_1_annotated
• Updated • 4k • 5
TAUR-dev/D-EVAL__standard_eval_v3__check_eval-eval_rl
• Updated • 1.1k • 4
TAUR-dev/zaynes_dataset__longmult_3dig__best_of_n__scored
• Updated • 4k • 5
TAUR-dev/zaynes_dataset__longmult_3dig__best_of_n
• Updated • 4k • 4