TAUR-dev/D-EVAL__standard_eval_v3__RC_BF_ab-ss_our_structure-SFT-eval_sft
Viewer
• Updated • 3.29k • 1
Viewer
• Updated • 4.09k • 1
Viewer
• Updated • 782 • 1
Viewer
• Updated • 3.1k • 1
Viewer
• Updated • 926 • 1
TAUR-dev/D-EVAL__standard_eval_v3__eval_RC-ab_sft_bon_corr_samples-sft-eval_sft
Viewer
• Updated • 3.29k • 1
TAUR-dev/D-EVAL__standard_eval_v3__RC_ab-orig_only-SFT-eval_sft
Viewer
• Updated • 3.29k • 1
TAUR-dev/D-EVAL__standard_eval_v3__RC_ab-no_reflects-SFT-eval_sft
Viewer
• Updated • 3.29k • 1
TAUR-dev/D-EVAL__standard_eval_v3__RC_ab-random-SFT-eval_sft
Viewer
• Updated • 3.29k • 1
TAUR-dev/D-EVAL__standard_eval_v3__eval_RC-ab_sft_bon_all_samples-sft-eval_sft
Viewer
• Updated • 3.29k • 1
TAUR-dev/D-EVAL__standard_eval_v3__eval_sft_exp_1e_zayneprompts_v3_sft-eval_sft
Viewer
• Updated • 3.29k • 1
TAUR-dev/D-EVAL__standard_eval_v3__eval_RC_ab_sft_our_structure_single_sample-sft-eval_sft-eval_sft
Viewer
• Updated • 3.29k • 1
TAUR-dev/skillfactory-ablations__random_reflections5_formatsrandom
Viewer
• Updated • 14.7k • 3
TAUR-dev/skillfactory-ablations__orig_only_reflections5_formats-C_full
Viewer
• Updated • 3.01k • 1
TAUR-dev/skillfactory-ablations__no_reflections_reflections5_formatsno_reflection
Viewer
• Updated • 14.7k • 1
TAUR-dev/D-SFT_C-RC-ab_sft_bon_corr_samples-sft-data
Viewer
• Updated • 14.7k • 1
TAUR-dev/D-SFT_C-RC-ab_sft_bon_all_samples-sft-data
Viewer
• Updated • 73.6k • 1
TAUR-dev/D-SFT_C-skillfactory-ablations__orig_only_reflections5_formats-C_full-sft-data
Viewer
• Updated • 3.01k • 1
TAUR-dev/D-SFT_C-skillfactory-ablations__no_reflections_reflections5_formatsno_reflection-sft-data
Viewer
• Updated • 14.7k • 1
TAUR-dev/D-SFT_C-skillfactory-ablations__random_reflections5_formatsrandom-sft-data
Viewer
• Updated • 14.7k • 1
TAUR-dev/D-SFT_C-RC-ab_sft_our_structure_single_sample-sft-data
Viewer
• Updated • 8.08k • 1
TAUR-dev/D-EVAL__standard_eval_v3__new_tasks_eval__rl_baseline-eval_rl
Viewer
• Updated • 841 • 1
TAUR-dev/SFT_D-RC_ab-bon_tune_all_samples
Viewer
• Updated • 73.6k • 1
TAUR-dev/SFT_D-RC_ab-single_sample_our_structure
Viewer
• Updated • 8.08k • 1
TAUR-dev/SFT_D-RC_ab-bon_tune_corr_samples_only
Viewer
• Updated • 14.7k • 1
TAUR-dev/skillfactory-ablations__random_reflections3_formatsrandom
Viewer
• Updated • 2.93k • 2
TAUR-dev/skillfactory-ablations__random_reflections3_formatscleanup.py.experiments.new_datasets
Viewer
• Updated • 3
TAUR-dev/test_ablation_qrepeat1_reflections3_formats-C
Viewer
• Updated • 2.93k • 1
TAUR-dev/test_ablation_num_correct_1.1.2.2.3.2.3.4.3.4.5_num_incorrect_0.1.0.1.0.2.1.0.2.1.0
Viewer
• Updated • 6.32k • 1
TAUR-dev/D-EVAL__standard_eval_v3__eval_original_1e_rl_v2-eval_rl
Viewer
• Updated • 3.29k • 1