TAUR-dev/D-EVAL__standard_eval_v3__eval_RC-ab_sft_bon_corr_samples-sft-eval_sft • 3.29k • 7
TAUR-dev/D-EVAL__standard_eval_v3__RC_ab-orig_only-SFT-eval_sft • 3.29k • 7
TAUR-dev/D-EVAL__standard_eval_v3__RC_ab-no_reflects-SFT-eval_sft • 3.29k • 7
TAUR-dev/D-EVAL__standard_eval_v3__RC_ab-random-SFT-eval_sft • 3.29k • 6
TAUR-dev/D-EVAL__standard_eval_v3__eval_RC-ab_sft_bon_all_samples-sft-eval_sft • 3.29k • 7
TAUR-dev/D-EVAL__standard_eval_v3__eval_sft_exp_1e_zayneprompts_v3_sft-eval_sft • 3.29k • 7
TAUR-dev/D-EVAL__standard_eval_v3__eval_RC_ab_sft_our_structure_single_sample-sft-eval_sft-eval_sft • 3.29k • 7
TAUR-dev/skillfactory-ablations__random_reflections5_formatsrandom • 14.7k • 6
TAUR-dev/skillfactory-ablations__orig_only_reflections5_formats-C_full • 3.01k • 6
TAUR-dev/skillfactory-ablations__no_reflections_reflections5_formatsno_reflection • 14.7k • 4
TAUR-dev/D-SFT_C-RC-ab_sft_bon_corr_samples-sft-data • 14.7k • 6
TAUR-dev/D-SFT_C-RC-ab_sft_bon_all_samples-sft-data • 73.6k • 5
TAUR-dev/D-SFT_C-skillfactory-ablations__orig_only_reflections5_formats-C_full-sft-data • 3.01k • 6
TAUR-dev/D-SFT_C-skillfactory-ablations__no_reflections_reflections5_formatsno_reflection-sft-data • 14.7k • 5
TAUR-dev/D-SFT_C-skillfactory-ablations__random_reflections5_formatsrandom-sft-data • 14.7k • 4
TAUR-dev/D-SFT_C-RC-ab_sft_our_structure_single_sample-sft-data • 8.08k • 4
TAUR-dev/D-EVAL__standard_eval_v3__new_tasks_eval__rl_baseline-eval_rl • 841 • 7
TAUR-dev/SFT_D-RC_ab-bon_tune_all_samples • 73.6k • 6
TAUR-dev/SFT_D-RC_ab-single_sample_our_structure • 8.08k • 6
TAUR-dev/SFT_D-RC_ab-bon_tune_corr_samples_only • 14.7k • 5
TAUR-dev/skillfactory-ablations__random_reflections3_formatsrandom • 2.93k • 6
TAUR-dev/skillfactory-ablations__random_reflections3_formatscleanup.py.experiments.new_datasets • 6
TAUR-dev/test_ablation_qrepeat1_reflections3_formats-C • 2.93k • 5
TAUR-dev/test_ablation_num_correct_1.1.2.2.3.2.3.4.3.4.5_num_incorrect_0.1.0.1.0.2.1.0.2.1.0 • 6.32k • 6
TAUR-dev/D-EVAL__standard_eval_v3__eval_original_1e_rl_v2-eval_rl • 3.29k • 7
TAUR-dev/D-EVAL__standard_eval_v3_eval_original_1e_updated-eval_rl • 5.29k • 7
TAUR-dev/D-EVAL__standard_eval_v3__eval_test_original1e-eval_rl • 144 • 6
TAUR-dev/D-EVAL__standard_eval_v3__eval_original_1e_rl-eval_rl • 3.29k • 7