TAUR-dev/D-back_to_og_mix__simple_retries__sbon-sft-data
• Updated • 7.62k • 5
TAUR-dev/D-VAL_SFT-config_hash__686d5cfa7fc7c17f
• Updated • 5 • 5
TAUR-dev/D-VAL_SFT-config_hash__2b3684a11a6b45f5
• Updated • 50 • 5
TAUR-dev/llama_factory__countdown_3arg__val_sample_eval
• Updated • 2k • 6
TAUR-dev/D-VAL_SFT-config_hash-8707783415938120676
• Updated • 2k • 5
TAUR-dev/D-VAL_SFT-config_hash__701d0e178820be4c
• Updated • 2k • 6
TAUR-dev/D-VAL_SFT-config_hash-6463082379206781726
• Updated • 2k • 6
TAUR-dev/D-VAL_SFT-config_hash-5955901347446409479
• Updated • 2k • 6
TAUR-dev/D-VAL_SFT-config_hash-3721503065897118847
• Updated • 2k • 5
TAUR-dev/D-VAL_SFT-config_hash-5666993983057710268
• Updated • 2k • 6
TAUR-dev/D-VAL_SFT-config_hash-205753953167594443
• Updated • 2k • 6
TAUR-dev/D-VAL_SFT-config_hash-3645486539525341560
• Updated • 2k • 5
TAUR-dev/llama_factory__countdown_3arg__val
• Updated • 250 • 4
TAUR-dev/D-EVAL__standard_eval_v3__sft_gs_fixed_evals__masked_high_lr__multi_task_5ep_rl-eval_rl
• Updated • 2.45k • 5
TAUR-dev/D-exp_masked_ranges_basic_v1_high_lr-sft-data_validation
• Updated • 250 • 5
TAUR-dev/D-SFT_C-cd3arg-Qwen2.5-1.5B-Mixed-all_examples_with_skills_validation
• Updated • 250 • 5
TAUR-dev/D-sft_gs__singleton_structures__N_1__masked_low_lr-sft-data
• Updated • 3.81k • 5
TAUR-dev/D-SFT_C-cd3arg-Qwen2.5-1.5B-Instruct-Mix_ss_pse_vote_ansrev_validation
• Updated • 250 • 5
TAUR-dev/dataset__countdown__num_range-3__bon_scored__AReC_convos_format_fixed_validation
• Updated • 250 • 5
TAUR-dev/dataset__countdown__num_range-3__bon_scored__AReC_convos_format_fixed
• Updated • 3.8k • 5
TAUR-dev/dataset__countdown__num_range-3__bon_scored__AReC_convos_validation
• Updated • 500 • 5
TAUR-dev/D-sft_gs__structure_types__mix_only__masked_high_lr-sft-data
• Updated • 28.8k • 4
TAUR-dev/D-SFTv1_C-cd3arg-Qwen2.5-1.5B-MockSearchV2-7_24_25_validation
• Updated • 500 • 6
TAUR-dev/D-EVAL__standard_eval_v3__sft_gs__structure_types__answer_revision_only__masked_high_lr-eval_sft
• Updated • 2.45k • 6
TAUR-dev/D-SFTv1_C-cd3arg-Qwen2.5-1.5B-MockSearchV2-7_24_25_sft_test_with_validation_tracking_sft_data
• Updated • 31k • 6
TAUR-dev/D-SFTv1_C-cd3arg-Qwen2.5-1.5B-MockSearchV2-7_24_25-sft_test_with_validation_tracking-sft-data
• Updated • 31k • 6
TAUR-dev/D-SFTv1_C-cd3arg-Qwen2.5-1.5B-MockSearchV2-7_24_25
• Updated • 31k • 76
TAUR-dev/D-sft_gs__structure_types__answer_revision_only__masked_high_lr-sft-data
• Updated • 30k • 6
TAUR-dev/D-EVAL__standard_eval_v3__sft_gs_fixed_evals__masked_low_lr__multi_task_rl-eval_rl
• Updated • 2.45k • 5
TAUR-dev/D-EVAL__standard_eval_v3__sft_gs_fixed_evals__masked_high_lr__multi_task_rl-eval_rl
• Updated • 2.45k • 4