TAUR-dev/D-EVAL__standard_eval_v1__SIE-Countdown3arg-Distilled_QWQ-rl
Viewer
• Updated • 1.7k • 1
TAUR-dev/D-EVAL__standard_eval_v1__SIE-countdown3arg_ans_rev_think_p25chance-sft
Viewer
• Updated • 1.7k • 2
TAUR-dev/backup___SF-D_EVAL-raw_eval_datasets-7-5_25
Viewer
• Updated • 1.7k • 2
TAUR-dev/SF-D_EVAL-raw_eval_datasets-7-5_25
Viewer
• Updated • 1.7k • 4
TAUR-dev/SF-D_RL-rl_datasets-7-13_25
Viewer
• Updated • 5k • 3
TAUR-dev/SF-D_RL-raw_rl_datasets__pre_filter-7-13_25
Viewer
• Updated • 21k • 5
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__entropy__sft
Viewer
• Updated • 3.71k • 5
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__high_low_entropy__sft
Viewer
• Updated • 514 • 2
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__entropy__base_model
Viewer
• Updated • 3.71k • 2
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__high_low_entropy__base_model
Viewer
• Updated • 514 • 3
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__entropy__rl
Viewer
• Updated • 3.71k • 2
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__high_low_entropy__rl
Viewer
• Updated • 514 • 5
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__entropy_base_model
Viewer
• Updated • 5 • 3
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__high_low_entropy_tokens_base_model
Viewer
• Updated • 152 • 3
TAUR-dev/D-SFT_C-cd3arg-Qwen2.5-1.5B-Instruct-Mix_ss_pse_vote_ansrev
Viewer
• Updated • 15.2k • 2
TAUR-dev/SIE_EVAL__countdown2arg_bon_au__sft__samples__bf_evaluated__prefix_sample_diversity_metrics_2
Viewer
• Updated • 284 • 2
TAUR-dev/SIE_EVAL__countdown2arg_bon_au__sft__samples__bf_evaluated__prefix_sample_diversity_metrics
Viewer
• Updated • 284 • 3
TAUR-dev/SIE_EVAL__countdown2arg_bon_au__sft__samples__bf_evaluated__prefix_diversity_metrics
Viewer
• Updated • 284 • 3
TAUR-dev/SIE_EVAL__countdown2arg_bon_au__sft__samples__bf_evaluated__prefix_continuations
Viewer
• Updated • 284 • 3
TAUR-dev/IFBench__qwen__bon__single
Viewer
• Updated • 294 • 2
TAUR-dev/IFEval__qwen__bon__single
Viewer
• Updated • 100 • 2
Viewer
• Updated • 107 • 4
TAUR-dev/SF-D_EVAL-raw_eval_datasets__pre_filter-7-5_25
Viewer
• Updated • 15.3k • 2
TAUR-dev/SIE_EVAL__countdown3arg_vote_think__rl__samples__bf_evaluated
Viewer
• Updated • 2.1k • 2
TAUR-dev/SIE_EVAL__countdown3arg_psebon_think_werror_idx__rl__samples__bf_evaluated
Viewer
• Updated • 2.1k • 2
TAUR-dev/SIE_EVAL__countdown3arg_ans_rev_think_p25chance__sft__samples__bf_evaluated
Viewer
• Updated • 2.1k • 2
TAUR-dev/SIE_EVAL__countdown3arg_ssbon_think_p5chance__sft__samples__bf_evaluated
Viewer
• Updated • 2.1k • 3
TAUR-dev/SIE_EVAL__countdown3arg_ans_rev_think_p25chance__rl__samples__bf_evaluated
Viewer
• Updated • 2.1k • 3
TAUR-dev/SIE_EVAL__countdown3arg_ssbon_think_p5chance__rl__samples__bf_evaluated
Viewer
• Updated • 2.1k • 3
TAUR-dev/SIE_EVAL__countdown3arg_psebon_think_werror_idx__sft__samples__bf_evaluated
Viewer
• Updated • 2.1k • 2