TAUR-dev/SF-D_EVAL-raw_eval_datasets-7-5_25
Viewer
• Updated • 1.7k • 5
TAUR-dev/SF-D_RL-rl_datasets-7-13_25
Viewer
• Updated • 5k • 63
TAUR-dev/SF-D_RL-raw_rl_datasets__pre_filter-7-13_25
Viewer
• Updated • 21k • 65
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__entropy__sft
Viewer
• Updated • 3.71k • 63
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__high_low_entropy__sft
Viewer
• Updated • 514 • 65
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__entropy__base_model
Viewer
• Updated • 3.71k • 66
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__high_low_entropy__base_model
Viewer
• Updated • 514 • 65
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__entropy__rl
Viewer
• Updated • 3.71k • 67
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__high_low_entropy__rl
Viewer
• Updated • 514 • 61
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__entropy_base_model
Viewer
• Updated • 5 • 64
TAUR-dev/dataset__countdown2arg__qwen2.5-1.5b-I__BoN__altered__convos__high_low_entropy_tokens_base_model
Viewer
• Updated • 152 • 66
TAUR-dev/D-SFT_C-cd3arg-Qwen2.5-1.5B-Instruct-Mix_ss_pse_vote_ansrev
Viewer
• Updated • 15.2k • 64
TAUR-dev/SIE_EVAL__countdown2arg_bon_au__sft__samples__bf_evaluated__prefix_sample_diversity_metrics_2
Viewer
• Updated • 284 • 51
TAUR-dev/SIE_EVAL__countdown2arg_bon_au__sft__samples__bf_evaluated__prefix_sample_diversity_metrics
Viewer
• Updated • 284 • 51
TAUR-dev/SIE_EVAL__countdown2arg_bon_au__sft__samples__bf_evaluated__prefix_diversity_metrics
Viewer
• Updated • 284 • 5
TAUR-dev/SIE_EVAL__countdown2arg_bon_au__sft__samples__bf_evaluated__prefix_continuations
Viewer
• Updated • 284 • 6
TAUR-dev/IFBench__qwen__bon__single
Viewer
• Updated • 294 • 18
TAUR-dev/IFEval__qwen__bon__single
Viewer
• Updated • 100 • 25
Viewer
• Updated • 107 • 39
TAUR-dev/SF-D_EVAL-raw_eval_datasets__pre_filter-7-5_25
Viewer
• Updated • 15.3k • 4
TAUR-dev/SIE_EVAL__countdown3arg_vote_think__rl__samples__bf_evaluated
Viewer
• Updated • 2.1k • 5
TAUR-dev/SIE_EVAL__countdown3arg_psebon_think_werror_idx__rl__samples__bf_evaluated
Viewer
• Updated • 2.1k • 5
TAUR-dev/SIE_EVAL__countdown3arg_ans_rev_think_p25chance__sft__samples__bf_evaluated
Viewer
• Updated • 2.1k • 5
TAUR-dev/SIE_EVAL__countdown3arg_ssbon_think_p5chance__sft__samples__bf_evaluated
Viewer
• Updated • 2.1k • 5
TAUR-dev/SIE_EVAL__countdown3arg_ans_rev_think_p25chance__rl__samples__bf_evaluated
Viewer
• Updated • 2.1k • 5
TAUR-dev/SIE_EVAL__countdown3arg_ssbon_think_p5chance__rl__samples__bf_evaluated
Viewer
• Updated • 2.1k • 5
TAUR-dev/SIE_EVAL__countdown3arg_psebon_think_werror_idx__sft__samples__bf_evaluated
Viewer
• Updated • 2.1k • 5
TAUR-dev/SIE_EVAL__countdown3arg_vote_think__sft__samples__bf_evaluated
Viewer
• Updated • 2.1k • 5
TAUR-dev/SIE_EVAL__countdown3arg_vote_think__rl__results
Viewer
• Updated • 6 • 4
TAUR-dev/SIE_EVAL__countdown3arg_vote_think__rl__samples
Viewer
• Updated • 2.1k • 5