TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_M-cd3arg-should_verify__multiturn-rl
• Updated • 1.92k • 6
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_M-cd3arg-should_verify__multiturn-sft
• Updated • 1.92k • 6
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__multiturn__rl__results
• Updated • 6 • 5
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__multiturn__rl__samples
• Updated • 2.1k • 6
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__all_in_user__rl__results
• Updated • 6 • 5
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__all_in_user__rl__samples
• Updated • 2.1k • 6
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_M-cd3arg-should_verify__all_in_user-rl
• Updated • 1.92k • 5
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_M-cd3arg-should_verify__all_in_user-sft
• Updated • 1.92k • 5
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__all_in_user__sft__results
• Updated • 6 • 5
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__all_in_user__sft__samples
• Updated • 2.1k • 6
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__multiturn__sft__results
• Updated • 6 • 5
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__multiturn__sft__samples
• Updated • 2.1k • 6
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_M-cd3arg-should_verify-rl
• Updated • 1.92k • 6
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_M-cd3arg-should_verify-sft
• Updated • 1.92k • 6
TAUR-dev/D-SFT_critique_cdarg3_Qwen2.5-1.5B-Instruct
• Updated • 5
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_Qwen2.5-1.5B-Instruct
• Updated • 1.92k • 5
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_SIE-Countdown3arg-CoUCF-rl
• Updated • 1.92k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-Distilled_QWQ-sft-7_6_25
• Updated • 1k • 4
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_SIE-Countdown3arg-CoUCF-sft
• Updated • 1.92k • 5
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-AU_BoN-rl-7_6_25
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-AU_BoN-sft-7_6_25
• Updated • 1k • 5
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-countdown3arg_should_verify__all_in_user-rl-7_6_25
• Updated • 1k • 5
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-countdown3arg_should_verify__all_in_user-sft-7_6_25
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-countdown3arg_should_verify-rl-7_6_25
• Updated • 1k • 5
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-countdown3arg_should_verify-sft-7_6_25
• Updated • 1k • 5
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-countdown3arg_should_verify__multiturn-rl-7_6_25
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-countdown3arg_should_verify__multiturn-sft-7_6_25
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__Qwen2.5-1.5B-Instruct-7_6_25
• Updated • 1k • 6
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_SIE-Countdown3arg-AReC-rl
• Updated • 1.92k • 6
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_SIE-Countdown3arg-AReC-sft
• Updated • 1.92k • 6