TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__M-cd3arg-should_verify__multiturn-sft
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__M-cd3arg-should_verify__all_in_user-rl
• Updated • 1k • 4
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__M-cd3arg-should_verify__all_in_user-sft
• Updated • 1k • 5
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__M-cd3arg-should_verify-rl
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__M-cd3arg-should_verify-sft
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__Qwen2.5-1.5B-Instruct
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-FiRC-rl
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-FiRC-sft
• Updated • 1k • 5
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-CoUCF-rl
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-CoUCF-sft
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-AReC-rl
• Updated • 1k • 5
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-AReC-sft
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-Distilled_QWQ-rl
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-Distilled_QWQ-sft
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-BoN-rl
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-BoN-sft
• Updated • 1k • 5
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-AU_BoN-rl
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-AU_BoN-sft
• Updated • 1k • 6
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__rl__results
• Updated • 6 • 5
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__rl__samples
• Updated • 2.1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__M-cd3arg-should_verify__multiturn-rl
• Updated • 1k • 6
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__M-cd3arg-should_verify__multiturn-sft
• Updated • 1k • 5
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__M-cd3arg-should_verify__all_in_user-rl
• Updated • 1k • 5
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__M-cd3arg-should_verify__all_in_user-sft
• Updated • 1k • 5
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_SIE-Countdown3arg-FiRC-rl
• Updated • 1.92k • 6
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_SIE-Countdown3arg-FiRC-sft
• Updated • 1.92k • 6
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_SIE-Countdown3arg-AU_BoN-rl
• Updated • 1.92k • 6
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_SIE-Countdown3arg-AU_BoN-sft
• Updated • 1.92k • 5
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__sft__results
• Updated • 6 • 6
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__sft__samples
• Updated • 2.1k • 6