TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__SIE-Countdown3arg-AU_BoN-rl
Viewer
• Updated • 1k • 1
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__SIE-Countdown3arg-AU_BoN-sft
Viewer
• Updated • 1k • 2
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__M-cd3arg-should_verify__multiturn-rl
Viewer
• Updated • 1k • 4
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__M-cd3arg-should_verify__multiturn-sft
Viewer
• Updated • 1k • 2
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__M-cd3arg-should_verify__all_in_user-rl
Viewer
• Updated • 1k • 3
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__M-cd3arg-should_verify__all_in_user-sft
Viewer
• Updated • 1k • 3
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__M-cd3arg-should_verify-rl
Viewer
• Updated • 1k • 2
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__M-cd3arg-should_verify-sft
Viewer
• Updated • 1k • 2
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd3arg__Qwen2.5-1.5B-Instruct
Viewer
• Updated • 1k • 2
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-FiRC-rl
Viewer
• Updated • 1k • 2
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-FiRC-sft
Viewer
• Updated • 1k • 1
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-CoUCF-rl
Viewer
• Updated • 1k • 3
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-CoUCF-sft
Viewer
• Updated • 1k • 1
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-AReC-rl
Viewer
• Updated • 1k • 2
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-AReC-sft
Viewer
• Updated • 1k • 2
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-Distilled_QWQ-rl
Viewer
• Updated • 1k • 4
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-Distilled_QWQ-sft
Viewer
• Updated • 1k • 4
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-BoN-rl
Viewer
• Updated • 1k • 4
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-BoN-sft
Viewer
• Updated • 1k • 5
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-AU_BoN-rl
Viewer
• Updated • 1k • 4
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__SIE-Countdown3arg-AU_BoN-sft
Viewer
• Updated • 1k • 3
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__rl__results
Viewer
• Updated • 6 • 4
TAUR-dev/SIE_EVAL__countdown3arg_should_verify__rl__samples
Viewer
• Updated • 2.1k • 4
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__M-cd3arg-should_verify__multiturn-rl
Viewer
• Updated • 1k • 3
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__M-cd3arg-should_verify__multiturn-sft
Viewer
• Updated • 1k • 3
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__M-cd3arg-should_verify__all_in_user-rl
Viewer
• Updated • 1k • 3
TAUR-dev/D-EXP_ask_lm_if_is_correct__cd4arg__M-cd3arg-should_verify__all_in_user-sft
Viewer
• Updated • 1k • 5
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_SIE-Countdown3arg-FiRC-rl
Viewer
• Updated • 1.92k • 3
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_SIE-Countdown3arg-FiRC-sft
Viewer
• Updated • 1.92k • 4
TAUR-dev/D-EXP_bmodel_bon_verif_cdarg3_SIE-Countdown3arg-AU_BoN-rl
Viewer
• Updated • 1.92k • 2