mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gaprm_qwen3_ap-S42-R0-train_eval-b079 Viewer • Updated about 1 hour ago • 1.22k
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gaprm_qwen3_ap-S42-R0-train_eval-b069 Viewer • Updated about 8 hours ago • 1.22k • 3
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gaprm_qwen3_ap-S42-R0-train_eval-b059 Viewer • Updated about 16 hours ago • 1.22k • 5
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gaprm_qwen3_ap-S42-R0-train_eval-b049 Viewer • Updated about 24 hours ago • 1.22k • 4
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gaprm_qwen3_ap-S42-R0-train_eval-b039 Viewer • Updated 1 day ago • 1.22k • 9
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gaprm_qwen3_ap-S42-R0-train_eval-b029 Viewer • Updated 1 day ago • 1.22k • 10
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gaprm_qwen3_ap-S42-R0-train_eval-b019 Viewer • Updated 2 days ago • 1.22k • 9
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gaprm_qwen3_ap-S42-R0-train_eval-b009 Viewer • Updated 2 days ago • 1.22k • 11
mzio/aprm-sft_genthinkact-Eact_prm_tw_coin_easy_sp-Gnobandit_aprm_qwen3_ap-S0-R1-train_eval-b099 Viewer • Updated 3 days ago • 1.22k • 6
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gnobandit_aprm_qwen3_ap-S0-R1-train_eval-b099 Viewer • Updated 3 days ago • 1.22k • 8
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gnobandit_aprm_qwen3_ap-S0-R1-train_eval-b089 Viewer • Updated 4 days ago • 1.22k • 5
mzio/aprm-sft_genthinkact-Eaprm_tw_treasure_medium_sp-Gnobandit_aprm_qw3_ap-S42-Rmt128_nb_treasu Viewer • Updated 4 days ago • 448 • 7
mzio/aprm-sft_thinkact-Eaprm_tw_treasure_medium_sp-Gnobandit_aprm_qw3_ap-S42-Rmt128_nb_treasure_ Viewer • Updated 4 days ago • 448 • 35
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gnobandit_aprm_qwen3_ap-S0-R1-train_eval-b079 Viewer • Updated 4 days ago • 1.22k • 7
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gnobandit_aprm_qwen3_ap-S0-R1-train_eval-b069 Viewer • Updated 4 days ago • 1.22k • 7
mzio/aprm-sft_thinkact-Eaprm_tw_treasure_hard_sp-Gnobandit_aprm_qw3_ap-S42-Rmt128_nb_treasure_ha Viewer • Updated 4 days ago • 881 • 9
mzio/aprm-sft_genthinkact-Eaprm_tw_treasure_easy_sp-Gnobandit_aprm_qw3_ap-S42-Rmt128_nb_treasure Viewer • Updated 4 days ago • 244 • 6
mzio/aprm-sft_thinkact-Eaprm_tw_treasure_easy_sp-Gnobandit_aprm_qw3_ap-S42-Rmt128_nb_treasure_ea Viewer • Updated 4 days ago • 244 • 31
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gnobandit_aprm_qwen3_ap-S0-R1-train_eval-b059 Viewer • Updated 4 days ago • 1.22k • 7
mzio/aprm-sft_thinkact-Eaprm_tw_coin_medium_sp-Gaprm_qw3_ap-S42-Rmt128_reg_coin_medium_g4-train_ Viewer • Updated 5 days ago • 1.22k • 19
mzio/aprm-sft_thinkact-Eaprm_tw_treasure_hard_sp-Gaprm_qw3_ap-S42-Rmt128_reg_treasure_hard-train Viewer • Updated 5 days ago • 881 • 22
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gnobandit_aprm_qwen3_ap-S0-R1-train_eval-b049 Viewer • Updated 5 days ago • 1.22k • 6
mzio/aprm-sft_genthinkact-Eaprm_tw_treasure_medium_sp-Gaprm_qw3_ap-S42-Rmt128_reg_treasure_mediu Viewer • Updated 5 days ago • 448 • 6
mzio/aprm-sft_thinkact-Eaprm_tw_treasure_medium_sp-Gaprm_qw3_ap-S42-Rmt128_reg_treasure_medium-t Viewer • Updated 5 days ago • 448 • 36
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gnobandit_aprm_qwen3_ap-S0-R1-train_eval-b039 Viewer • Updated 5 days ago • 1.22k • 6
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_hard_sp-Gaprm_qwen3_ap-S42-Rlr1e4-train_eval-b019 Viewer • Updated 5 days ago • 1.22k • 6
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gnobandit_aprm_qwen3_ap-S0-R1-train_eval-b029 Viewer • Updated 5 days ago • 1.22k • 6
mzio/aprm-sft_thinkact-Eaprm_tw_coin_easy_sp-Gaprm_qw3_ap-S42-Rmt128_reg_coin_easy-train_eval-b0 Viewer • Updated 5 days ago • 1.22k • 14
mzio/aprm-sft_thinkact-Eact_prm_tw_coin_easy_sp-Gaprm_qwen3_ap-S0-R1-train_eval-b029 Viewer • Updated 5 days ago • 1.22k • 8