TAUR-dev/M-1114_newmodels__qwen7b_ct3arg-rl
Updated
TAUR-dev/M-1114_newmodels__qwen0.5b_ct3arg-rl
Updated
TAUR-dev/M-1113_newmodels__qwen7b_bs1_ct3arg-rl
8B • Updated • 1
TAUR-dev/M-1113_newmodels__qwen7b_ct3arg-rl
8B • Updated • 3
TAUR-dev/M-1113_newmodels__llama3b_bs1_ct3arg-rl
4B • Updated • 1
TAUR-dev/M-1113_newmodels__llama3b_ct3arg-rl
4B • Updated • 2
TAUR-dev/M-1113_newmodels__qwen0.5b_ct3arg-rl
0.6B • Updated • 2
TAUR-dev/M-1110_star__oursfixed_alltask-rl
2B • Updated • 1
TAUR-dev/M-1110_star__star_alltask-rl
2B • Updated • 1
TAUR-dev/M-AT_ours_sft-sft
2B • Updated • 1
TAUR-dev/M-AT_star_sft-sft
2B • Updated • 1
TAUR-dev/M-r1_distill_baseline-rl
2B • Updated • 1
TAUR-dev/M-r1_translated_BASELINE-sft
2B • Updated • 2
TAUR-dev/M-rl_rlonly_AT_fixed-rl
Updated
TAUR-dev/testing__pvv2_lora
2B • Updated • 2
TAUR-dev/testing__lf_pvv2_resume
2B • Updated • 1
TAUR-dev/M-1023_longmult__0epoch_longmult3dig-rl
2B • Updated • 1
TAUR-dev/M-1022_longcontext__maxlen8192_1e_3args-rl
Updated
TAUR-dev/M-1022_longcontext__maxlen8192_1e_3and4arg-rl
Updated
TAUR-dev/sf_pvv2_cd3arg_10resps_sft
2B • Updated • 1
TAUR-dev/M-1022_longcontext__maxlen8192_0epoch_3args-rl
Updated
TAUR-dev/pvv2_longmult3dig_sft
2B • Updated • 1
TAUR-dev/M-1022_longcontext__maxlen4096_1e_3args-rl
Updated
TAUR-dev/M-1022_longcontext__maxlen4096_0epoch_3args-rl
Updated
TAUR-dev/M-1022_longcontext__maxlen8192_0epoch_3and4arg-rl
Updated
TAUR-dev/M-1022_longcontext__maxlen4096_0epoch_3and4arg-rl
Updated
TAUR-dev/M-1022_longcontext__maxlen4096_1e_3and4arg-rl
Updated
TAUR-dev/testing_llamafactory_helper_quick_test
0.5B • Updated • 3
TAUR-dev/testing_llamafactory_helper_quick_test__interactive
0.5B • Updated • 4
TAUR-dev/testing_llamafactory_helper_quick_test__local
0.5B • Updated • 3