cutelemonlili/Qwen2.5-7B-d4ks256-forget_at_1_fw_split_general_7B_grpo_DSR_rl_4k_48
8B • Updated
cutelemonlili/Qwen2.5-Math-7B-d4ks256-forget_at_1_fw_split_Math_7B_grpo_DSR_rl_4k_40
8B • Updated
cutelemonlili/Qwen2.5-7B-d4ks256-forget_at_1_fw_split_general_7B_grpo_DSR_rl_4k_40
8B • Updated
cutelemonlili/Qwen2.5-Math-7B-d4ks256-forget_at_1_fw_split_Math_7B_grpo_DSR_rl_4k_32
8B • Updated
cutelemonlili/Qwen2.5-7B-d4ks256-forget_at_1_fw_split_general_7B_grpo_DSR_rl_4k_32
8B • Updated
cutelemonlili/Qwen2.5-Math-7B-d4ks256-forget_at_1_fw_split_Math_7B_grpo_DSR_rl_4k_24
8B • Updated
cutelemonlili/Qwen2.5-7B-d4ks256-forget_at_1_fw_split_general_7B_grpo_DSR_rl_4k_24
8B • Updated
cutelemonlili/Qwen2.5-Math-7B-d4ks256-forget_at_1_fw_split_Math_7B_grpo_DSR_rl_4k_16
8B • Updated
cutelemonlili/Qwen2.5-7B-d4ks256-forget_at_1_fw_split_general_7B_grpo_DSR_rl_4k_16
8B • Updated
cutelemonlili/Qwen2.5-7B-deepscaler_5k_prime_256
8B • Updated
cutelemonlili/Qwen2.5-Math-7B-d4ks256-forget_at_1_fw_split_Math_7B_grpo_DSR_rl_4k_8
8B • Updated
cutelemonlili/Qwen2.5-Math-7B-deepscaler_5k_prime_256
8B • Updated
cutelemonlili/Qwen2.5-7B-d4ks256-forget_at_1_fw_split_general_7B_grpo_DSR_rl_4k_8
8B • Updated
cutelemonlili/Qwen2.5-7B-Instruct-d4ks256-forget_at_1_fw_split_general_7B_instruct_grpo_DSR_rl_4k_96
8B • Updated
cutelemonlili/Qwen2.5-7B-Instruct-d4ks256-forget_at_1_fw_split_general_7B_instruct_grpo_DSR_rl_4k_88
8B • Updated
cutelemonlili/Qwen2.5-7B-Instruct-d4ks256-forget_at_1_fw_split_general_7B_instruct_grpo_DSR_rl_4k_80
8B • Updated
cutelemonlili/Qwen2.5-7B-Instruct-d4ks256-forget_at_1_fw_split_general_7B_instruct_grpo_DSR_rl_4k_72
8B • Updated
cutelemonlili/Qwen2.5-7B-Instruct-d4ks256-forget_at_1_fw_split_general_7B_instruct_grpo_DSR_rl_4k_64
8B • Updated
cutelemonlili/Qwen2.5-7B-Instruct-d4ks256-forget_at_1_fw_split_general_7B_instruct_grpo_DSR_rl_4k_56
8B • Updated
cutelemonlili/Qwen2.5-7B-Instruct-d4ks256-forget_at_1_fw_split_general_7B_instruct_grpo_DSR_rl_4k_48
8B • Updated
cutelemonlili/Qwen2.5-7B-Instruct-d4ks256-forget_at_1_fw_split_general_7B_instruct_grpo_DSR_rl_4k_40
8B • Updated
cutelemonlili/Qwen2.5-7B-Instruct-d4ks256-forget_at_1_fw_split_general_7B_instruct_grpo_DSR_rl_4k_32
8B • Updated
cutelemonlili/Qwen2.5-7B-Instruct-d4ks256-forget_at_1_fw_split_general_7B_instruct_grpo_DSR_rl_4k_24
8B • Updated
cutelemonlili/Qwen2.5-7B-Instruct-d4ks256-forget_at_1_fw_split_general_7B_instruct_grpo_DSR_rl_4k_16
8B • Updated
cutelemonlili/Qwen2.5-7B-Instruct-d4ks256-forget_at_1_fw_split_general_7B_instruct_grpo_DSR_rl_4k_8
8B • Updated
cutelemonlili/Qwen2.5-1.5B-Instruct-deepscaler_4k_8nf_step_320
2B • Updated
cutelemonlili/Qwen2.5-1.5B-Instruct-deepscaler_4k_8nf_step_288
2B • Updated
cutelemonlili/Qwen2.5-1.5B-Instruct-deepscaler_4k_8nf_step_256
2B • Updated
cutelemonlili/Qwen2.5-1.5B-Instruct-deepscaler_4k_8nf_step_224
2B • Updated
cutelemonlili/Qwen2.5-1.5B-Instruct-deepscaler_4k_8nf_step_192
2B • Updated