AI & ML interests
LLM Post-Training
Organizations
None yet
Renjie-Ranger/CCFT-v1-GPT5nano-critique-math_no_concise_default-global_step_20
8B • Updated
• 4
Renjie-Ranger/v1-GPT5nano-critique-general_reasoner_summary_C-plus_no_concise_p-online-global_step_25
8B • Updated
• 4
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_normal-global_step_20
8B • Updated
Renjie-Ranger/CCFT-v1-GPT5nano-critique-math_no_concise_default-global_step_15
8B • Updated
• 4
Renjie-Ranger/v1-GPT5nano-critique-general_reasoner_summary_C-plus_no_concise_p-online-global_step_20
8B • Updated
• 4
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_normal-global_step_15
8B • Updated
• 4
Renjie-Ranger/CCFT-v1-GPT5nano-critique-math_no_concise_default-global_step_10
8B • Updated
• 4
Renjie-Ranger/v1-GPT5nano-critique-general_reasoner_summary_C-plus_no_concise_p-online-global_step_15
8B • Updated
• 4
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_normal-global_step_10
8B • Updated
Renjie-Ranger/v1-GPT5nano-critique-general_reasoner_summary_C-plus_no_concise_p-online-global_step_10
8B • Updated
• 2
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_rft-global_step_90
8B • Updated
• 4
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_rft-global_step_85
8B • Updated
• 3
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_rft-global_step_80
8B • Updated
• 4
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_rft-global_step_75
8B • Updated
• 3
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online-global_step_90
8B • Updated
• 4
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_rft-global_step_70
8B • Updated
• 4
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online-global_step_85
8B • Updated
• 3
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_rft-global_step_65
8B • Updated
• 4
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online-global_step_80
8B • Updated
• 3
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_rft-global_step_60
8B • Updated
• 5
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online-global_step_75
8B • Updated
• 4
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_rft-global_step_55
8B • Updated
• 3
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online-global_step_70
8B • Updated
• 3
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_rft-global_step_50
8B • Updated
• 3
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online-global_step_65
8B • Updated
• 4
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_rft-global_step_5
8B • Updated
• 5
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online-global_step_60
8B • Updated
• 4
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_rft-global_step_45
8B • Updated
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online-global_step_55
8B • Updated
• 3
Renjie-Ranger/GRPO-GPT5nano-critique-big_math_vanilla_partial_online_rft-global_step_40
8B • Updated