AI & ML interests
LLM Post-Training
Organizations
None yet
Renjie-Ranger/verl-grpo-128k-Qwen2.5-1.5B-Instruct-global_step_10
2B • Updated
Renjie-Ranger/verl-grpo-128k-Qwen2.5-0.5B-Instruct-global_step_90
0.6B • Updated
Renjie-Ranger/verl-grpo-128k-Qwen2.5-0.5B-Instruct-global_step_80
0.6B • Updated
• 3
Renjie-Ranger/verl-grpo-128k-Qwen2.5-0.5B-Instruct-global_step_70
0.6B • Updated
Renjie-Ranger/verl-grpo-128k-Qwen2.5-0.5B-Instruct-global_step_60
0.6B • Updated
• 3
Renjie-Ranger/verl-grpo-128k-Qwen2.5-0.5B-Instruct-global_step_50
0.6B • Updated
• 3
Renjie-Ranger/verl-grpo-128k-Qwen2.5-0.5B-Instruct-global_step_40
0.6B • Updated
• 4
Renjie-Ranger/verl-grpo-128k-Qwen2.5-0.5B-Instruct-global_step_30
0.6B • Updated
• 3
Renjie-Ranger/verl-grpo-128k-Qwen2.5-0.5B-Instruct-global_step_20
0.6B • Updated
Renjie-Ranger/verl-grpo-128k-Qwen2.5-0.5B-Instruct-global_step_110
0.6B • Updated
• 5
Renjie-Ranger/verl-grpo-128k-Qwen2.5-0.5B-Instruct-global_step_100
0.6B • Updated
• 5
Renjie-Ranger/verl-grpo-128k-Qwen2.5-0.5B-Instruct-global_step_10
0.6B • Updated
Renjie-Ranger/curriculum_220k_long-cot_Llama-3.2-1B-Instruct
Text Generation
• 1B • Updated
Renjie-Ranger/curriculum_220k_long-cot_Qwen2.5-0.5B-Instruct
Text Generation
• 0.5B • Updated
Renjie-Ranger/curriculum_220k_long-cot_gemma-3-1b-it
Text Generation
• 1.0B • Updated
Renjie-Ranger/curriculum_220k_long-cot_Qwen2.5-1.5B
Updated
Renjie-Ranger/curriculum_220k_long-cot_Qwen2.5-14B-Instruct
Text Generation
• 15B • Updated
Renjie-Ranger/curriculum_220k_long-cot_Qwen2.5-Math-1.5B-Instruct
Text Generation
• 2B • Updated
Renjie-Ranger/curriculum_220k_long-cot_Llama-3.2-3B-Instruct
Text Generation
• 3B • Updated
Renjie-Ranger/curriculum_220k_long-cot_Llama-3.1-8B-Instruct
Text Generation
• 8B • Updated
Renjie-Ranger/curriculum_220k_long-cot_Qwen2.5-7B-Instruct
Text Generation
• 8B • Updated
Renjie-Ranger/curriculum_220k_long-cot_Qwen2.5-1.5B-Instruct
Text Generation
• 2B • Updated
Renjie-Ranger/curriculum_220k_long-cot_Qwen2.5-3B-Instruct
Text Generation
• 3B • Updated
Renjie-Ranger/curriculum_32k_long-cot_Llama-3.2-1B-Instruct
Text Generation
• 1B • Updated
Renjie-Ranger/curriculum_32k_long-cot_gemma-3-1b-it
Text Generation
• 1.0B • Updated
Renjie-Ranger/curriculum_32k_long-cot_Qwen2.5-1.5B
Text Generation
• 2B • Updated
Renjie-Ranger/curriculum_32k_long-cot_Qwen2.5-14B-Instruct
Text Generation
• 15B • Updated
Renjie-Ranger/curriculum_32k_long-cot_Qwen2.5-Math-1.5B-Instruct
Text Generation
• 2B • Updated
Renjie-Ranger/curriculum_32k_long-cot_Qwen-2.5-3B
Text Generation
• 3B • Updated
Renjie-Ranger/curriculum_32k_long-cot_Llama-3.2-3B-Instruct
Text Generation
• 3B • Updated