AI & ML interests
None yet
Organizations
None yet
cg666/openr1-Qwen-2.5-Base-3B-gen8-scale-NuminaMath-TIR-100-ghpo-beta0-epoch2
Updated
cg666/openr1-Qwen-2.5-Base-3B-gen8-scale-NuminaMath-TIR-100-grpo-beta0-epoch2
Text Generation
• 3B • Updated • 2
cg666/Qwen-2.5-Base-3B-gen8-scale-MATH-lighteval-olympiads_aime-unique-ghpo-beta0-epoch3
Text Generation
• 3B • Updated • 5
cg666/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-unique-ghpo-beta0-epoch3-NP
8B • Updated cg666/Qwen-2.5-Base-3B-gen8-scale-math_selected-grpo-beta0-epoch3
Text Generation
• 3B • Updated • 3
cg666/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-unique-grpo-beta0-epoch2
Text Generation
• 8B • Updated • 3
cg666/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-unique-ghpo-beta0-epoch2
Text Generation
• 8B • Updated • 2
cg666/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-unique-ghpo-beta1e-4-epoch2
cg666/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-ghpo-beta0
Updated
cg666/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-ghpo-test111
Updated
cg666/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-ghpo
Text Generation
• 8B • Updated • 2
cg666/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-grpo
Text Generation
• 8B • Updated • 3
cg666/Qwen-2.5-Base-7B-mixed-gen8-scale-ghpo-hint0.5-epoch1
Text Generation
• 8B • Updated • 2
cg666/Qwen-2.5-Base-7B-mixed-gen8-noscale-ghpo-hint0.5-epoch1
Text Generation
• 8B • Updated • 1
cg666/Qwen-2.5-Base-7B-mixed-gen8-scale-ghpo-hint0.4-epoch1
Text Generation
• 8B • Updated • 3
cg666/Qwen-2.5-Base-7B-mixed-gen8-scale-ghpo-hint0.6-epoch1
Text Generation
• 8B • Updated • 2
cg666/Qwen-2.5-Base-7B-mixed-gen8-scale-ghpo-hint0.9-epoch1
Text Generation
• 8B • Updated • 2
cg666/Qwen-2.5-Base-7B-mixed-gen8-scale-ghpo-hint0.3-epoch1
Text Generation
• 8B • Updated • 2
cg666/Qwen-2.5-Base-7B-mixed-gen8-scale-ghpo-hint0.7
Text Generation
• 8B • Updated • 2
cg666/Qwen-2.5-Base-7B-mixed-gen8-scale-ghpo
Text Generation
• 8B • Updated • 3
cg666/Qwen-2.5-Base-7B-mixed-gen8-noscale-ghpo
Text Generation
• 8B • Updated • 3
cg666/Qwen-2.5-Base-7B-hard8000-gen8-ghpo
8B • Updated • 1
cg666/Qwen-2.5-Base-7B-mixed-gen8-ghpo
Updated
cg666/Qwen-2.5-Base-7B-hint-test
8B • Updated • 1
cg666/Qwen-2.5-Base-7B-mixed-hard-gen8-ghpo
Updated
cg666/Qwen-2.5-Base-7B-mixed-hard-hint0.9-gen14
Text Generation
• 8B • Updated • 2
cg666/Qwen2.5-0.5B-Open-R1-debug
Updated
cg666/Qwen-2.5-Base-7B-mixed-hard-hint0.6-gen14
8B • Updated • 1
cg666/Qwen-2.5-Base-7B-mixed-hard-hint-gen14
Text Generation
• 8B • Updated • 2
• cg666/Qwen-2.5-Base-7B-Zero-CL-gen8-om220k-hard-data8000-hint
Text Generation
• 8B • Updated • 2