AI & ML interests
None yet
Organizations
None yet
cg666/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold10-3Dhint-prompt1-dp
Updated
cg666/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold30-3Dhint-prompt1-cosine
Updated
cg666/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold30-3Dhint-prompt1-dp
Updated
cg666/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-grpo-prompt1-dp
Updated
cg666/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold30-3Dhint-prompt1
Text Generation
• 8B • Updated • 2
cg666/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold0-3Dhint-prompt1-epoch3
Text Generation
• 8B • Updated • 2
cg666/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-3Dhint-prompt0
Text Generation
• 8B • Updated • 1
cg666/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-3Dhint-prompt1
Text Generation
• 8B • Updated • 2
cg666/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-ghpo-cold-Dhint35-epoch1
8B • Updated • 1
cg666/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-grpo-epoch1
Text Generation
• 8B • Updated • 2
cg666/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold-Dhint5-7-epoch5
8B • Updated • 1
cg666/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold-Dhint-epoch5
8B • Updated • 1
cg666/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold-Dhint-epoch5
8B • Updated • 1
cg666/Qwen-2.5-Base-7B-gen8-math3to5-100-ghpo-hint0.5-epoch2-test
Updated
cg666/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-hint0.5-epoch2-save20
8B • Updated • 1
cg666/openr1-Qwen-2.5-Base-7B-grpo-epoch2-save2-debug
Updated
cg666/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-hint0.5-epoch2-save10
8B • Updated • 1
cg666/openr1-Qwen-2.5-Base-7B-grpo-epoch2-save2
Updated
cg666/openr1-Qwen-2.5-Base-7B-grpo-epoch2-save
Updated
cg666/openr1-Qwen-2.5-Base-7B-gen8-math3to5-grpo-epoch3-test
Updated
cg666/openr1-Qwen-2.5-Base-7B-gen8-math3to5-grpo-epoch3
Text Generation
• 8B • Updated • 3
cg666/openr1-Qwen-2.5-Base-7B-gen8-math3to5-ghpo-hint0.5-epoch3-v3
Updated
cg666/openr1-Qwen-2.5-Base-7B-gen8-math3to5-ghpo-hint0.5-epoch3-v2
Updated
cg666/openr1-Qwen-2.5-Base-7B-gen8-math3to5-ghpo-hint0.5-epoch3-new
Updated
cg666/openr1-Qwen-2.5-Base-7B-gen8-math3to5-ghpo-hint0.5-epoch3
Updated
cg666/openr1-Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-hint0.7-epoch3
8B • Updated • 1
cg666/openr1-Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-epoch3
Text Generation
• 8B • Updated • 2
cg666/openr1-Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-grpo-epoch3
Text Generation
• 8B • Updated • 1
cg666/openr1-Qwen-2.5-Base-3B-gen8-scale-NuminaMath-TIR-100-ghpo-beta0-epoch2-test
Updated
cg666/openr1-Qwen-2.5-Base-3B-gen8-scale-NuminaMath-TIR-100-grpo-beta0-epoch2-2
Updated