xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_50
8B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_70
8B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_75
8B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_5
8B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_30
8B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_10
8B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_35
8B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_20
8B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_55
8B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_80
8B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_60
8B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_40
8B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_110
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-no-global_step_45
3B • Updated
• 68
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-no-global_step_45
8B • Updated
xinpeng/big-math-hard-tiny-llama-3.2-3b-ins-og-rloo-implicit-cheat-no-global_step_60
4B • Updated
xinpeng/big-math-hard-tiny-llama-3.2-3b-ins-og-rloo-implicit-cheat-no-global_step_145
4B • Updated
xinpeng/big-math-hard-tiny-llama-3.2-3b-ins-60-base-og-rloo-implicit-cheat-rm-global_step_145
4B • Updated
xinpeng/big-math-hard-tiny-llama-3.2-3b-ins-60-base-og-rloo-implicit-cheat-direct-global_step_145
4B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_85
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_70
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_20
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_55
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_40
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_30
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_5
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_80
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_15
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_65
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_25
3B • Updated