·
AI & ML interests
None yet
Organizations
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_75
4B • Updated • 2
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_30
4B • Updated • 2
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_55
4B • Updated • 2
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_5
4B • Updated • 2
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_85
4B • Updated • 2
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_35
4B • Updated • 2
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_15
4B • Updated • 2
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_60
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-rm-loophole-global_step_80
4B • Updated • 2
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-no-global_step_15
4B • Updated • 1
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-direct-global_step_40
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-direct-global_step_5
4B • Updated • 1
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-direct-global_step_45
4B • Updated • 1
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-direct-global_step_55
4B • Updated • 1
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-direct-global_step_30
4B • Updated • 1
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-direct-global_step_25
4B • Updated • 1
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-direct-global_step_15
4B • Updated • 1
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-direct-global_step_50
4B • Updated • 1
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-direct-global_step_10
4B • Updated • 1
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-direct-global_step_20
4B • Updated • 1
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-direct-global_step_35
4B • Updated • 1
xinpeng/big-math-hard-tiny-llama-3.2-3b-instruct-og-rloo-implicit-cheat-direct-global_step_60
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-direct-mixed-global_step_60
3B • Updated • 1
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-direct-mixed-global_step_45
3B • Updated • 1
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-direct-mixed-global_step_50
3B • Updated • 1
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-direct-mixed-global_step_55
3B • Updated • 1
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-direct-mixed-global_step_40
3B • Updated • 1
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-direct-mixed-global_step_35
3B • Updated • 1
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-direct-mixed-global_step_5
3B • Updated • 1
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-direct-mixed-global_step_10
3B • Updated • 1