·
AI & ML interests
None yet
Organizations
xinpeng/big-math-hard-tiny-qwen2.5-14b-instruct-og-rloo-implicit-cheat-direct-global_step_110
Updated
xinpeng/big-math-hard-tiny-qwen2.5-14b-instruct-og-rloo-implicit-cheat-direct-global_step_100
Updated
xinpeng/big-math-hard-tiny-qwen2.5-14b-instruct-og-rloo-implicit-cheat-direct-global_step_80
Updated
xinpeng/big-math-hard-tiny-qwen2.5-14b-instruct-og-rloo-implicit-cheat-direct-global_step_95
Updated
xinpeng/big-math-hard-tiny-qwen2.5-14b-instruct-og-rloo-implicit-cheat-direct-global_step_105
Updated
xinpeng/big-math-hard-tiny-qwen2.5-14b-instruct-og-rloo-implicit-cheat-direct-global_step_85
Updated
xinpeng/big-math-hard-tiny-qwen2.5-14b-instruct-og-rloo-implicit-cheat-direct-global_step_115
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_5
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_25
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_15
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_20
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_10
3B • Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_40
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_130
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_145
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_70
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_105
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_50
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_30
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_60
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_35
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_75
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_140
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_65
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_85
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_55
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_115
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_125
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_80
Updated
xinpeng/big-math-hard-tiny-qwen2.5-3b-instruct-og-grpo-implicit-cheat-direct-rerun_2-global_step_120
Updated