AI & ML interests
None defined yet.
Recent Activity
ScaleML-RLHF/Qwen2.5-Math-1.5B-raft-plusplus-numina_math_em-sample1n16-sample16-iter2
2B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-1.5B-raft-plusplus-numina_math_em-sample1n16-sample16-iter1
2B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step140
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step130
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step120
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step110
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step100
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step90
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step80
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step70
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step60
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step50
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step40
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step30
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step20
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-ppo-plusplus-numina_math_15_all-n1-step10
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step140
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step130
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step120
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step110
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step100
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step90
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step80
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step70
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step60
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step50
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step40
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step30
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step20
8B • Updated • 1
ScaleML-RLHF/Qwen2.5-Math-7B-grpo-plusplus-numina_math_15_all-n4-step10
8B • Updated • 1