AI & ML interests
None yet
Organizations
None yet
Yuhan123/olmo-cad-rm-cad-maj-vote-eval-acc-0-9065-cad-rm-cad-maj-vote-eval-acc-0-9065-1-steps-20000
Text Generation
• 1B • Updated • 2
Yuhan123/olmo-cad-checkpoint-460-cad-rm-cad-labels-0-eval-acc-0-8385-checkpoint-460-1-steps-20000
Text Generation
• 1B • Updated • 6
Yuhan123/olmo-cad-checkpoint-360-cad-rm-cad-labels-1-eval-acc-0-8354-checkpoint-360-1-steps-20000
Text Generation
• 1B • Updated • 7
Yuhan123/rm_cad_maj_vote_eval_acc_0_9065
Text Classification
• 1B • Updated • 2
Yuhan123/olmo-multipref-ppo-acc-0.6950
1B • Updated Yuhan123/olmo-multipref-reward-model
1B • Updated • 1
Yuhan123/multipref-reward-model-qwen
Text Classification
• 2B • Updated • 1
Yuhan123/multipref-reward-model-qwen-single
Updated
Yuhan123/reading-level-pairwise-reward-chosen-12th-grade-rejected-preschool-1-steps-1000
Text Generation
• 1B • Updated • 2
Yuhan123/reading-level-pairwise-reward-chosen-gradschool-rejected-7th-grade-1-steps-1000
Text Generation
• 1B • Updated • 1
Yuhan123/reading-level-pairwise-reward-chosen-preschool-rejected-12th-grade-1-steps-1000
Text Generation
• 1B • Updated • 1
Yuhan123/reading-level-pairwise-reward-chosen-12th-grade-rejected-7th-grade-1-steps-1000
Text Generation
• 1B • Updated • 1
Yuhan123/reading-level-pairwise-reward-chosen-7th-grade-rejected-12th-grade-1-steps-1000
Text Generation
• 1B • Updated • 1
Yuhan123/reading-level-pairwise-reward-chosen-preschool-rejected-7th-grade-1-steps-1000
Text Generation
• 1B • Updated • 1
Yuhan123/reading-level-pairwise-reward-chosen-gradschool-rejected-12th-grade-1-steps-1000
Text Generation
• 1B • Updated • 1
Yuhan123/reading-level-pairwise-reward-chosen-7th-grade-rejected-gradschool-1-steps-1000
Text Generation
• 1B • Updated • 1
Yuhan123/reading-level-pairwise-reward-chosen-7th-grade-rejected-preschool-1-steps-1000
Text Generation
• 1B • Updated • 1
Yuhan123/reading-level-pairwise-reward-chosen-gradschool-rejected-preschool-1-steps-1000
Text Generation
• 1B • Updated • 2
Yuhan123/reading-level-pairwise-reward-chosen-12th-grade-rejected-gradschool-1-steps-1000
Text Generation
• 1B • Updated • 1
Yuhan123/reading-level-pairwise-reward-chosen-preschool-rejected-gradschool-1-steps-1000
Text Generation
• 1B • Updated • 2
Yuhan123/ppo-perplexity-olmo-debug-run-1-lr-1e-6-2025-06-04-00-13-11
Updated
Yuhan123/ppo-perplexity-debug-run-128-lr-1e-6-2025-06-03-18-01-10
Updated
Yuhan123/ppo-perplexity-debug-run-128-lr-1e-6-2025-06-03-16-58-26
Updated
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.316
Text Generation
• 3B • Updated • 3
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.229
Text Generation
• 3B • Updated • 4
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.340
Text Generation
• 3B • Updated • 4
Yuhan123/ppo-cn-RM-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.309
Text Generation
• 3B • Updated • 5
Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.361
Text Generation
• 3B • Updated • 2
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.383
Text Generation
• 3B • Updated • 4
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.398
Text Generation
• 3B • Updated • 1