Liu

Yuhan123

·

AI & ML interests

None yet

Organizations

None yet

Yuhan123 's models 436

Yuhan123/olmo-cad-rm-cad-maj-vote-eval-acc-0-9065-cad-rm-cad-maj-vote-eval-acc-0-9065-1-steps-20000

Text Generation • 1B • Updated Dec 7, 2025 • 8

Yuhan123/olmo-cad-checkpoint-460-cad-rm-cad-labels-0-eval-acc-0-8385-checkpoint-460-1-steps-20000

Text Generation • 1B • Updated Dec 7, 2025 • 6

Yuhan123/olmo-cad-checkpoint-360-cad-rm-cad-labels-1-eval-acc-0-8354-checkpoint-360-1-steps-20000

Text Generation • 1B • Updated Dec 7, 2025 • 7

Yuhan123/rm_cad_maj_vote_eval_acc_0_9065

Text Classification • 1B • Updated Oct 24, 2025 • 6

Yuhan123/olmo-multipref-ppo-acc-0.6950

1B • Updated Sep 10, 2025 • 7

Yuhan123/olmo-multipref-reward-model

1B • Updated Aug 29, 2025 • 7

Yuhan123/multipref-reward-model-qwen

Text Classification • 2B • Updated Aug 27, 2025 • 4

Yuhan123/multipref-reward-model-qwen-single

Updated Aug 26, 2025

Yuhan123/reading-level-pairwise-reward-chosen-12th-grade-rejected-preschool-1-steps-1000

Text Generation • 1B • Updated Jul 17, 2025 • 8

Yuhan123/reading-level-pairwise-reward-chosen-gradschool-rejected-7th-grade-1-steps-1000

Text Generation • 1B • Updated Jul 17, 2025 • 8

Yuhan123/reading-level-pairwise-reward-chosen-preschool-rejected-12th-grade-1-steps-1000

Text Generation • 1B • Updated Jul 17, 2025 • 9

Yuhan123/reading-level-pairwise-reward-chosen-12th-grade-rejected-7th-grade-1-steps-1000

Text Generation • 1B • Updated Jul 17, 2025 • 8

Yuhan123/reading-level-pairwise-reward-chosen-7th-grade-rejected-12th-grade-1-steps-1000

Text Generation • 1B • Updated Jul 17, 2025 • 8

Yuhan123/reading-level-pairwise-reward-chosen-preschool-rejected-7th-grade-1-steps-1000

Text Generation • 1B • Updated Jul 17, 2025 • 8

Yuhan123/reading-level-pairwise-reward-chosen-gradschool-rejected-12th-grade-1-steps-1000

Text Generation • 1B • Updated Jul 17, 2025 • 8

Yuhan123/reading-level-pairwise-reward-chosen-7th-grade-rejected-gradschool-1-steps-1000

Text Generation • 1B • Updated Jul 17, 2025 • 8

Yuhan123/reading-level-pairwise-reward-chosen-7th-grade-rejected-preschool-1-steps-1000

Text Generation • 1B • Updated Jul 17, 2025 • 8

Yuhan123/reading-level-pairwise-reward-chosen-gradschool-rejected-preschool-1-steps-1000

Text Generation • 1B • Updated Jul 17, 2025 • 7

Yuhan123/reading-level-pairwise-reward-chosen-12th-grade-rejected-gradschool-1-steps-1000

Text Generation • 1B • Updated Jul 17, 2025 • 8

Yuhan123/reading-level-pairwise-reward-chosen-preschool-rejected-gradschool-1-steps-1000

Text Generation • 1B • Updated Jul 17, 2025 • 8

Yuhan123/ppo-perplexity-olmo-debug-run-1-lr-1e-6-2025-06-04-00-13-11

Updated Jun 5, 2025

Yuhan123/ppo-perplexity-debug-run-128-lr-1e-6-2025-06-03-18-01-10

Updated Jun 3, 2025

Yuhan123/ppo-perplexity-debug-run-128-lr-1e-6-2025-06-03-16-58-26

Updated Jun 3, 2025

Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.316

Text Generation • 3B • Updated May 27, 2025 • 5

Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.229

Text Generation • 3B • Updated May 27, 2025 • 4

Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.340

Text Generation • 3B • Updated May 27, 2025 • 5

Yuhan123/ppo-cn-RM-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.309

Text Generation • 3B • Updated May 27, 2025 • 3

Yuhan123/ppo-cn-RM-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.361

Text Generation • 3B • Updated May 27, 2025 • 3

Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.383

Text Generation • 3B • Updated May 27, 2025 • 5

Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.398

Text Generation • 3B • Updated May 27, 2025 • 4