AI & ML interests
None yet
Organizations
None yet
Yuhan123/ppo-reading-level-preschool-1-steps-100002025-04-17-06-57-16-epoch-999-eval-score-0.035
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-full-question-12th-1-steps-10000-epoch-999-best-eval-score-0.317
Text Generation
• 3B • Updated • 2
Yuhan123/ppo-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.526
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-full-question-12th-1-steps-10000-epoch-999-best-eval-score-0.405
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-cn-RM-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.634
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.336
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.918
Text Generation
• 3B • Updated Yuhan123/ppo-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.445
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-cn-RM-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.154
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-cn-RM-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.514
Text Generation
• 3B • Updated • 2
Yuhan123/ppo-cn-RM-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.182
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.305
Text Generation
• 3B • Updated • 2
Yuhan123/ppo-cn-RM-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.410
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.368
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-full-question-12th-1-steps-10000-epoch-999-best-eval-score-0.254
Text Generation
• 3B • Updated • 2
Yuhan123/ppo-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.362
Text Generation
• 3B • Updated • 2
Yuhan123/sft-synthetic-one-language-100-2025-04-02-19-20-44
Text Generation
• 3B • Updated Yuhan123/ppo-reading-level-full-question-grad-1-steps-10000-epoch-999-best-eval-score-0.247
Text Generation
• 3B • Updated Yuhan123/ppo-reading-level-7th-grade-512-steps-1000-epoch-511-best-eval-score-0.667
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-synthetic-one-language-after-sft-lr-1e-6-2025-04-02-18-43-52
Text Generation
• 3B • Updated Yuhan123/ppo-cn-RM-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.522
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.356
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.221
Text Generation
• 3B • Updated Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.336
Text Generation
• 3B • Updated Yuhan123/ppo-reading-level-full-question-7th-1-steps-10000-epoch-999-best-eval-score-0.362
Text Generation
• 3B • Updated Yuhan123/ppo-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.512
Text Generation
• 3B • Updated • 2
Yuhan123/ppo-reading-level-full-question-12th-1-steps-10000-epoch-999-best-eval-score-0.257
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-cn-RM-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.557
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-full-question-grad-1-steps-10000-epoch-999-best-eval-score-0.203
Text Generation
• 3B • Updated Yuhan123/ppo-cn-RM-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.132
Text Generation
• 3B • Updated