Yuhan123/ppo-synthetic-one-language-2025-04-02-10-19-12
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-512-lr-1e-6-2025-04-09-20-56-34
Text Generation
• 3B • Updated
Yuhan123/ppo-5000-lr-1e-6-2025-04-02-22-00-58
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.566
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.587
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-lr-1e-6-2025-04-02-19-15-25
Text Generation
• 3B • Updated
Yuhan123/ppo-cn-RM-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.388
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.858
Text Generation
• 3B • Updated
Yuhan123/ppo-perplexity-debug-run-128-lr-1e-6-2025-04-08-23-39-45
Text Generation
• 3B • Updated
Yuhan123/ppo-1-lr-1e-6-2025-04-15-19-03-10
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-full-question-7th-1-steps-10000-epoch-999-best-eval-score-0.403
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.642
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-full-question-12th-1-steps-10000-epoch-999-best-eval-score-0.332
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-full-question-12th-1-steps-10000-epoch-999-best-eval-score-0.396
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-full-question-12th-1-steps-10000-epoch-999-best-eval-score-0.415
Text Generation
• 3B • Updated • 2
Yuhan123/ppo-reading-level-7th-1-steps-100002025-04-17-07-49-57-epoch-999-eval-score-0.526
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.526
Text Generation
• 3B • Updated • 2
Yuhan123/ppo-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.481
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.606
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.378
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.441
Text Generation
• 3B • Updated
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.411
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.501
Text Generation
• 3B • Updated
Yuhan123/ppo-synthetic-one-language-after-sft-lr-1e-6-2025-04-02-17-00-00
Text Generation
• 3B • Updated
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.135
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-cn-RM-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.433
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.472
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.818
Text Generation
• 3B • Updated
Yuhan123/sft-synthetic-one-language-2025-04-01-22-15-18
Text Generation
• 3B • Updated • 1
Yuhan123/sft-synthetic-one-language-2025-04-01-19-15-31
Text Generation
• 3B • Updated • 2