Yuhan123/ppo-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.755
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-full-question-7th-1-steps-10000-epoch-999-best-eval-score-0.402
Text Generation
• 3B • Updated • 3
Yuhan123/ppo-cn-RM-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.425
Text Generation
• 3B • Updated • 2
Yuhan123/ppo-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.592
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.862
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.381
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-full-question-7th-1-steps-10000-epoch-999-best-eval-score-0.256
Text Generation
• 3B • Updated
Yuhan123/ppo-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.718
Text Generation
• 3B • Updated
Yuhan123/ppo-cn-RM-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.120
Text Generation
• 3B • Updated
Yuhan123/ppo-cn-RM-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.428
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.356
Text Generation
• 3B • Updated • 2
Yuhan123/ppo-cn-RM-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.580
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-12th-1-steps-10000-epoch-999-best-eval-score-0.309
Text Generation
• 3B • Updated • 2
Yuhan123/ppo-reading-level-grad-1-steps-100002025-04-17-07-38-46-epoch-999-eval-score-0.366
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-7th-1-steps-10000-epoch-999-best-eval-score-0.361
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-reading-level-full-question-7th-1-steps-10000-epoch-999-best-eval-score-0.318
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-cn-RM-reading-level-grad-1-steps-10000-epoch-999-best-eval-score-0.445
Text Generation
• 3B • Updated • 1
Yuhan123/ppo-cn-RM-reading-level-preschool-1-steps-10000-epoch-999-best-eval-score-0.331
Text Generation
• 3B • Updated • 1
Yuhan123/mistral-7b-instruct-baseline_var_1
Text Generation
• 7B • Updated • 2
Yuhan123/mistral-7b-semantics-multiturn
Text Generation
• 7B • Updated • 2
Yuhan123/mistral-7b-instruct-semantics_var_1
Text Generation
• 7B • Updated • 2
Yuhan123/qwen-1.5-4b-kto-sft
Text Generation
• 4B • Updated • 1
Yuhan123/qwen-1.5-4b-kto-sft-wildchat
Text Generation
• 4B • Updated
Text Generation
• 7B • Updated
Text Generation
• 7B • Updated
Yuhan123/sft-llava-1.5-7b-hf
Updated
Yuhan123/mistral-7b-kto-wildchat
Text Generation
• 7B • Updated
Yuhan123/vicuna-7b-kto-wildchat
Text Generation
• 7B • Updated • 1
Yuhan123/qwen-1.5-4b-wildchat-semantics_var_3
Text Generation
• 4B • Updated
Yuhan123/vicuna-7b-kto-sft-wildchat
Text Generation
• 7B • Updated • 1