·
AI & ML interests
None yet
Organizations
None yet
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-gsm8k-4shot-epoch3
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-math-4shot-epoch2-critic
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-math-4shot-epoch2
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-v0.2-math-4shot-epoch3-critic
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-v0.2-math-4shot-epoch3
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-math-4shot-epoch1-critic
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-math-4shot-epoch1
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-v0.2-math-4shot-epoch2-critic
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-v0.2-math-4shot-epoch2
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot-epoch3-critic
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot-epoch3
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-v0.2-math-4shot-epoch1-critic
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-v0.2-math-4shot-epoch1
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-gsm8k-4shot-epoch2-critic
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-gsm8k-4shot-epoch2
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot-epoch2-critic
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot-epoch2
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-gsm8k-4shot-epoch1-critic
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-gsm8k-4shot-epoch1
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot-epoch1-critic
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot-epoch1
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo-math-4shot
Updated
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-math-4shot
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-math-4shot
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo-gsm8k-4shot
Updated
gohsyi/Llama-3.1-8B-Instruct-ppo4-rwt-gsm8k-4shot
8B • Updated gohsyi/Llama-3.1-8B-Instruct-ppo4-gsm8k-4shot
8B • Updated gohsyi/Meta-Llama-3.1-8B-Instruct-ppo4-ultrafeedback-v0.1-ckpt-step40
Text Generation
• 8B • Updated gohsyi/Meta-Llama-3.1-8B-Instruct-ppo4-ultrafeedback-v0.1-ckpt-step30
Text Generation
• 8B • Updated gohsyi/Meta-Llama-3.1-8B-Instruct-ppo4-ultrafeedback-v0.1-ckpt-step20
Text Generation
• 8B • Updated