Models 239
gohsyi/Mistral-7B-Instruct-v0.3-ppo4-rwt2.0-math-4shot-epoch3-critic
7B • Updated • 1
gohsyi/Mistral-7B-Instruct-v0.3-ppo4-rwt2.0-math-4shot-epoch3
7B • Updated • 1
gohsyi/Mistral-7B-Instruct-v0.3-ppo-math-4shot-epoch3-critic
7B • Updated • 2
gohsyi/Mistral-7B-Instruct-v0.3-ppo-math-4shot-epoch3
7B • Updated • 2
gohsyi/Mistral-7B-Instruct-v0.3-ppo4-rwt2.0-math-4shot-epoch2-critic
7B • Updated • 1
gohsyi/Mistral-7B-Instruct-v0.3-ppo4-rwt2.0-math-4shot-epoch2
7B • Updated • 1
gohsyi/Mistral-7B-Instruct-v0.3-ppo-math-4shot-epoch2-critic
7B • Updated • 3
gohsyi/Mistral-7B-Instruct-v0.3-ppo-math-4shot-epoch2
7B • Updated • 3
gohsyi/Llama-3.1-8B-Instruct-ppo-math-4shot-epoch3-critic
8B • Updated • 2
gohsyi/Llama-3.1-8B-Instruct-ppo-math-4shot-epoch3
8B • Updated • 2
Datasets 33
gohsyi/Mistral-7B-Instruct-v0.3-gsm8k-4shot.jsonl
Viewer
• Updated • 29.9k • 6
gohsyi/gemma-1.1-7b-it-gsm8k-4shot.jsonl
Viewer
• Updated • 29.9k • 7
gohsyi/samples_gsm8k_cot_2024-12-04T19-03-11.038885.jsonl
Viewer
• Updated • 2.64k • 3
gohsyi/meta-llama__Llama-3.1-8B-Instruct
Preview
• Updated • 4
Viewer
• Updated • 8.79k • 8
Viewer
• Updated • 12.5k • 7
gohsyi/Llama-3.2-3B-Instruct-ultrafeedback-4k-overweight
Viewer
• Updated • 4.1k • 6
gohsyi/Llama-3.2-3B-Instruct-ultrafeedback-4k-underweight
Viewer
• Updated • 4.1k • 15
gohsyi/Llama-3.1-8B-Instruct-ultrafeedback-4k-overweight
Viewer
• Updated • 4.1k • 3
gohsyi/Llama-3.1-8B-Instruct-ultrafeedback-4k-underweight
Viewer
• Updated • 4.1k • 5