·
AI & ML interests
None yet
Organizations
august66/drpo_ultrafeedback_qwen2.5-1.5b_first_iter_20k
Viewer
• Updated • 20k • 1
august66/drpo_ultrafeedback_qwen2.5-1.5b-7
Viewer
• Updated • 2.5k • 2
august66/drpo_ultrafeedback_qwen2.5-1.5b-6
Viewer
• Updated • 2.5k • 2
august66/drpo_ultrafeedback_qwen2.5-1.5b-5
Viewer
• Updated • 1.5k • 9
august66/drpo_ultrafeedback_qwen2.5-1.5b-4
Viewer
• Updated • 1k • 2
august66/drpo_ultrafeedback_qwen2.5-1.5b-3
Viewer
• Updated • 2.5k • 2
august66/drpo_ultrafeedback_qwen2.5-1.5b-2
Viewer
• Updated • 5k • 2
august66/drpo_ultrafeedback_qwen2.5-1.5b-1
Viewer
• Updated • 5k • 6
august66/drpo_ultrafeedback_qwen2.5-1.5b
Viewer
• Updated • 30 • 2
august66/DRPO_data_from_ultrafeed_new_template
Viewer
• Updated • 64k • 3
august66/DRPO_data_from_ultrafeed
Viewer
• Updated • 64k • 2
august66/DRPO_first_iter_completion_label_test
Viewer
• Updated • 200 • 6
Viewer
• Updated • 20k • 5
Viewer
• Updated • 25k • 4
august66/reward_data_for_dpo_train
Viewer
• Updated • 25k • 3