MoeReward/combined_preference_dataset_qwen2.5_sft_alpaca_heavy
• 10k • 6
MoeReward/combined_preference_dataset_qwen2.5_sft_qa_heavy
• 9.23k • 6
MoeReward/combined_preference_dataset_qwen2.5_sft_coding_heavy
• 10k • 5
MoeReward/combined_preference_dataset_qwen2.5_sft_math_heavy
• 10k • 6
MoeReward/combined_preference_dataset_qwen2.5_sft_equal_dist
• 10k • 6
MoeReward/combined_preference_dataset_qwen2.5_sft
• 81.3k • 6
MoeReward/combined_preference_dataset_olmoe_sft
• 61.7k • 6
MoeReward/combined_preference_dataset_olmoe_base
• 66.7k • 6
MoeReward/combined_preference_dataset_olmoe_base_alpaca_heavy
• 10k • 5
MoeReward/combined_preference_dataset_olmoe_base_qa_heavy
• 9.23k • 5
MoeReward/combined_preference_dataset_olmoe_base_coding_heavy
• 9.92k • 6
MoeReward/combined_preference_dataset_olmoe_base_math_heavy
• 10k • 10
MoeReward/combined_preference_dataset_olmoe_base_equal_dist
• 10k • 6
MoeReward/combined_preference_dataset_qwen1.5_base_alpaca_heavy
• 10k • 6
MoeReward/combined_preference_dataset_qwen1.5_base_qa_heavy
• 9.23k • 10
MoeReward/combined_preference_dataset_qwen1.5_base_coding_heavy
• 10k • 6
MoeReward/combined_preference_dataset_qwen1.5_base_math_heavy
• 10k • 6
MoeReward/combined_preference_dataset_qwen1.5_base_equal_dist
• 6
MoeReward/combined_preference_dataset_qwen1.5_base
• 61.9k • 6
MoeReward/combined_preference_dataset_qwen
• 50k • 6
MoeReward/combined_preference_dataset_olmoe
• 56.6k • 6
MoeReward/combined_sft_dataset
• 115k • 5
MoeReward/combined_preference_dataset
• 52k • 6
MoeReward/combined_rlhf_dataset
• 125k • 7