kureha295/DeepSeek-R1-Distill-Qwen-7B_scored_combined_datasets_rollout_s1 Viewer • Updated 26 days ago • 47.9k • 21
kureha295/DeepSeek-R1-Distill-Llama-8B_scored_combined_datasets_rollout_s1 Viewer • Updated 26 days ago • 48.7k • 18
kureha295/DeepSeek-R1-Distill-Qwen-7B_combined_datasets_rollout_s1 Viewer • Updated 29 days ago • 47.9k • 16
kureha295/DeepSeek-R1-Distill-Llama-8B_combined_datasets_rollout_s1 Viewer • Updated 29 days ago • 48.7k • 15
kureha295/gpt-oss-20b_scored_orbench_extra_prompts_cot5_out5 Viewer • Updated about 1 month ago • 12.5k • 15
kureha295/Qwen3-8B_scored_orbench_extra_prompts_cot5_out5 Viewer • Updated about 1 month ago • 12.3k • 11
kureha295/DeepSeek-R1-Distill-Qwen-7B_scored_orbench_extra_prompts_cot5_out5 Viewer • Updated about 1 month ago • 12.4k • 11
kureha295/DeepSeek-R1-Distill-Llama-8B_scored_orbench_extra_prompts_cot5_out5 Viewer • Updated about 1 month ago • 12.5k • 11
kureha295/DeepSeek-R1-Distill-Qwen-7B_scored_train_harmful_prompts_cot5_out5 Viewer • Updated Mar 5 • 35.6k • 13
kureha295/DeepSeek-R1-Distill-Llama-8B_scored_train_harmful_prompts_cot5_out5 Viewer • Updated Mar 5 • 36.2k • 19
kureha295/DeepSeek-R1-Distill-Qwen-7B_orbench_extra_prompts_cot5_out5 Viewer • Updated Mar 4 • 12.4k • 10
kureha295/DeepSeek-R1-Distill-Llama-8B_orbench_extra_prompts_cot5_out5 Viewer • Updated Mar 4 • 12.5k • 10
kureha295/DeepSeek-R1-Distill-Llama-8B_train_harmless_prompts_cot5_out5 Viewer • Updated Mar 4 • 28.7k • 8
kureha295/DeepSeek-R1-Distill-Llama-8B_test_harmful_prompts_cot5_out5 Viewer • Updated Mar 4 • 12.1k • 9
kureha295/DeepSeek-R1-Distill-Llama-8B_test_harmless_prompts_cot5_out5 Viewer • Updated Mar 4 • 6.11k • 9
kureha295/DeepSeek-R1-Distill-Qwen-7B_test_harmless_prompts_cot5_out5 Viewer • Updated Mar 4 • 6.03k • 11
kureha295/DeepSeek-R1-Distill-Qwen-7B_train_harmless_prompts_cot5_out5 Viewer • Updated Mar 4 • 28.4k • 7