simonycl/gsm8k_training_negative_chain_of_thought_1k_gpt-4.1_transformed Viewer • Updated Sep 11, 2025 • 1.93k • 3
simonycl/gsm8k_training_negative_multi_turn_1k_gpt-4.1_transformed Viewer • Updated Sep 11, 2025 • 1.75k • 3
simonycl/gsm8k_training_positive_direct_multi_turn_1k_transformed Viewer • Updated Sep 11, 2025 • 1k • 3
simonycl/gsm8k_training_negative_combined_1k_gemini-2.5-flash_transformed Viewer • Updated Sep 11, 2025 • 1.76k • 3
simonycl/gsm8k_training_negative_vs_standard_1k_gemini-2.5-flash_transformed Viewer • Updated Sep 11, 2025 • 1.7k • 3
simonycl/gsm8k_training_negative_sequence_1k_gemini-2.5-flash_transformed Viewer • Updated Sep 11, 2025 • 1.78k • 3
simonycl/gsm8k_training_negative_direct_1k_gemini-2.5-flash_transformed Viewer • Updated Sep 11, 2025 • 1.49k • 3
simonycl/gsm8k_training_negative_combined_1k_gpt-4.1_transformed Viewer • Updated Sep 11, 2025 • 1.92k • 3
simonycl/gsm8k_training_negative_vs_standard_1k_gpt-4.1_transformed Viewer • Updated Sep 11, 2025 • 1.93k • 3
simonycl/gsm8k_training_negative_sequence_1k_gpt-4.1_transformed Viewer • Updated Sep 11, 2025 • 1.88k • 3
simonycl/gsm8k_training_negative_direct_1k_gpt-4.1_transformed Viewer • Updated Sep 11, 2025 • 1.65k • 2
simonycl/game-eval-Qwen-Qwen3-32B-vs-Qwen-Qwen3-32B-20250908-101728 Viewer • Updated Sep 8, 2025 • 11.5k • 3
simonycl/game-eval-Qwen-Qwen3-32B-vs-Qwen-Qwen3-32B-20250908-101654 Viewer • Updated Sep 8, 2025 • 5.72k • 3
simonycl/game-eval-Qwen-Qwen3-32B-vs-Qwen-Qwen3-32B-20250908-101501 Viewer • Updated Sep 8, 2025 • 2.3k • 3
simonycl/game-eval-Qwen-Qwen3-32B-vs-Qwen-Qwen3-32B-20250907-135811 Viewer • Updated Sep 7, 2025 • 46.1k • 3
simonycl/game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250829-232628 Viewer • Updated Aug 30, 2025 • 46.7k • 2