reasoning-degeneration-dev/aime-2025-gepa-inputs-only-vanilla-final Viewer • Updated Feb 26 • 150 • 8
reasoning-degeneration-dev/rlm-codeqa-gpt-5-nano-20260225-224658__rlm_call_traces Viewer • Updated Feb 26 • 17 • 27
reasoning-degeneration-dev/t1-musr-prompt-enhancement-together_ai-meta-llama-meta-llama-3-1-8b Viewer • Updated Feb 24 • 200 • 2
reasoning-degeneration-dev/t1-musr-prompt-enhancement-together_ai-meta-llama-meta-llama-3-1-8b-s5-enhanced Viewer • Updated Feb 24 • 150 • 4
reasoning-degeneration-dev/t1-musr-prompt-enhancement-together_ai-meta-llama-meta-llama-3-1-8b-s4-enhanced-strategy Viewer • Updated Feb 24 • 1 • 3
reasoning-degeneration-dev/t1-musr-prompt-enhancement-together_ai-meta-llama-meta-llama-3-1-8b-s3-facts Viewer • Updated Feb 24 • 20 • 7
reasoning-degeneration-dev/t1-musr-prompt-enhancement-together_ai-meta-llama-meta-llama-3-1-8b-s2-natural-strategy Viewer • Updated Feb 24 • 1 • 3
reasoning-degeneration-dev/t1-musr-prompt-enhancement-together_ai-meta-llama-meta-llama-3-1-8b-s1-base Viewer • Updated Feb 24 • 50 • 5
reasoning-degeneration-dev/wingdings-swebench-verified-1.0-mini-swe-agent-Qwen3-Next-80B-A3B-Thinking-20260223 Viewer • Updated Feb 23 • 10 • 14
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-qwen-qwen3-next-80b-a3b-inst-67693bd3 Viewer • Updated Feb 23 • 10 • 4
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-moonshotai-kimi-k2-instruct-f7244bd4 Viewer • Updated Feb 23 • 10 • 4
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-baseline-together_ai-moonshotai-kimi-k2-thinking-b6fc7e16 Viewer • Updated Feb 22 • 10 • 6
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-moonshotai-kimi-k2-thinking-20637061 Viewer • Updated Feb 22 • 10 • 3
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-moonshotai-kimi-k2-thinking-429343f5 Viewer • Updated Feb 22 • 10 • 3
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-qwen-qwen3-next-80b-a3b-thin-fea1b895 Viewer • Updated Feb 22 • 10 • 5
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-moonshotai-kimi-k2-instruct-b54be491 Viewer • Updated Feb 22 • 10 • 5
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-baseline-together_ai-qwen-qwen3-next-80b-a3b-thin-e3fa99d5 Viewer • Updated Feb 22 • 10 • 4
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-qwen-qwen3-next-80b-a3b-thin-1d8d6196 Viewer • Updated Feb 22 • 10 • 4
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-backward_chaining-together_ai-qwen-qwen3-next-80b-a3b-inst-0898857d Viewer • Updated Feb 22 • 10 • 4
reasoning-degeneration-dev/t1-strategy-arena-frozenlake-baseline-together_ai-qwen-qwen3-next-80b-a3b-inst-97d2a5df Viewer • Updated Feb 22 • 10 • 3
reasoning-degeneration-dev/wingdings-swebench-verified-1.0-mini-swe-agent-Qwen3-Next-80B-A3B-Instruct-20260222 Viewer • Updated Feb 22 • 10 • 15